In this paper we introduce a new method for detecting a type of English Multiword Expressions (MWEs), which is phrasal verbs, into an English-Arabic phrase-based statistical machine translation (PBSMT) system. The detection starts with parsing the English side of the parallel corpus, detecting various linguistic patterns for phrasal verbs and finally integrate them into the En-Ar PBSMT system. In addition, the paper explores the effect of cliticizing specific words in English that have no Arabic equivalent. The results, which reported with the BLEU scores, showed that some patterns achieved
This paper describes our participation in the SemEval-2016 task 5, Aspect Based Sentiment Analysis (ABSA). We participated in two slots in the sentence level ABSA (Subtask 1) namely: aspect category extraction (Slot 1) and sentiment polarity extraction (Slot 3) in English Restaurants and Laptops reviews. For Slot 1, we applied different models for each domain. In the restaurants domain, we used an ensemble classifier for each aspect which is a combination of a Convolutional Neural Network (CNN) classifier initialized with pretrained word vectors, and a Support Vector Machine (SVM) classifier
This paper presents a model that was developed to address SemEval Task 7: "Determining Sentiment Intensity of English and Arabic Phrases", with focus on 'Arabic Phrases'. The goal of this task is to determine the degree to which some given term is associated with positive sentiment. The underlying premise behind the model that we have adopted is that determining the context (positive or negative) in which a term usually occurs can determine its strength. Since the focus is on Twitter terms, Twitter was used to collect tweets for each term for which a strength value was to be derived. An Arabic
Optic Disc (OD) localization is a fundamental step in developing computer-assisted diagnostics. In this work, an ultrafast method to locate the OD in retinal fundus images is presented. The proposed method is based on transforming the localization problem into two 1D problems by projecting the image features onto two perpendicular directions. Image features such as the directionality of the retinal vessels, the brightness and the size of the OD have been used in the current method. Two publicly available databases were used to evaluate the accuracy and the computation time of the proposed
The human β-coronavirus SARS-CoV-2 epidemic started in late December 2019 in Wuhan, China. It causes Covid-19 disease which has become pandemic. Each of the five-known human β-coronaviruses has four major structural proteins (E, M, N and S) and 16 non-structural proteins encoded by ORF1a and ORF1b together (ORF1ab) that are involved in virus pathogenicity and infectivity. Here, we performed detailed positive selection analyses for those six genes among the four previously known human β-coronaviruses and within 38 SARS-CoV-2 genomes to assess signatures of adaptive evolution using maximum
Time series forecasting is a challenging problem, that has a wide variety of application domains such as in engineering, environment, finance and others. When confronted with a time series forecasting application, typically a number of different forecasting models are tested and the best one is considered. Alternatively, instead of choosing the single best method, a wiser action could be to choose a group of the best models and then to combine their forecasts. In this study we propose a combined model consisting of Multi-layer perceptron (MLP), Gaussian Processes Regression (GPR) and a
National Basketball Association (NBA) All Star Game is a demonstration game played between the selected Western and Eastern conference players. The selection of players for the NBA All Star game purely depends on votes. The fans and coaches vote for the players and decide who is going to make the All Star roster. A player who continues to receive enough votes in following years will play more All Star games. The selection of All Star players in NBA is subjective based on voting and there are no selection criteria that take out the human bias and opinion. Analyzing data from previous sports
Human Microbiome plays a critical role in health and the environment. Colorectal cancer (CRC) is the most common cause of death in many countries, and hence early diagnosis of CRC may help in increasing the survival rate. Tracking changes in the microbiome structure of human gut opens new gates towards the detection and prediction of the risk of CRC. Recently, machine learning became a powerful technique in many bioinformatics fields, one of which is metagenomics. Metagenomics is defined as the study of a collection of microbial genomes isolated directly and sequenced from its natural habitats
Nowadays, most service providers adopt Cloud Computing technology. Moving to Cloud creates new risks and challenges. The Cloud era is to outsource our services to Cloud Service Provider (CSP). However, we have to develop a strong governance framework to review the service level, to manage risk effectively and to certify that our critical information is secure. In this paper, we develop an innovative governance model. It is based on the theoretical Guo, Z., Song, M. and Song, J governance model for Cloud computing. We distribute Cloud Control Matrix (CCM) on the Guo's model categories. This
Most existing research in the area of emotions recognition has focused on short segments or utterances of speech. In this paper we propose a machine learning system for classifying the overall sentiment of long conversations as being Positive or Negative. Our system has three main phases, first it divides a call into short segments, second it applies machine learning to recognize the emotion for each segment, and finally it learns a binary classifier that takes the recognized emotions of individual segments as features. We investigate different approaches for this final phase by varying how