Aly A. Fahmy | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Aly A. Fahmy is active.

Explore More

Publication

Featured researches published by Aly A. Fahmy.

International Journal of Computer Applications | 2013

A Survey of Text Similarity Approaches

Wael Hassan Gomaa; Aly A. Fahmy

ABSTRACT Measuring the similarity between words, sentences, paragraphs and documents is an important component in various tasks such as information retrieval, document clustering, word-sense disambiguation, automatic essay scoring, short answer grading, machine translation and text summarization. This survey discusses the existing works on text similarity through partitioning them into three approaches; String-based, Corpus-based and Knowledge-based similarities. Furthermore, samples of combination between these similarities are presented. General Terms Text Mining, Natural Language Processing. Keywords BasedText Similarity, Semantic Similarity, String-Based Similarity, Corpus-Based Similarity, Knowledge-Based Similarity. NeedlemanWunsch 1. INTRODUCTION Text similarity measures play an increasingly important role in text related research and applications in tasks Nsuch as information retrieval, text classification, document clustering, topic detection, topic tracking, questions generation, question answering, essay scoring, short answer scoring, machine translation, text summarization and others. Finding similarity between words is a fundamental part of text similarity which is then used as a primary stage for sentence, paragraph and document similarities. Words can be similar in two ways lexically and semantically. Words are similar lexically if they have a similar character sequence. Words are similar semantically if they have the same thing, are opposite of each other, used in the same way, used in the same context and one is a type of another. DistanceLexical similarity is introduced in this survey though different String-Based algorithms, Semantic similarity is introduced through Corpus-Based and Knowledge-Based algorithms. String-Based measures operate on string sequences and character composition. A string metric is a metric that measures similarity or dissimilarity (distance) between two text strings for approximate string matching or comparison. Corpus-Based similarity is a semantic similarity measure that determines the similarity between words according to information gained from large corpora. Knowledge-Based similarity is a semantic similarity measure that determines the degree of similarity between words using information derived from semantic networks. The most popular for each type will be presented briefly. This paper is organized as follows: Section two presents String-Based algorithms by partitioning them into two types character-based and term-based measures. Sections three and four introduce Corpus-Based and knowledge-Based algorithms respectively. Samples of combinations between similarity algorithms are introduced in section five and finally section six presents conclusion of the survey.

International journal of artificial intelligence | 2012

A machine Learning Approach for Opinion Holder Extraction in Arabic Language

Mohamed Elarnaoty; Samir E. AbdelRahman; Aly A. Fahmy

Opinion mining aims at extracting useful subjective information from reliable amounts of text. Opinion mining holder recognition is a task that has not been considered yet in Arabic Language. This task essentially requires deep understanding of clauses structures. Unfortunately, the lack of a robust, publicly available, Arabic parser further complicates the research. This paper presents a leading research for the opinion holder extraction in Arabic news independent from any lexical parsers. We investigate constructing a comprehensive feature set to compensate the lack of parsing structural outcomes. The proposed feature set is tuned from English previous works coupled with our proposed semantic field and named entities features. Our feature analysis is based on Conditional Random Fields (CRF) and semi-supervised pattern recognition techniques. Different research models are evaluated via cross-validation experiments achieving 54.03 F-measure. We publicly release our own research outcome corpus and lexicon for opinion mining community to encourage further research.

International Journal of Advanced Computer Science and Applications | 2012

Short Answer Grading Using String Similarity And Corpus-Based Similarity

Wael Hassan Gomaa; Aly A. Fahmy

Most automatic scoring systems use pattern based that requires a lot of hard and tedious work. These systems work in a supervised manner where predefined patterns and scoring rules are generated. This paper presents a different unsupervised approach which deals with students’ answers holistically using text to text similarity. Different String-based and Corpus-based similarity measures were tested separately and then combined to achieve a maximum correlation value of 0.504. The achieved correlation is the best value achieved for unsupervised approach Bag of Words (BOW) when compared to previous work.

intelligent systems design and applications | 2012

Genetic Algorithms for community detection in social networks

Ahmed Ibrahem Hafez; Neveen I. Ghali; Aboul Ella Hassanien; Aly A. Fahmy

Community detection in complex networks has attracted a lot of attention in recent years. Community detection can be viewed as an optimization problem, in which an objective function that captures the intuition of a community as a group of nodes with better internal connectivity than external connectivity is chosen to be optimized. Many single-objective optimization techniques have been used to solve the problem however those approaches have its drawbacks since they try optimizing one objective function and this results to a solution with a particular community structure property. More recently researchers viewed the problem as a multi-objective optimization problem and many approaches have been proposed to solve it. However which objective functions could be used with each other is still under debated since many objective functions have been proposed over the past years and in somehow most of them are similar in definition. In this paper we use Genetic Algorithm (GA) as an effective optimization technique to solve the community detection problem as a single-objective and multi-objective problem, we use the most popular objectives proposed over the past years, and we show how those objective correlate with each other, and their performances when they are used in the single-objective Genetic Algorithm and the Multi-Objective Genetic Algorithm and the community structure properties they tend to produce.

IBICA | 2014

Networks Community Detection Using Artificial Bee Colony Swarm Optimization

Ahmed Ibrahem Hafez; Hossam M. Zawbaa; Aboul Ella Hassanien; Aly A. Fahmy

Community structure identification in complex networks has been an important research topic in recent years. Community detection can be viewed as an optimization problem in which an objective quality function that captures the intuition of a community as a group of nodes with better internal connectivity than external connectivity is chosen to be optimized. In this work Artificial bee colony (ABC) optimization has been used as an effective optimization technique to solve the community detection problem with the advantage that the number of communities is automatically determined in the process. However, the algorithm performance is influenced directly by the quality function used in the optimization process. A comparison is conducted between different popular communities’ quality measures when used as an objective function within ABC. Experiments on real life networks show the capability of the ABC to successfully find an optimized community structure based on the quality function used.

ieee international conference on intelligent systems | 2015

Community Detection Algorithm Based on Artificial Fish Swarm Optimization

Eslam Ali Hassan; Ahmed Ibrahem Hafez; Aboul Ella Hassanien; Aly A. Fahmy

Community structure identification in complex networks has been an important research topic in recent years. Community detection can be viewed as an optimization problem in which an objective quality function that captures the intuition of a community as a group of nodes with better internal connectivity than external connectivity is chosen to be optimized. In this paper Artificial Fish Swarm optimization (AFSO) has been used as an effective optimization technique to solve the community detection problem with the advantage that the number of communities is automatically determined in the process. However, the algorithm performance is influenced directly by the quality function used in the optimization process. A comparison is conducted between different popular communities’ quality measures and other well-known methods. Experiments on real life networks show the capability of the AFSO to successfully find an optimized community structure based on the quality function used.

Computer Speech & Language | 2014

Automatic scoring for answers to Arabic test questions

Wael Hassan Gomaa; Aly A. Fahmy

Abstract Most research in the automatic assessment of free text answers written by students address English language. This paper handles the assessment task in Arabic language. This research focuses on applying multiple similarity measures separately and in combination. Many aspects are introduced that depend on translation to overcome the lack of text processing resources in Arabic, such as extracting model answers automatically from an already built database and applying K-means clustering to scale the obtained similarity values. Additionally, this research presents the first benchmark Arabic data set that contains 610 students’ short answers together with their English translations.

hybrid artificial intelligence systems | 2015

A Discrete Bat Algorithm for the Community Detection Problem

Eslam Ali Hassan; Ahmed Ibrahem Hafez; Aboul Ella Hassanien; Aly A. Fahmy

Community detection in networks has raised an important research topic in recent years. The problem of detecting communities can be modeled as an optimization problem where a quality objective function that captures the intuition of a community as a set of nodes with better internal connectivity than external connectivity is selected to be optimized. In this work the Bat algorithmwas used as an optimization algorithm to solve the community detection problem. Bat algorithm is a new Nature-inspired metaheuristic algorithm that proved its good performance in a variety of applications. However, the algorithm performance is influenced directly by the quality function used in the optimization process. Experiments on real life networks show the ability of the Bat algorithm to successfully discover an optimized community structure based on the quality function used and also demonstrate the limitations of the BA when applied to the community detection problem.

asian conference on pattern recognition | 2011

Using advanced Hidden Markov Models for online Arabic handwriting recognition

Ibrahim Hosny; Sherif M. Abdou; Aly A. Fahmy

Online handwriting recognition of Arabic script is a difficult problem since it is naturally both cursive and unconstrained. The analysis of Arabic script is further complicated due to obligatory dots/stokes that are placed above or below most letters and usually are written delayed in order. This paper introduces a Hidden Markov Model (HMM) based system to provide solutions for most of the difficulties inherent in recognizing Arabic script. A preprocessing for the delayed strokes to match the structure of the HMM model is introduced. The used HMM models are trained with Writer Adaptive Training (WAT) to minimize the variance between writers in the training data. Also the models discrimination power is enhanced with Discriminative training. The system performance is evaluated using an international test set from the ADAB completion and shows a promising performance compared with the state-of-art systems.

international symposium on signal processing and information technology | 2010

Deep Belief Network for clustering and classification of a continuous data

Mostafa A. Salama; Aboul Ella Hassanien; Aly A. Fahmy

Deep Belief Network (DBN) is a deep architecture that consists of a stack of Restricted Boltzmann Machines (RBM). The deep architecture has the benefit that each layer learns more complex features than layers before it. DBN and RBM could be used as a feature extraction method also used as neural network with initially learned weights. The approach proposed depends on DBN in clustering and classification of continuous input data without using back propagation in the DBN architecture. DBN should have a better a performance than the traditional neural network due the initialization of the connecting weights rather than just using random weights in NN. Each layer in DBN (RBM) depends on Contrastive Divergence method for input reconstruction which increases the performance of the network.

Explore More