Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Fodé Camara is active.

Publication


Featured researches published by Fodé Camara.


Proceedings of the 4th International Conference on Frontiers of Educational Technologies | 2018

A Novel Term Weighting Scheme Model

Demba Kandé; Reine Marie Marone; Samba Ndiaye; Fodé Camara

The use of textual data has increased exponentially in recent years due to the networking infrastructure such as Facebook, Twitter, Wikipedia, Blogs, and so one. Analysis of this massive textual data can help to automatically categorize and label new content. Before classification process, term weighting scheme is the crucial step for representing the documents in a way suitable for classification algorithms. In this paper, we are conducting a survey on the term weighting schemes and we propose an efficient term weighting scheme that provide a better classification accuracy than those obtained with the famous TF-IDF, the recent IF-IGM and the others term weighting schemes in the literature.


Archive | 2018

A Parallelized Spark Based Version of mRMR

Reine Marie Marone; Fodé Camara; Samba Ndiaye

Nowadays, we are surrounded by enormous large-scale high dimensional data called big data and it is crucial to reduce the dimensionality of data for machine learning problems. That’s why feature selection plays a vital role in the process of machine learning because it aims to reduce high-dimensionality by removing irrelevant and redundant features from original data. However some characteristics of big data like data velocity, volume and data variety have brought new challenges in the field of feature selection. In fact, most of existing feature selection algorithms were designed for running on a single machine (centralized computing architecture) and do not scale well when dealing with big data. Their efficiency may significantly deteriorate to the point of becoming inapplicable. For this reason, there is an increasing need for scalable yet efficient feature selection methods. That’s why we present here a distributed and effective version of the mRMR (Max-Relevance and Min-Redundancy) algorithm to face real-world problems of data mining and evaluate the empirical performance of the proposed algorithms in selecting features in several public datasets. When we compared the efficiency and the scalability of our parallelized method in comparison with the centralized one we have found out that our parallelized method have given better results.


international conference on cloud computing | 2017

S-FPG: A parallel version of FP-Growth algorithm under Apache Spark™

Aissatou Diaby dite Gassama; Fodé Camara; Samba Ndiaye

Frequent Itemsets Mining (FIM) is an essential data mining task, with many real world applications such as market basket analysis, outlier detection, and so one. Many efficient single-node FIM algorithms such as the well-known FP-Growth algorithm have been proposed in the last two decades. However, as large-scale datasets are usually adopted nowadays, these algorithms become inefficient to mine frequent itemsets over big data. Scalable parallel algorithms hold the key to solving the problem in this context. However, the existing parallel versions of FP-Growth algorithm implemented with the disk-based MapReduce model are not efficient enough for iterative computation. In this paper, we propose an implementation of scalable parallel FP-Growth using the inmemory parallel computing framework Apache Spark™. Our experimental results demonstrated that the proposed algorithm can scale well and efficiently process large datasets.


advanced data mining and applications | 2009

A Secure Protocol to Maintain Data Privacy in Data Mining

Fodé Camara; Samba Ndiaye; Yahya Slimani

Recently, privacy issues have becomes important in data mining, especially when data is horizontally or vertically partitioned. For the vertically partitioned case, many data mining problems can be reduced to securely computing the scalar product. Among these problems, we can mention association rule mining over vertically partitioned data. Efficiency of a secure scalar product can be measured by the overhead of communication needed to ensure this security. Several solutions have been proposed for privacy preserving association rule mining in vertically partitioned data. But the main drawback of these solutions is the excessive overhead communication needed for ensuring data privacy. In this paper we propose a new secure scalar product with the aim to reduce the overhead communication.


Archive | 2012

A Novel RFE-SVM-based Feature Selection Approach for Classification

Mouhamadou Lamine Samb; Fodé Camara; Samba Ndiaye; Yahya Slimani; Mohamed Amir Esseghir; Cheikh Anta


Archive | 2014

Approche de sélection d’attributs pour la classification basée sur l’algorithme RFE-SVM

Yahya Slimani; Mohamed Amir Essegir; Mouhamadou Lamine Samb; Fodé Camara; Samba Ndiaye


soft computing | 2017

A large-scale filter method for feature selection based on spark

Reine Marie Marone; Fodé Camara; Samba Ndiaye


ieee international conference computer and communications | 2017

LSIS: Large scale instance selection algorithm for big data

Reine Marie Marone; Fodé Camara; Samba Ndiaye


Archive | 2012

Knowledge Discovery Results as a Threat to Privacy

Fodé Camara; Samba Ndiaye; Yahya Slimani


Archive | 2012

An Approach to Overcome Inference Channels on k-anonymous Data

Ousseynou Sané; Fodé Camara; Samba Ndiaye; Yahya Slimani; Cheikh Anta

Collaboration


Dive into the Fodé Camara's collaboration.

Top Co-Authors

Avatar

Samba Ndiaye

Cheikh Anta Diop University

View shared research outputs
Top Co-Authors

Avatar

Reine Marie Marone

Cheikh Anta Diop University

View shared research outputs
Top Co-Authors

Avatar

Yahya Slimani

Tunis El Manar University

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Demba Kandé

Cheikh Anta Diop University

View shared research outputs
Researchain Logo
Decentralizing Knowledge