Is this you? Create Your Porfile

Naeemul Hassan

University of Texas at Arlington

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Naeemul Hassan is active.

Explore More

Publication

Featured researches published by Naeemul Hassan.

IEEE Transactions on Knowledge and Data Engineering | 2014

On Skyline Groups

Nan Zhang; Chengkai Li; Naeemul Hassan; Sundaresan Rajasekaran; Gautam Das

We formulate and investigate the novel problem of finding the skyline k-tuple groups from an n-tuple data set-i.e., groups of k tuples which are not dominated by any other group of equal size, based on aggregate-based group dominance relationship. The major technical challenge is to identify effective anti-monotonic properties for pruning the search space of skyline groups. To this end, we first show that the anti-monotonic property in the well-known Apriori algorithm does not hold for skyline group pruning. Then, we identify two anti-monotonic properties with varying degrees of applicability: order-specific property which applies to SUM, MIN, and MAX as well as weak candidate-generation property which applies to MIN and MAX only. Experimental results on both real and synthetic data sets verify that the proposed algorithms achieve orders of magnitude performance gain over the baseline method.

conference on information and knowledge management | 2012

On skyline groups

Chengkai Li; Nan Zhang; Naeemul Hassan; Sundaresan Rajasekaran; Gautam Das

very large data bases | 2014

Data in, fact out: automated monitoring of facts by FactWatcher

Naeemul Hassan; Afroza Sultana; You Wu; Gensheng Zhang; Chengkai Li; Jun Yang; Cong Yu

Towards computational journalism, we present FactWatcher, a system that helps journalists identify data-backed, attention-seizing facts which serve as leads to news stories. FactWatcher discovers three types of facts, including situational facts, one-of-the-few facts, and prominent streaks, through a unified suite of data model, algorithm framework, and fact ranking measure. Given an append-only database, upon the arrival of a new tuple, FactWatcher monitors if the tuple triggers any new facts. Its algorithms efficiently search for facts without exhaustively testing all possible ones. Furthermore, FactWatcher provides multiple features in striving for an end-to-end system, including fact ranking, fact-to-statement translation and keyword-based fact search.

knowledge discovery and data mining | 2017

Toward Automated Fact-Checking: Detecting Check-worthy Factual Claims by ClaimBuster

Naeemul Hassan; Fatma Arslan; Chengkai Li; Mark Tremayne

This paper introduces how ClaimBuster, a fact-checking platform, uses natural language processing and supervised learning to detect important factual claims in political discourses. The claim spotting model is built using a human-labeled dataset of check-worthy factual claims from the U.S. general election debate transcripts. The paper explains the architecture and the components of the system and the evaluation of the model. It presents a case study of how ClaimBuster live covers the 2016 U.S. presidential election debates and monitors social media and Australian Hansard for factual claims. It also describes the current status and the long-term goals of ClaimBuster as we keep developing and expanding it.

international conference on data engineering | 2014

Incremental discovery of prominent situational facts

Afroza Sultana; Naeemul Hassan; Chengkai Li; Jun Yang; Cong Yu

We study the novel problem of finding new, prominent situational facts, which are emerging statements about objects that stand out within certain contexts. Many such facts are newsworthy-e.g., an athletes outstanding performance in a game, or a viral videos impressive popularity. Effective and efficient identification of these facts assists journalists in reporting, one of the main goals of computational journalism. Technically, we consider an ever-growing table of objects with dimension and measure attributes. A situational fact is a “contextual” skyline tuple that stands out against historical tuples in a context, specified by a conjunctive constraint involving dimension attributes, when a set of measure attributes are compared. New tuples are constantly added to the table, reflecting events happening in the real world. Our goal is to discover constraint-measure pairs that qualify a new tuple as a contextual skyline tuple, and discover them quickly before the event becomes yesterdays news. A brute-force approach requires exhaustive comparison with every tuple, under every constraint, and in every measure subspace. We design algorithms in response to these challenges using three corresponding ideas-tuple reduction, constraint pruning, and sharing computation across measure subspaces. We also adopt a simple prominence measure to rank the discovered facts when they are numerous. Experiments over two real datasets validate the effectiveness and efficiency of our techniques.

sensors applications symposium | 2011

Design and deployment of a robust remote river level sensor network

Zamshed Iqbal Chowdhury; Masudul Haider Imtiaz; Muhammad Moinul Azam; Mst.Rumana Aktar Sumi; Md.Rakibur Rahman; Farzana Alam; Ishtiak Hussain; Naeemul Hassan

This paper evaluates an automated river level monitoring network. This network contains multiple nodes on which measuring modules are installed. These modules collect raw data and transmit them periodically to a central monitoring system. This monitoring system contains a database that processes the raw data and extracts information. Based on this information, various approximations are made such as water level rise rate, time remaining to exceed the critical level etc. The whole network is implemented as a prototype which yielded satisfactory result.

Computer Methods and Programs in Biomedicine | 2018

Classification of cancer cells using computational analysis of dynamic morphology

Mohammad Raziul Hasan; Naeemul Hassan; Rayan Khan; Young Tae Kim; Samir M. Iqbal

BACKGROUND AND OBJECTIVE Detection of metastatic tumor cells is important for early diagnosis and staging of cancer. However, such cells are exceedingly difficult to detect from blood or biopsy samples at the disease onset. It is reported that cancer cells, and especially metastatic tumor cells, show very distinctive morphological behavior compared to their healthy counterparts on aptamer functionalized substrates. The ability to quickly analyze the data and quantify the cell morphology for an instant real-time feedback can certainly contribute to early cancer diagnosis. A supervised machine learning approach is presented for identification and classification of cancer cell gestures for early diagnosis. METHODS We quantified the morphologically distinct behavior of metastatic cells and their healthy counterparts captured on aptamer-functionalized glass substrates from time-lapse optical micrographs. As a proof of concept, the morphologies of human glioblastoma (hGBM) and astrocyte cells were used. The cells were captured and imaged with an optical microscope. Multiple feature vectors were extracted to quantify and differentiate the complex physical gestures of cancerous and non-cancerous cells. Three different classifier models, Support Vector Machine (SVM), Random Forest Tree (RFT), and Naïve Bayes Classifier (NBC) were trained with the known dataset using machine learning algorithms. The performances of the classifiers were compared for accuracy, precision, and recall measurements using five-fold cross-validation technique. RESULTS All the classifier models detected the cancer cells with an average accuracy of at least 82%. The NBC performed the best among the three classifiers in terms of Precision (0.91), Recall (0.9), and F1-score (0.89) for the existing dataset. CONCLUSIONS This paper presents a standalone system built on machine learning techniques for cancer screening based on cell gestures. The system offers rapid, efficient, and novel identification of hGBM brain tumor cells and can be extended to define single cell analysis metrics for many other types of tumor cells.

advances in social networks analysis and mining | 2017

Diving Deep into Clickbaits: Who Use Them to What Extents in Which Topics with What Effects?

Main Uddin Rony; Naeemul Hassan; Mohammad Yousuf

The use of alluring headlines (clickbait) to tempt the readers has become a growing practice nowadays. For the sake of existence in the highly competitive media industry, most of the on-line media including the mainstream ones, have started following this practice. Although the wide-spread practice of clickbait makes the readers reliability on media vulnerable, a large scale analysis to reveal this fact is still absent. In this paper, we analyze 1.67 million Facebook posts created by 153 media organizations to understand the extent of clickbait practice, its impact and user engagement by using our own developed clickbait detection model. The model uses distributed sub-word embeddings learned from a large corpus. The accuracy of the model is 98.3%. Powered with this model, we further study the distribution of topics in clickbait and non-clickbait contents.

conference on information and knowledge management | 2014

Anything You Can Do, I Can Do Better: Finding Expert Teams by CrewScout

Naeemul Hassan; Huadong Feng; Ramesh Venkataraman; Gautam Das; Chengkai Li; Nan Zhang

CrewScout is an expert-team finding system based on the concept of skyline teams and efficient algorithms for finding such teams. Given a set of experts, CrewScout finds all k-expert skyline teams, which are not dominated by any other k-expert teams. The dominance between teams is governed by comparing their aggregated expertise vectors. The need for finding expert teams prevails in applications such as question answering, crowdsourcing, panel selection, and project team formation. The new contributions of this paper include an end-to-end system with an interactive user interface that assists users in choosing teams and an demonstration of its application domains.

conference on information and knowledge management | 2017

Regularized and Retrofitted models for Learning Sentence Representation with Context

Tanay Kumar Saha; Shafiq R. Joty; Naeemul Hassan; Mohammad Al Hasan

Vector representation of sentences is important for many text processing tasks that involve classifying, clustering, or ranking sentences. For solving these tasks, bag-of-word based representation has been used for a long time. In recent years, distributed representation of sentences learned by neural models from unlabeled data has been shown to outperform traditional bag-of-words representations. However, most existing methods belonging to the neural models consider only the content of a sentence, and disregard its relations with other sentences in the context. In this paper, we first characterize two types of contexts depending on their scope and utility. We then propose two approaches to incorporate contextual information into content-based models. We evaluate our sentence representation models in a setup, where context is available to infer sentence vectors. Experimental results demonstrate that our proposed models outshine existing models on three fundamental tasks, such as, classifying, clustering, and ranking sentences.

Explore More