Sanjaya Wijeratne
Wright State University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Sanjaya Wijeratne.
international conference on semantic systems | 2013
Kalpa Gunaratna; Krishnaprasad Thirunarayan; Prateek Jain; Amit P. Sheth; Sanjaya Wijeratne
Linked Open Data (LOD) cloud has gained significant attention in the Semantic Web community recently. Currently it consists of approximately 295 interlinked datasets with over 50 billion triples including 500 million links, and continues to expand in size. This vast source of structured information has the potential to have a significant impact on knowledge-based applications. However, a key impediment to the use of LOD cloud is limited support for data integration tasks over concepts, instances, and properties. Efforts to address this limitation over properties have focused on matching data-type properties across datasets; however, matching of object-type properties has not received similar attention. We present an approach that can automatically match object-type properties across linked datasets, primarily exploiting and bootstrapping from entity co-reference links such as owl:sameAs. Our evaluation, using sample instance sets taken from Freebase, DBpedia, LinkedMDB, and DBLP datasets covering multiple domains shows that our approach matches properties with high precision and recall (on average, F measure gain of 57% - 78%).
advances in social networks analysis and mining | 2016
Lakshika Balasuriya; Sanjaya Wijeratne; Derek Doran; Amit P. Sheth
Most street gang members use Twitter to intimidate others, to present outrageous images and statements to the world, and to share recent illegal activities. Their tweets may thus be useful to law enforcement agencies to discover clues about recent crimes or to anticipate ones that may occur. Finding these posts, however, requires a method to discover gang member Twitter profiles. This is a challenging task since gang members represent a very small population of the 320 million Twitter users. This paper studies the problem of automatically finding gang members on Twitter. It outlines a process to curate one of the largest sets of verifiable gang member profiles that have ever been studied. A review of these profiles establishes differences in the language, images, YouTube links, and emojis gang members use compared to the rest of the Twitter population. Features from this review are used to train a series of supervised classifiers. Our classifier achieves a promising F1 score with a low false positive rate.
arXiv: Computation and Language | 2017
Sanjaya Wijeratne; Lakshika Balasuriya; Amit P. Sheth; Derek Doran
Emoji have grown to become one of the most important forms of communication on the web. With its widespread use, measuring the similarity of emoji has become an important problem for contemporary text processing since it lies at the heart of sentiment analysis, search, and interface design tasks. This paper presents a comprehensive analysis of the semantic similarity of emoji through embedding models that are learned over machine-readable emoji meanings in the EmojiNet knowledge base. Using emoji descriptions, emoji sense labels and emoji sense definitions, and with different training corpora obtained from Twitter and Google News, we develop and test multiple embedding models to measure emoji similarity. To evaluate our work, we create a new dataset called EmoSim508, which assigns human-annotated semantic similarity scores to a set of 508 carefully selected emoji pairs. After validation with EmoSim508, we present a real-world use-case of our emoji embedding models using a sentiment analysis task and show that our models outperform the previous best-performing emoji embedding model on this task. The EmoSim508 dataset and our emoji embedding models are publicly released with this paper and can be downloaded from http://emojinet.knoesis.org/.
arXiv: Artificial Intelligence | 2017
Amit P. Sheth; Sujan Perera; Sanjaya Wijeratne; Krishnaprasad Thirunarayan
Machine Learning has been a big success story during the AI resurgence. One particular stand out success relates to learning from a massive amount of data. In spite of early assertions of the unreasonable effectiveness of data, there is increasing recognition for utilizing knowledge whenever it is available or can be created purposefully. In this paper, we discuss the indispensable role of knowledge for deeper understanding of content where (i) large amounts of training data are unavailable, (ii) the objects to be recognized are complex, (e.g., implicit entities and highly subjective content), and (iii) applications need to use complementary or related data in multiple modalities/media. What brings us to the cusp of rapid progress is our ability to (a) create relevant and reliable knowledge and (b) carefully exploit knowledge to enhance ML/NLP techniques. Using diverse examples, we seek to foretell unprecedented progress in our ability for deeper understanding and exploitation of multimodal data and continued incorporation of knowledge in learning techniques.
Drug and Alcohol Dependence | 2015
Raminta Daniulaityte; Ramzi W. Nahhas; Sanjaya Wijeratne; Robert G. Carlson; Francois R. Lamy; Silvia S. Martins; Edward W. Boyer; G. Alan Smith; Amit P. Sheth
social informatics | 2016
Sanjaya Wijeratne; Lakshika Balasuriya; Amit P. Sheth; Derek Doran
arXiv: Social and Information Networks | 2016
Sanjaya Wijeratne; Lakshika Balasuriya; Derek Doran; Amit P. Sheth
international conference on weblogs and social media | 2017
Sanjaya Wijeratne; Lakshika Balasuriya; Amit P. Sheth; Derek Doran
intelligence and security informatics | 2015
Sanjaya Wijeratne; Derek Doran; Amit P. Sheth; Jack L. Dustin
Drug and Alcohol Dependence | 2015
Raminta Daniulaityte; Robert G. Carlson; Farahnaz Golroo; Sanjaya Wijeratne; Edward W. Boyer; Silvia S. Martins; Ramzi W. Nahhas; Amit P. Sheth