Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Raghuram Krishnapuram is active.

Publication


Featured researches published by Raghuram Krishnapuram.


analytics for noisy unstructured text data | 2008

Rule based synonyms for entity extraction from noisy text

Rema Ananthanarayanan; Vijil Chenthamarakshan; Prasad M. Deshpande; Raghuram Krishnapuram

Identification of named entities such as person, organization and product names from text is an important task in information extraction. In many domains, the same entity could be referred to in multiple ways due to variations introduced by different user groups, variations of spellings across regions or cultures, usage of abbreviations, typographical errors and other reasons associated with conventional usage. Identifying a piece of text as a mention of an entity in such noisy data is difficult, even if we have a dictionary of possible entities. Previous approaches treat the synonym problem as part entity disambiguation and use learning-based methods that use the context of the words to identify synonyms. In this paper, we show that existing domain knowledge, encoded as rules, can be used effectively to address the synonym problem to a considerable extent. This makes the disambiguation task simpler, without the need for much training data. We look at a subset of application scenarios in named entity extraction, categorize the possible variations in entity names, and define rules for each category. Using these rules, we generate synonyms for the canonical list and match these synonyms to the actual occurrence in the data sets. In particular, we describe the rule categories that we developed for several named entities and report the results of applying our technique of extracting named entities by generating synonyms for two different domains.


web age information management | 2012

WYSIWYE: An Algebra for Expressing Spatial and Textual Rules for Information Extraction

Vijil Chenthamarakshan; Ramakrishna Varadarajan; Prasad M. Deshpande; Raghuram Krishnapuram; Knut Stolze

The visual layout of a webpage can provide valuable clues for certain types of Information Extraction (IE) tasks. In traditional rule based IE frameworks, these layout cues are mapped to rules that operate on the HTML source of the webpages. In contrast, we have developed a framework in which the rules can be specified directly at the layout level. This has many advantages, since the higher level of abstraction leads to simpler extraction rules that are largely independent of the source code of the page, and, therefore, more robust. It can also enable specification of new types of rules that are not otherwise possible. To the best of our knowledge, there is no general framework that allows declarative specification of information extraction rules based on spatial layout. Our framework is complementary to traditional text based rules framework and allows a seamless combination of spatial layout based rules with traditional text based rules. We describe the algebra that enables such a system and its efficient implementation using standard relational and text indexing features of a relational database. We demonstrate the simplicity and efficiency of this system for a task involving the extraction of software system requirements from software product pages.


Archive | 2002

Method and apparatus for populating a predefined concept hierarchy or other hierarchical set of classified data items by minimizing system entrophy

Krishna Prasad Chitrapura; Raghuram Krishnapuram; Sachindra Joshi


IEEE Transactions on Intelligent Transportation Systems | 2012

Vehicular Traffic Density State Estimation Based on Cumulative Road Acoustics

Vivek Tyagi; Shivkumar Kalyanaraman; Raghuram Krishnapuram


conference on information and knowledge management | 2001

Mining generalised disjunctive association rules

Amit Anil Nanavati; Krishna Prasad Chitrapura; Sachindra Joshi; Raghuram Krishnapuram


Archive | 2001

Clustering data including those with asymmetric relationships

Krishna Kummamuru; Raghuram Krishnapuram; Pradeep Dubey


Archive | 2002

Personalized product recommendation

Jayanta Basak; Raghuram Krishnapuram


Archive | 2005

Web page preview without browsing to web page

Shourya Roy; Raghuram Krishnapuram


Archive | 2003

Determining structural similarity in semi-structured documents

Neeraj Agrawal; Sachindra Joshi; Raghuram Krishnapuram; Sumit Negi


Archive | 2004

Methods, apparatus and computer programs for characterizing web resources

Sachindra Joshi; Raghuram Krishnapuram; Shourya Roy

Researchain Logo
Decentralizing Knowledge