Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Roxana Girju is active.

Publication


Featured researches published by Roxana Girju.


Computational Linguistics | 2006

Automatic Discovery of Part-Whole Relations

Roxana Girju; Adriana Badulescu; Dan I. Moldovan

An important problem in knowledge discovery from text is the automatic extraction of semantic relations. This paper presents a supervised, semantically intensive, domain independent approach for the automatic detection of part-whole relations in text. First an algorithm is described that identifies lexico-syntactic patterns that encode part-whole relations. A difficulty is that these patterns also encode other semantic relations, and a learning method is necessary to discriminate whether or not a pattern contains a part-whole relation. A large set of training examples have been annotated and fed into a specialized learning system that learns classification rules. The rules are learned through an iterative semantic specialization (ISS) method applied to noun phrase constituents. Classification rules have been generated this way for different patterns such as genitives, noun compounds, and noun phrases containing prepositional phrases to extract part-whole relations from them. The applicability of these rules has been tested on a test corpus obtaining an overall average precision of 80.95% and recall of 75.91%. The results demonstrate the importance of word sense disambiguation for this task. They also demonstrate that different lexico-syntactic patterns encode different semantic information and should be treated separately in the sense that different clarification rules apply to different patterns.


meeting of the association for computational linguistics | 2003

Automatic Detection of Causal Relations for Question Answering

Roxana Girju

Causation relations are a pervasive feature of human language. Despite this, the automatic acquisition of causal information in text has proved to be a difficult task in NLP. This paper provides a method for the automatic detection and extraction of causal relations. We also present an inductive learning approach to the automatic discovery of lexical and semantic constraints necessary in the disambiguation of causal relations that are then used in question answering. We devised a classification of causal questions and tested the procedure on a QA system.


meeting of the association for computational linguistics | 2007

SemEval-2007 Task 04: Classification of Semantic Relations between Nominals

Roxana Girju; Preslav Nakov; Vivi Nastase; Stan Szpakowicz; Peter D. Turney; Deniz Yuret

The NLP community has shown a renewed interest in deeper semantic analyses, among them automatic recognition of relations between pairs of words in a text. We present an evaluation task designed to provide a framework for comparing different approaches to classifying semantic relations between nominals in a sentence. This is part of SemEval, the 4th edition of the semantic evaluation event previously known as SensEval. We define the task, describe the training/test data and their creation, list the participating systems and discuss their results. There were 14 teams who submitted 15 systems.


Computer Speech & Language | 2005

On the semantics of noun compounds

Roxana Girju; Dan I. Moldovan; Marta Tatu; Daniel Antohe

This paper provides new insights on the semantic characteristics of two and three noun compounds. An analysis is performed using two sets of semantic classification categories: a list of 8 prepositional paraphrases previously proposed by Lauer [Designing statistical language learners: experiments on noun compounds, Ph.D. Thesis, Macquarie University, Australia] and a new set of 35 semantic relations introduced by us. We show the distribution of these semantic categories on a corpus of noun compounds and present several models for the bracketing and the semantic classification of noun compounds. The results are compared against state-of-the-art models reported in the literature.


north american chapter of the association for computational linguistics | 2004

Models for the semantic classification of noun phrases

Dan I. Moldovan; Adriana Badulescu; Marta Tatu; Daniel Antohe; Roxana Girju

This paper presents an approach for detecting semantic relations in noun phrases. A learning algorithm, called semantic scattering, is used to automatically label complex nominals, genitives and adjectival noun phrases with the corresponding semantic relation.


empirical methods in natural language processing | 2009

Cross-Cultural Analysis of Blogs and Forums with Mixed-Collection Topic Models

Michael J. Paul; Roxana Girju

This paper presents preliminary results on the detection of cultural differences from peoples experiences in various countries from two perspectives: tourists and locals. Our approach is to develop probabilistic models that would provide a good framework for such studies. Thus, we propose here a new model, ccLDA, which extends over the Latent Dirichlet Allocation (LDA) (Blei et al., 2003) and cross-collection mixture (ccMix) (Zhai et al., 2004) models on blogs and forums. We also provide a qualitative and quantitative analysis of the model on the cross-cultural data.


language resources and evaluation | 2009

Classification of semantic relations between nominals

Roxana Girju; Preslav Nakov; Vivi Nastase; Stan Szpakowicz; Peter D. Turney; Deniz Yuret

The NLP community has shown a renewed interest in deeper semantic analyses, among them automatic recognition of semantic relations in text. We present the development and evaluation of a semantic analysis task: automatic recognition of relations between pairs of nominals in a sentence. The task was part of SemEval-2007, the fourth edition of the semantic evaluation event previously known as SensEval. Apart from the observations we have made, the long-lasting effect of this task may be a framework for comparing approaches to the task. We introduce the problem of recognizing relations between nominals, and in particular the process of drafting and refining the definitions of the semantic relations. We show how we created the training and test data, list and briefly describe the 15 participating systems, discuss the results, and conclude with the lessons learned in the course of this exercise.


International Journal on Artificial Intelligence Tools | 2001

AN INTERACTIVE TOOL FOR THE RAPID DEVELOPMENT OF KNOWLEDGE BASES

Dan I. Moldovan; Roxana Girju

It is widely accepted that more knowledge means more intelligence. In many knowledge intensive applications, it is necessary to have extensive domain-specific knowledge in addition to general-purpo...


Information Processing and Management | 2010

A knowledge-rich approach to identifying semantic relations between nominals

Roxana Girju; Brandon Beamer; Alla Rozovskaya; Andrew Fister; Suma Bhat

This paper describes a state-of-the-art supervised, knowledge-intensive approach to the automatic identification of semantic relations between nominals in English sentences. The system employs a combination of rich and varied sets of new and previously used lexical, syntactic, and semantic features extracted from various knowledge sources such as WordNet and additional annotated corpora. The system ranked first at the third most popular SemEval 2007 Task -Classification of Semantic Relations between Nominals and achieved an F-measure of 72.4% and an accuracy of 76.3%. We also show that some semantic relations are better suited for WordNet-based models than other relations. Additionally, we make a distinction between out-of-context (regular) examples and those that require sentence context for relation identification and show that contextual data are important for the performance of a noun-noun semantic parser. Finally, learning curves show that the task difficulty varies across relations and that our learned WordNet-based representation is highly accurate so the performance results suggest the upper bound on what this representation can do.


international conference on computational linguistics | 2009

Using a Bigram Event Model to Predict Causal Potential

Brandon Beamer; Roxana Girju

This paper addresses the problem of causal knowledge discovery. Using online screenplays, we generate a corpus of temporally ordered events. We then introduce a measure we call causal potential which is easily calculated with statistics gathered over the corpus and show that this measure is highly correlated with an event pairs tendency of encoding a causal relation. We suggest that causal potential can be used in systems whose task is to determine the existence of causality between temporally adjacent events, when critical context is either missing or unreliable. Moreover, we argue that our model should therefore be used as a baseline for standard supervised models which take into account contextual information.

Collaboration


Dive into the Roxana Girju's collaboration.

Top Co-Authors

Avatar

Dan I. Moldovan

University of Texas at Dallas

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Sanda M. Harabagiu

University of Texas at Dallas

View shared research outputs
Top Co-Authors

Avatar

Adriana Badulescu

University of Texas at Dallas

View shared research outputs
Top Co-Authors

Avatar

Paul Morarescu

University of Texas at Dallas

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Daniel Antohe

University of Texas at Dallas

View shared research outputs
Top Co-Authors

Avatar

Marian Olteanu

University of Texas at Dallas

View shared research outputs
Top Co-Authors

Avatar

Orest Bolohan

University of Texas at Dallas

View shared research outputs
Researchain Logo
Decentralizing Knowledge