Alan Lee
University of Pennsylvania
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Alan Lee.
meeting of the association for computational linguistics | 2005
Nikhil Dinesh; Alan Lee; Eleni Miltsakaki; Rashmi Prasad; Aravind K. Joshi; Bonnie Webber
The annotations of the Penn Discourse Treebank (PDTB) include (1) discourse connectives and their arguments, and (2) attribution of each argument of each connective and of the relation it denotes. Because the PDTB covers the same text as the Penn TreeBank WSJ corpus, syntactic and discourse annotation can be compared. This has revealed significant differences between syntactic structure and discourse structure, in terms of the arguments of connectives, due in large part to attribution. We describe these differences, an algorithm for detecting them, and finally some experimental results. These results have implications for automating discourse annotation based on syntactic annotation.
international conference on computational linguistics | 2008
Eleni Miltsakaki; Livio Robaldo; Alan Lee; Aravind K. Joshi
An important aspect of discourse understanding and generation involves the recognition and processing of discourse relations. These are conveyed by discourse connectives, i.e., lexical items like because and as a result or implicit connectives expressing an inferred discourse relation. The Penn Discourse TreeBank (PDTB) provides annotations of the argument structure, attribution and semantics of discourse connectives. In this paper, we provide the rationale of the tagset, detailed descriptions of the senses with corpus examples, simple semantic definitions of each type of sense tags as well as informal descriptions of the inferences allowed at each level.
Proceedings of the Workshop on Sentiment and Subjectivity in Text | 2006
Rashmi Prasad; Nikhil Dinesh; Alan Lee; Aravind K. Joshi; Bonnie Webber
An emerging task in text understanding and generation is to categorize information as fact or opinion and to further attribute it to the appropriate source. Corpus annotation schemes aim to encode such distinctions for NLP applications concerned with such tasks, such as information extraction, question answering, summarization, and generation. We describe an annotation scheme for marking the attribution of abstract objects such as propositions, facts and eventualities associated with discourse relations and their arguments annotated in the Penn Discourse TreeBank. The scheme aims to capture the source and degrees of factuality of the abstract objects. Key aspects of the scheme are annotation of the text spans signalling the attribution, and annotation of features recording the source, type, scopal polarity, and determinacy of attribution.
meeting of the association for computational linguistics | 2016
Bonnie Webber; Rashmi Prasad; Alan Lee; Aravind K. Joshi
English grammars indicate a variety of relations holding between conjoined VPs. VPs conjoined by and evince such senses as Result, Temporal Sequence and Concession. Although all these senses are ones associated with discourse relations, conjoined VPs have not been fully included in discourse annotation. Because of the value of discourse-annotated corpora for developing approaches to automated sense recognition, we have added their annotation to the Penn Discourse TreeBank. This paper describes how tokens were identified; how the process of span and sense annotation was modified and extended in order to keep the annotation of intra-sentential multi-clausal structures consistent with the rest of the corpus; and what the resulting corpus looks like, in terms of token frequency and common sense patterns.
Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing | 2008
Hong Yu; Nadya Frid; Susan Weber McRoy; Rashmi Prasad; Alan Lee; Aravind K. Joshi
The goal of the Penn Discourse Treebank (PDTB) project is to develop a large-scale corpus, annotated with coherence relations marked by discourse connectives. Currently, the primary application of the PDTB annotation has been to news articles. In this study, we tested whether the PDTB guidelines can be adapted to a different genre. We annotated discourse connectives and their arguments in one 4,937-token full-text biomedical article. Two linguist annotators showed an agreement of 85% after simple conventions were added. For the remaining 15% cases, we found that biomedical domain-specific knowledge is needed to capture the linguistic cues that can be used to resolve inter-annotator disagreement. We found that the two annotators were able to reach an agreement after discussion. Thus our experiments suggest that the PDTB annotation can be adapted to new domains by minimally adjusting the guidelines and by adding some further domain-specific linguistic cues.
empirical methods in natural language processing | 2015
Rashmi Prasad; Bonnie Webber; Alan Lee; Sameer Pradhan; Aravind K. Joshi
It is in PropBank’s ARGM annotation of clausal adjuncts that sentential semantics meets discourse relation annotation in the Penn Discourse TreeBank. This paper discusses complementarities between the two annotation systems: How PropBank ARGM annotation can be used to seed annotation of additional discourse relations in the PDTB, and how PDTB annotation can be used to refine or enrich PropBank ARGM annotation.
language resources and evaluation | 2008
Rashmi Prasad; Nikhil Dinesh; Alan Lee; Eleni Miltsakaki; Livio Robaldo; Aravind K. Joshi; Bonnie Webber
Archive | 2006
Rashmi Prasad; Eleni Miltsakaki; Nikhil Dinesh; Alan Lee; Aravind K. Joshi; Livio Robaldo; Bonnie Webber
international conference on computational linguistics | 2008
Emily Pitler; Mridhula Raghupathy; Hena Mehta; Ani Nenkova; Alan Lee; Aravind K. Joshi
Traitement Automatique des Langues | 2006
Rashmi Prasad; Nikhil Dinesh; Alan Lee; Aravind K. Joshi; Bonnie Webber