Eva Hajičová
Charles University in Prague
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Eva Hajičová.
Archive | 2003
Alena Böhmová; Jan Hajic; Eva Hajičová; Barbora Hladká
The availability of annotated data (with as rich and “deep” annotation as possible) is desirable in any new developments. Textual data are being used for so-called training phase of various empirical methods solving various problems in the field of computational linguistics. While there are many methods that use texts in their plain (or raw) form (in most cases for so-called unsupervised training), more accurate results may be obtained if annotated corpora are available. The data annotation itself is a complex task. While morphologically annotated corpora (pioneered by Henry Kucera in the 60’s) are now available for English and other languages, syntactically annotated corpora are rare. Inspired by the Penn Treebank, the most widely used syntactically annotated corpus of English, we decided to develop a similarly sized corpus of Czech with a rich annotation scheme.
international conference on computational linguistics | 1982
Eva Hajičová; Jarka Vrbova
The elements of the stock of knowledge shared by the speaker and the hearer change their salience, in the sense of being immediately accessible in the hearers memory. The hierarchy of salience is argued to be a basic component of a mechanism serving for the identification of reference. Some of the regularities of this mechanism are discussed, the description of which is a necessary prerequisite of an automatic understanding of connected texts.
Journal of Pragmatics | 1987
Eva Hajičová; Petr Sgall
Abstract The objective of the present paper is to point out the existence of a principle governing the ordering of a sentences major constituents (Section 1). This principle is closely related to the pragmatically based (though linguistically structured) topic-focus articulation of the (meaning of a) sentence, and therefore we also mention the operational tests used to analyze this articulation and characterize the semantic relevance of topic and focus, and of the communicative dynamism (CD) (section 2). We show how these phenomena can be economically described on the basis of Chomskys theta theory, if the ordering principle is treated by the categorial component (Section 3).
Language | 1971
David G. Lockwood; Petr Sgall; Ladislav Nebeský; Alla Goralčíková; Eva Hajičová; Ladislav Nebesky
Come with us to read a new book that is coming recently. Yeah, this is a new coming book that many people really want to read will you be one of them? Of course, you should be. It will not make you feel so hard to enjoy your life. Even some people think that reading is a hard to do, you must be sure that you can do it. Hard will be felt when you have no ideas about what kind of book to read. Or sometimes, your reading material is not interesting enough.
text speech and dialogue | 2001
Eva Hajičová; Jan Hajic; Barbora Hladká; Martin Holub; Petr Pajas; Veronika Reznícková; Petr Sgall
The Prague Dependency Treebank (PDT) project is conceived of as a many-layered scenario, both from the point of view of the stratal annotation scheme, from the division-of-labor point of view, and with regard to the level of detail captured at the highest, tectogrammatical layer. The following aspects of the present status of the PDT are discussed in detail: the now-available PDT version 1.0, annotated manually at the morphemic and analytic layers, including the recent experience with post-annotation checking; the ongoing effort of tectogrammatical layer annotation, with a specific attention to the so-called model collection; and to two different areas of exploitation of the PDT, for linguistic research purposes and for information retrieval application purposes.
conference of the european chapter of the association for computational linguistics | 1993
Eva Hajičová; Petr Sgall; Hana Skoumalová
An algorithm for automatic identification of topic and focus of the sentence is presented, based on dependency syntax and using written input, which is much more ambiguous than spoken utterance.
international conference on computational linguistics | 2000
Eva Buráňová; Eva Hajičová; Petr Sgall
After a brief characterization of the theory of the topic-focus articulation of the sentence (TFA), rules are formulated that determine the assignment of appropriate values of the TFA attribute in the process of syntactico-semantic tagging of a very large corpus of Czech.
international conference on computational linguistics | 1990
Eva Hajičová; Petr Kuboň; Vladislav Kuboň
The hierarchy of salience of the items of the knowledge assumed by the speaker to be shared by him and by the hearer constitutes one aspect of a dynamic account of discourse (Sect. 1). It is claimed that a representation of this hierarchy is a good support for discourse analysis (reference assignement, Sect. 2) and for discourse production (pronominalization, definite description, Sect. 3).
Theoretical Linguistics | 1977
Petr Sgall; Eva Hajičová; Oldřich Procházka
We want to present some support for the suggestion that a counterpart of Carnaps intensional structure may be specified, for natural language, as the semantic representations of sentences, if belief (and other intensional) contexts are kept apart from metalinguistic assertions. Semantic representations should include the topic/focus articulation and other empirically based issues that can be checked linguistically, while the interplay of meaning postulates and a translation procedure should account for the relationship between semantic representations and propositions. It is also suggested that the relationship between concepts and objects ist not fully symmetrical to that between propositions and truth values.
international conference on computational linguistics | 1992
Eva Hajičová; Vladislav Kuboň; Petr Kuboň
The paper develops further the idea of using the notion of the stock of shared knowledge (SSK) for anaphora resolution following a more subtle treatment of the influence of the topic/focus articulation of the sentence on the degrees of salience of items of the SSK. An algorithmic evaluation procedure of the SSK is formulated taking into account the notions of contextual boundness, syntactic associations, complexity of the sentences and existence/nonexistence of possible competitors, and a general evaluating function is proposed, essential for the process of anaphora resolution. In the present paper the analysis is performed for Czech; however, the considerations are claimed to be of a universal validity, the actual relations between different factors and the values, of course, being language-dependent.