Christian Biemann
Leipzig University
Publication
Featured research published by Christian Biemann.
international conference on computational linguistics | 2005
Christian Biemann; Sven Teresniak
This work presents an unsupervised solution to language identification. The method sorts the sentences of a multilingual text corpus into the languages the corpus contains and makes no assumptions about the number or size of the monolingual fractions. Evaluation on 7-lingual and bilingual corpora shows that the quality of classification is comparable to supervised approaches and that the method works almost error-free once at least 100 sentences per language are available.
conference on intelligent text processing and computational linguistics | 2004
Christian Biemann; Stefan Bordag; Gerhard Heyer; Uwe Quasthoff; Christian Wolff
In this paper we describe a flexible, portable and language-independent infrastructure for setting up large monolingual language corpora. The approach is based on collecting a large amount of monolingual text from various sources. The input data is processed on the basis of a sentence-based text segmentation algorithm. We describe the entry structure of the corpus database as well as various query types and tools for information extraction. Among them, the extraction and usage of sentence-based word collocations is discussed in detail. Finally we give an overview of different applications for this language resource. A WWW interface allows for public access to most of the data and information extraction tools (http://wortschatz.uni-leipzig.de).
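The idea of sentence-based word collocations mentioned in the abstract above can be illustrated with a minimal sketch that simply counts how often two words share a sentence. This is not the infrastructure described in the paper: the function name `sentence_cooccurrences`, the whitespace tokenization, and the toy corpus are assumptions made for illustration, and the sketch reports raw counts only, without any significance measure.

```python
from collections import Counter
from itertools import combinations


def sentence_cooccurrences(sentences):
    """Count how often two distinct words occur in the same sentence.

    Hypothetical helper for illustration; raw counts only.
    """
    pair_counts = Counter()
    for sentence in sentences:
        # Assumed tokenization: lowercase, split on whitespace.
        # Sorting makes each unordered pair appear in one canonical order.
        words = sorted(set(sentence.lower().split()))
        # Every unordered pair of distinct words counts once per sentence.
        pair_counts.update(combinations(words, 2))
    return pair_counts


# Toy corpus (invented for this example).
corpus = [
    "the cat sat on the mat",
    "the cat chased the mouse",
    "a mouse ate the cheese",
]
counts = sentence_cooccurrences(corpus)
print(counts[("cat", "the")])    # sentences containing both "cat" and "the"
print(counts[("cat", "mouse")])  # sentences containing both "cat" and "mouse"
```

A real corpus pipeline would typically rank such pairs by a statistical significance measure rather than by raw frequency, since frequent function words co-occur with almost everything.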
Lecture Notes in Computer Science | 2004
Christian Biemann; Karsten Böhm; Gerhard Heyer; Ronny Melz
The automated creation and visualization of concept structures become more important as the amount of relevant information continues to grow dramatically. Information- and knowledge-intensive tasks in particular rely heavily on access to the relevant information or knowledge at the right time. Moreover, relevant facts and good ideas should be captured as early as possible in the knowledge creation process. In this paper we introduce a technology that supports knowledge structuring processes at the time of their creation by building up concept structures in real time. Our focus was on the design of a minimally invasive system, which ideally requires no human interaction and thus gives maximum freedom to the participants of a knowledge creation or exchange process. The initial prototype concentrates on capturing spoken language to support meetings of human experts, but can easily be adapted for use in Internet communities that rely on knowledge exchange via electronic communication channels.
international conference on computational linguistics | 2002
Uwe Quasthoff; Christian Biemann; Christian Wolff
The regularity of named entities is used to learn names and to extract named entities. Starting from only a few name elements and a set of patterns, the algorithm learns new names and their elements. A verification step ensures quality using a large background corpus. Further improvement is achieved by classifying the newly learnt elements at the character level. Moreover, unsupervised rule learning is discussed.
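The bootstrapping idea in the abstract above (a few seed name elements, a set of patterns, and a verification step) can be sketched roughly as follows. This is an illustrative toy, not the paper's algorithm: the two adjacency patterns, the `min_support` threshold standing in for verification against a background corpus, the function name `learn_name_elements`, and the example sentences are all assumptions.

```python
from collections import Counter


def learn_name_elements(sentences, first_names, last_names,
                        min_support=2, max_rounds=10):
    """Iteratively grow sets of first and last names (hypothetical sketch).

    Assumed patterns:
      known first name + capitalized word -> last-name candidate
      capitalized word + known last name  -> first-name candidate
    Assumed verification: a candidate is accepted only if the pattern
    matches it at least `min_support` times in the corpus.
    """
    first, last = set(first_names), set(last_names)
    tokenized = [s.split() for s in sentences]
    for _ in range(max_rounds):
        first_votes, last_votes = Counter(), Counter()
        for tokens in tokenized:
            for left, right in zip(tokens, tokens[1:]):
                # Both tokens must start with an uppercase letter.
                if not (left[:1].isupper() and right[:1].isupper()):
                    continue
                if left in first and right not in last:
                    last_votes[right] += 1
                if right in last and left not in first:
                    first_votes[left] += 1
        new_first = {w for w, n in first_votes.items() if n >= min_support}
        new_last = {w for w, n in last_votes.items() if n >= min_support}
        if not new_first and not new_last:
            break  # nothing verified in this round, so stop
        first |= new_first
        last |= new_last
    return first, last


# Toy corpus without punctuation (invented for this example).
sentences = [
    "Yesterday Anna Schmidt met Peter Schmidt",
    "Anna Schmidt and Peter Meyer spoke",
    "Later Peter Meyer left",
    "Peter Schmidt called Maria Meyer",
]
first, last = learn_name_elements(sentences, {"Anna"}, set())
print(sorted(first), sorted(last))
```

With a single seed, the sketch alternates between learning last names from known first names and first names from known last names. A candidate that matches a pattern only once ("Maria" here) stays below the assumed support threshold and is not accepted.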
european conference on principles of data mining and knowledge discovery | 2004
Christian Biemann; Karsten Böhm; Gerhard Heyer; Ronny Melz
In this demonstration we introduce a technology that supports knowledge structuring processes at the time of their creation by building up concept structures in real time. Our focus was on the design of a minimally invasive system, which ideally requires no human interaction and thus gives maximum freedom to the participants of a knowledge creation or exchange process. The system captures and displays spoken dialogs as well as text documents for further use in knowledge engineers' tools.
language resources and evaluation | 2006
Uwe Quasthoff; Matthias Richter; Christian Biemann
language resources and evaluation | 2004
Christian Biemann; Stefan Bordag; Uwe Quasthoff; Christian Wolff
Journal of Universal Computer Science | 2003
Christian Biemann; Karsten Böhm; Uwe Quasthoff; Christian Wolff
Ldv Forum | 2004
Christian Biemann; Stefan Bordag; Uwe Quasthoff
Ldv Forum | 2003
Christian Biemann