Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Timm Lehmberg is active.

Publication


Featured researches published by Timm Lehmberg.


Literary and Linguistic Computing | 2009

SusTEInability of linguistic resources through feature structures

Andreas Witt; Georg Rehm; Erhard W. Hinrichs; Timm Lehmberg; Jens Stegmann

This article shows that the TEI tag set for feature structures can be adopted to represent a heterogeneous set of linguistic corpora. The majority of corpora is annotated using markup languages that are based on the Annotation Graph framework, the upcoming Linguistic Annotation Format ISO standard, or according to tag sets defined by or based upon the TEI guidelines. A unified representation comprises the separation of conceptually different annotation layers contained in the original corpus data (e.g. syntax, phonology, and semantics) into multiple XML files. These annotation layers are linked to each other implicitly by the identical textual content of all files. A suitable data structure for the representation of these annotations is a multi-rooted tree that again can be represented by the TEI and ISO tag set for feature structures. The mapping process and representational issues are discussed as well as the advantages and drawbacks associated with the use of the TEI tag set for feature structures as a storage and exchange format for linguistically annotated data.


Library Trends | 2008

Digital Text Collections, Linguistic Research Data, and Mashups: Notes on the Legal Situation

Timm Lehmberg; Georg Rehm; Andreas Witt; Felix Zimmermann

Comprehensive data repositories are an essential part of practically all research carried out in the digital humanities nowadays. For example, library science, literary studies, and computational and corpus linguistics strongly depend on online archives that are highly sustainable and that contain not only digitized texts but also audio and video data as well as additional information such as metadata and arbitrary annotations. Current Web technologies, especially those that are related to what is commonly referred to as the Web 2.0, provide a number of novel functions such as multiuser editing or the inclusion of third-party content and applications that are also highly attractive for research applications in the areas mentioned above. Hand in hand with this development goes a high degree of legal uncertainty. The special nature of the data entails that, in quite a few cases, there are multiple holders of personal rights (mostly copyright) to different layers of data that often have different origins. This article discusses the legal problems of multiple authorships in private, commercial, and research environments. We also introduce significant differences between European and U.S. law with regard to the handling of this kind of data for scientific purposes.


Archive | 2006

Avoiding Data Graveyards: From Heterogeneous Data Collected in Multiple Research Projects to Sustainable Linguistic Resources.

Thomas C. Schmidt; Christian Chiarcos; Timm Lehmberg; Georg Rehm; Andreas Witt; Erhard W. Hinrichs


Archive | 2007

Rechtsfragen bei der Nutzung und Weitergabe linguistischer Daten

Timm Lehmberg; Christian Chiarcos; Georg Rehm; Andreas Witt


Archive | 2007

Collecting Legally Relevant Metadata by Means of a Decision-Tree-Based Questionnaire System

Timm Lehmberg; Christian Chiarcos; Erhard W. Hinrichs; Georg Rehm; Andreas Witt


Archive | 2011

New and future developments in EXMARaLDA

Thomas C. Schmidt; Kai Wörner; Hanna Hedeland; Timm Lehmberg


Procesamiento Del Lenguaje Natural | 2008

A web-platform for preserving, exploring, visualising, and querying linguistic corpora and other resources

Georg Rehm; Oliver Schonefeld; Andreas Witt; Christian Chiarcos; Timm Lehmberg


Archive | 2011

Multilingual Corpora at the Hamburg Centre for Language Corpora

Hanna Hedeland; Timm Lehmberg; Thomas C. Schmidt; Kai Wörner


KONVENS | 2008

Data structures for the analysis of regional language variation.

Birgit Kellner; Timm Lehmberg; Ingrid Schröder; Kai Wörner


language resources and evaluation | 2018

Introducing the CLARIN Knowledge Centre for Linguistic Diversity and Language Documentation.

Hanna Hedeland; Timm Lehmberg; Felix Rau; Sophie Salffner; Mandana Seyfeddinipur; Andreas Witt

Collaboration


Dive into the Timm Lehmberg's collaboration.

Top Co-Authors

Avatar

Andreas Witt

University of Tübingen

View shared research outputs
Top Co-Authors

Avatar

Georg Rehm

University of Tübingen

View shared research outputs
Top Co-Authors

Avatar

Christian Chiarcos

Goethe University Frankfurt

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Thomas C. Schmidt

Hamburg University of Applied Sciences

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Felix Rau

University of Cologne

View shared research outputs
Researchain Logo
Decentralizing Knowledge