Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Maarten Janssen is active.

Publication


Featured researches published by Maarten Janssen.


TPDL | 2018

Adding Words to Manuscripts: From PagesXML to TEITOK

Maarten Janssen

This article describes a two-step method for transcribing historic manuscripts. In this method, the first step uses a page-based representation making it easy to transcribe the document page-by-page and line-by-line, while the second step converts this to the TEI/XML text-based format, in order to make sure the document becomes fully searchable.


PROPOR | 2018

Technical Implementation of the Vocabulário Ortográfico Comum da Língua Portuguesa

Maarten Janssen; José Pedro Ferreira

The recent Portuguese language orthographic agreement (AOLP90) specifies that the new spelling rules are implemented in an official spelling dictionary (VOC). VOC, released in 2017, is the first common spelling dictionary valid in all Portuguese-speaking countries. AOLP90 allows for some national-level spelling variation, defined in a national spelling dictionary (VON) for each country, containing the nationally-representative words and national-level variants. This combination of a single official spelling with national variation cannot be handled in a traditional set-up for lexical data. This article describes how the lexicon is practically implemented in the VOC database. We start by presenting the nature of AOLP90, the requirements for VOC, and the lexical database. We then analyze the technical implications of orthographic variation in a pluricentric context and present the solutions and practical implementation adopted in VOC. We finish by presenting the pluricentric management system designed for this purpose, devised to cater for decentralized, but compatible management of the lexical database.


Proceedings of the XIII EURALEX International Congress (Barcelona, 15-19 July 2008), 2008, ISBN 978-84-96742-67-3, págs. 351-357 | 2008

Mordebe Admin—A Lexical Management System

José Pedro Ferreira; Sílvia Barbosa; Maarten Janssen


language resources and evaluation | 2016

The COPLE2 corpus: a learner corpus for Portuguese.

Amália Mendes; Sandra Antunes; Maarten Janssen; Anabela Gonçalves


language resources and evaluation | 2016

TEITOK: Text-Faithful Annotated Corpora.

Maarten Janssen


conference of the international speech communication association | 2012

A Rule Based Pronunciation Generator and Regional Accent Databank for Portuguese.

Simone Ashby; Sílvia Barbosa; Silvia Brandão; José Pedro Ferreira; Maarten Janssen; Catarina Silva; Mário Eduardo Viaro


language and technology conference | 2016

Towards error annotation in a learner corpus of Portuguese

Iria del Río; Sandra Antunes; Amália Mendes; Maarten Janssen


Revista da Associação Portuguesa de Linguística | 2016

Apresentação do Corpus de Português Língua Estrangeira/Língua Segunda – COPLE2

Sandra Antunes; Amália Mendes; Anabela Gonçalves; Maarten Janssen; Nélia Alexandre; António Avelar; Adelina Castelo; Inês Duarte; Maria João Freitas; José Pascoal; Jorge Pinto


Livro de Resumos – XXX Encontro Nacional da Associação Portuguesa de Linguística, 22-24 de outubro | 2014

Corpus de Português Língua Estrangeira / Língua Segunda – COPLE2

Amália Mendes; Sandra Antunes; Nélia Alexandre; António Avelar; Adelina Castelo; Inês Duarte; Maria João Freitas; Anabela Gonçalves; José Pascoal; Jorge Pinto; Maarten Janssen


language resources and evaluation | 2008

Spock - a Spoken Corpus Client.

Maarten Janssen; Tiago Freitas

Collaboration


Dive into the Maarten Janssen's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

Sandra Antunes

Universidade Nova de Lisboa

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Adelina Castelo

Instituto Politécnico Nacional

View shared research outputs
Top Co-Authors

Avatar

Simone Ashby

University College Dublin

View shared research outputs
Researchain Logo
Decentralizing Knowledge