Publication


Featured research published by Diamantino Caseiro.


Speech Communication | 2008

Recovering capitalization and punctuation marks for automatic speech recognition: Case study for Portuguese broadcast news

Fernando Batista; Diamantino Caseiro; Nuno J. Mamede; Isabel Trancoso

This paper presents a study on recovering punctuation marks and capitalization information from European Portuguese broadcast news speech transcriptions. Different approaches to capitalization were tested, both generative and discriminative, using finite state transducers automatically built from language models and maximum entropy models. Several resources were used, including lexica, written newspaper corpora and speech transcriptions. Finite state transducers produced the best results for written newspaper corpora, but the maximum entropy approach also proved to be a good choice: it is suitable for capitalizing speech transcriptions and allows straightforward on-the-fly capitalization. Evaluation results are presented both for written newspaper corpora and for broadcast news speech transcriptions. The frequency of each punctuation mark in broadcast news (BN) speech transcriptions was analyzed for three languages: English, Spanish and Portuguese. The punctuation task was performed using a maximum entropy modeling approach that combines different types of information, both lexical and acoustic. The contribution of each feature was analyzed individually, and separate results are given for each focus condition, making it possible to analyze the performance differences between planned and spontaneous speech. All results were evaluated on speech transcriptions of a Portuguese broadcast news corpus. The benefits of enriching speech recognition output with punctuation and capitalization are shown in an example that illustrates the effect of the described experiments on spoken texts.


the second international conference | 2002

Spoken book alignment using WFSTs

Diamantino Caseiro; Hugo Meinedo; António Joaquim Serralheiro; Isabel Trancoso; João Paulo Neto

The framework of this paper is a national project known as IPSOM, whose main goal is to improve access to digitally stored spoken books, used primarily by the visually impaired community, by providing tools for easily detecting and indexing units (words, sentences, topics). The project also aims to broaden the usage of multimedia spoken books (for instance, in didactic applications) by providing multimedia interfaces for access and retrieval. Hence, spoken book alignment is a major task.


IEEE Transactions on Audio, Speech, and Language Processing | 2006

A specialized on-the-fly algorithm for lexicon and language model composition

Diamantino Caseiro; Isabel Trancoso


Conference of the International Speech Communication Association | 2006

Recognition of classroom lectures in European Portuguese

Isabel Trancoso; Ricardo Nunes; Luís Neves; Céu Viana; Helena Moniz; Diamantino Caseiro; Ana Isabel Mata


Conference of the International Speech Communication Association | 2002

Using dynamic WFST composition for recognizing broadcast news

Diamantino Caseiro; Isabel Trancoso


Conference of the International Speech Communication Association | 2007

Recovering Punctuation Marks for Automatic Speech Recognition

Fernando Batista; Diamantino Caseiro; Nuno J. Mamede; Isabel Trancoso


Conference of the International Speech Communication Association | 2001

On integrating the lexicon with the language model

Diamantino Caseiro; Isabel Trancoso


Conference of the International Speech Communication Association | 2003

Towards a Repository of Digital Talking Books

António Joaquim Serralheiro; Isabel Trancoso; Diamantino Caseiro; Teresa Chambel; Luís Carriço; Nuno Guimarães


International Conference on Acoustics, Speech, and Signal Processing | 2003

A tail-sharing WFST composition algorithm for large vocabulary speech recognition

Diamantino Caseiro; Isabel Trancoso


Conference of the International Speech Communication Association | 2006

Spoken Language Technologies Applied to Digital Talking Books

Isabel Trancoso; Carlos Duarte; António Joaquim Serralheiro; Diamantino Caseiro; Luís Carriço; Céu Viana

Collaboration


Dive into Diamantino Caseiro's collaborations.

Top Co-Authors

Isabel Trancoso

Instituto Superior Técnico

Nuno J. Mamede

Technical University of Lisbon
