Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Samuel Gustman is active.

Publication


Featured researches published by Samuel Gustman.


IEEE Transactions on Speech and Audio Processing | 2004

Automatic recognition of spontaneous speech for access to multilingual oral history archives

William Byrne; David S. Doermann; Martin Franz; Samuel Gustman; Jan Hajic; Douglas W. Oard; Michael Picheny; Josef Psutka; Bhuvana Ramabhadran; Dagobert Soergel; Todd Ward; Wei-Jing Zhu

Much is known about the design of automated systems to search broadcast news, but it has only recently become possible to apply similar techniques to large collections of spontaneous speech. This paper presents initial results from experiments with speech recognition, topic segmentation, topic categorization, and named entity detection using a large collection of recorded oral histories. The work leverages a massive manual annotation effort on 10 000 h of spontaneous speech to evaluate the degree to which automatic speech recognition (ASR)-based segmentation and categorization techniques can be adapted to approximate decisions made by human annotators. ASR word error rates near 40% were achieved for both English and Czech for heavily accented, emotional and elderly spontaneous speech based on 65-84 h of transcribed speech. Topical segmentation based on shifts in the recognized English vocabulary resulted in 80% agreement with manually annotated boundary positions at a 0.35 false alarm rate. Categorization was considerably more challenging, with a nearest-neighbor technique yielding F=0.3. This is less than half the value obtained by the same technique on a standard newswire categorization benchmark, but replication on human-transcribed interviews showed that ASR errors explain little of that difference. The paper concludes with a description of how these capabilities could be used together to search large collections of recorded oral histories.


international acm sigir conference on research and development in information retrieval | 2004

Building an information retrieval test collection for spontaneous conversational speech

Douglas W. Oard; Dagobert Soergel; David S. Doermann; Xiaoli Huang; G. Craig Murray; Jianqiang Wang; Bhuvana Ramabhadran; Martin Franz; Samuel Gustman; James Mayfield; Liliya Kharevych; Stephanie M. Strassel

Test collections model use cases in ways that facilitate evaluation of information retrieval systems. This paper describes the use of search-guided relevance assessment to create a test collection for retrieval of spontaneous conversational speech. Approximately 10,000 thematically coherent segments were manually identified in 625 hours of oral history interviews with 246 individuals. Automatic speech recognition results, manually prepared summaries, controlled vocabulary indexing, and name authority control are available for every segment. Those features were leveraged by a team of four relevance assessors to identify topically relevant segments for 28 topics developed from actual user requests. Search-guided assessment yielded sufficient inter-annotator agreement to support formative evaluation during system development. Baseline results for ranked retrieval are presented to illustrate use of the collection.


acm/ieee joint conference on digital libraries | 2002

Supporting access to large digital oral history archives

Samuel Gustman; Dagobert Soergel; Douglas W. Oard; William Byrne; Michael Picheny; Bhuvana Ramabhadran; Douglas Greenberg

This paper describes our experience with the creation, indexing, and provision of access to a very large archive of videotaped oral histories - 116,000 hours of digitized interviews in 32 languages from 52,000 survivors, liberators, rescuers, and witnesses of the Nazi Holocaust. It goes on to identify a set of critical research issues that must be addressed if we are to provide full and detailed access to collections of this size: issues in user requirement studies, automatic speech recognition, automatic classification, segmentation, summarization, retrieval, and user interfaces. The paper ends by inviting others to discuss use of these materials in their own research.


text speech and dialogue | 2002

Automatic Transcription of Czech Language Oral History in the MALACH Project: Resources and Initial Experiments

Josef Psutka; Pavel Ircing; Vlasta Radová; William Byrne; Jan Hajic; Samuel Gustman; Bhuvana Ramabhadran

In this paper we describe the initial stages of the ASR component of the MALACH (Multilingual Access to Large Spoken Archives) project. This project will attempt to provide improved access to the large multilingual spoken archives collected by the Survivors of the Shoah Visual History Foundation (VHF) by advancing the state of the art in automated speech recognition. In order to train the ASR system, it is neccesary to manually transcribe a large amount of speech data, identify the appropriate vocabulary, and obtain relevant text for language modeling. We give a detailed description of the speech annotation process; show the specific properties of the spontaneous speech contained in the archives; and present a baseline speech recognition results.


text speech and dialogue | 2003

Building LVCSR System for Transcription of Spontaneously Pronounced Russian Testimonies in the MALACH Project: Initial Steps and First Results

Josef Psutka; Ilja Iljuchin; Pavel Ircing; Václav Trejbal; William Byrne; Jan Hajic; Samuel Gustman

The MALACH project [1] uses the world’s largest digital archives of video oral histories collected by the Survivors of the Shoah Visual History Foundation (VHF) and attempts to access such archives by advancing the state-of-the-art in Automated Speech Recognition (ASR) and Information Retrieval (IR). This paper discusses the initial steps and the first results in building large vocabulary continuous speech recognition (LVCSR) system for transcription of Russian witnesses. Russian as the third language processed in the MALACH project (after English [2] and Czech [3]) brought new problems especially in the phonetic area. Although the most of the Russian testimonies were provided by native Russian survivors we have encountered many different accents in their speech caused by a territory where the survivors are living.


text speech and dialogue | 2002

Cross-Language Access to Recorded Speech in the MALACH Project

Douglas W. Oard; Dina Demner-Fushman; Jan Hajic; Bhuvana Ramabhadran; Samuel Gustman; William Byrne; Dagobert Soergel; Bonnie J. Dorr; Philip Resnik; Michael Picheny

The MALACH project seeks to help users find information in a vast multilingual collections of untranscribed oral history interviews. This paper introduces the goals of the project and focuses on supporting access by users who are unfamiliar with the interview language. It begins with a review of the state of the art in cross-language speech retrieval; approaches that will be investigated in the project are then described. Czech was selected as the first non-English language to be supported, so results of an initial experiment with Czech/English cross-language retrieval are reported.


Proceedings of The Asist Annual Meeting | 2005

Access to large spoken archives: Uses and technology. Sponsored by SIG VIS

Dagobert Soergel; Samuel Gustman; Mark Kornbluh; Bhuvana Ramabhadran; Jerry Goldman

With recent advances in information technology, digital archiving is emerging as an important and practical method for capturing the human experience. Large amounts of spoken materials and audiovisual materials in which speech is an important component are becoming available. This panel will discuss the uses of these mateials for education, information retrieval and dissemination, and research, the requirements that arise from these uses, and speech recognition and retrieval technologies being developed to meet these requirements. These materials have tremendous potential for enriching the presentation of information in education, newscasts and documentaries, but retrieval from and access to these large repositories pose significant challenges. The panel will provide an overview of these issues.


conference of the international speech communication association | 2003

Large vocabulary ASR for spontaneous Czech in the MALACH project

Josef Psutka; Pavel Ircing; V Psutka Josef; Vlasta Radová; William Byrne; Jan Hajic; Jiří Mírovský; Samuel Gustman


language resources and evaluation | 2004

Issues in annotation of the Czech spontaneous speech corpus in the MALACH project

Josef Psutka; Pavel Ircing; Jan Hajic; Vlasta Radová; William Byrne; Samuel Gustman


Lecture Notes in Computer Science | 2002

Cross-language access to recorded speech in the MALACH project

Douglas W. Oard; Dina Demner-Fushman; Jan Hajic; Bhuvana Ramabhadran; Samuel Gustman; William Byrne; Dagobert Soergel; Bonnie J. Dorr; Philip Resnik; Michael Picheny

Collaboration


Dive into the Samuel Gustman's collaboration.

Top Co-Authors

Avatar

Josef Psutka

University of West Bohemia

View shared research outputs
Top Co-Authors

Avatar

Jan Hajic

Charles University in Prague

View shared research outputs
Top Co-Authors

Avatar

Pavel Ircing

University of West Bohemia

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Vlasta Radová

University of West Bohemia

View shared research outputs
Top Co-Authors

Avatar

V Psutka Josef

University of West Bohemia

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Jan Hajic

Charles University in Prague

View shared research outputs
Researchain Logo
Decentralizing Knowledge