Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Sue E. Johnson is active.

Publication


Featured researches published by Sue E. Johnson.


international conference on acoustics speech and signal processing | 1999

The Cambridge University spoken document retrieval system

Sue E. Johnson; P. Jourlin; Gareth L. Moore; Karen Sparck Jones; Philip C. Woodland

This paper describes the spoken document retrieval system that we have been developing and assesses its performance using automatic transcriptions of about 50 hours of broadcast news data. The recognition engine is based on the HTK broadcast news transcription system and the retrieval engine is based on the techniques developed at City University. The retrieval performance over a wide range of speech transcription error rates is presented and a number of recognition error metrics that more accurately reflect the impact of transcription errors on retrieval accuracy are defined and computed. The results demonstrate the importance of high accuracy automatic transcription. The final system is currently being evaluated on the 1998 TREC-7 spoken document retrieval task.


international conference on acoustics speech and signal processing | 1998

Experiments in broadcast news transcription

Philip C. Woodland; Thomas Hain; Sue E. Johnson; Thomas Niesler; Andreas Tuerk; Steve J. Young

This paper presents the development of the HTK broadcast news transcription system. Previously we have used data type specific modelling based on adapted Wall Street Journal trained HMMs. However, we are now experimenting with data for which no manual pre-classification or segmentation is available and therefore automatic techniques are required and compatible acoustic modelling strategies adopted. An approach for automatic audio segmentation and classification is described and evaluated as well as extensions to our previous work on segment clustering. A number of recognition experiments are presented that compare datatype specific and non-specific models; differing amounts of training data; the use of gender-dependent modelling and the effects of automatic data-type classification. It is shown that robust segmentation into a small number of audio types is possible and that models trained on a wide variety of data types can yield good performance.


international conference on acoustics, speech, and signal processing | 2000

A method for direct audio search with applications to indexing and retrieval

Sue E. Johnson; Philip C. Woodland

A technique for searching audio data to find an exact match for a given piece of cue-audio is described. The method uses a cepstral parameterisation of the audio and a covariance-based distance metric to quickly locate direct repeats. Results on data from ABC news broadcasts show that the method can successfully locate matches several hundred times faster than real-time and requires less than a second of cue-audio. By applying the match recursively to the data, repeated sections of audio, which nearly always correspond to non-news items such as commercials and theme-music, can be identified. Experiments show that the application of the technique can also lead to improved information retrieval using automatically transcribed broadcast data.


international acm sigir conference on research and development in information retrieval | 2000

Effects of out of vocabulary words in spoken document retrieval (poster session)

Philip C. Woodland; Sue E. Johnson; P. Jourlin; K. Sparck Jones

The effects of out-of-vocabulary (OOV) items in spoken document retrieval (SDR) are investigated. Several sets of transcriptions were created for the TREC-8 SDR task using a speech recognition system varying the vocabulary sizes and OOV rates, and the relative retrieval performance measured. The effects of OOV terms on a simple baseline IR system and on more sophisticated retrieval systems are described. The use of a parallel corpus for query and document expansion is found to be especially beneficial, and with this data set, good retrieval performance can be achieved even for fairly high OOV rates.


Speech Communication | 2000

Spoken document representations for probabilistic retrieval

P. Jourlin; Sue E. Johnson; Karen Sparck Jones; Philip C. Woodland

This paper presents some developments in query expansion and document representation of our spoken document retrieval system and shows how various retrieval techniques affect performance for different sets of transcriptions derived from a common speech source. Modifications of the document representation are used, which combine several techniques for query expansion, knowledge-based on one hand and statistics-based on the other. Taken together, these techniques can improve Average Precision by over 19% relative to a system similar to that which we presented at TREC-7. These new experiments have also confirmed that the degradation of Average Precision due to a word error rate (WER) of 25% is quite small (3.7% relative) and can be reduced to almost zero (0.2% relative). The overall improvement of the retrieval system can also be observed for seven different sets of transcriptions from different recognition engines with a WER ranging from 24.8% to 61.5%. We hope to repeat these experiments when larger document collections become available, in order to evaluate the scalability of these techniques.


international acm sigir conference on research and development in information retrieval | 2000

The Cambridge University multimedia document retrieval demo system

Andreas Tuerk; Sue E. Johnson; P. Jourlin; K. Sparck Jones; Philip C. Woodland

The Cambridge University Multimedia Document Retrieval (CU-MDR) Demo System is a web-based application that allows the user to query a database of radio broadcasts that are available on the Internet. The audio from several radio stations is downloaded and transcribed automatically. This gives a collection of text and audio documents that can be searched by a user. The paper describes how speech recognition and information retrieval techniques are combined in the CU-MDR Demo System and shows how the user can interact with it.


International Journal of Speech Technology | 2001

Information Retrieval from Unsegmented Broadcast News Audio

Sue E. Johnson; P. Jourlin; Karen Sparck Jones; Philip C. Woodland

This paper describes a system for retrieving relevant portions of broadcast news shows starting with only the audio data. A novel method of automatically detecting and removing commercials is presented and shown to increase the performance of the system while also reducing the computational effort required. A sophisticated large vocabulary speech recogniser which produces high-quality transcriptions of the audio and a window-based retrieval system with post-retrieval merging are also described.Results are presented using the 1999 TREC-8 Spoken Document Retrieval data for the task where no story boundaries are known. Experiments investigating the effectiveness of all aspects of the system are described, and the relative benefits of automatically eliminating commercials, enforcing broadcast structure during retrieval, using relevance feedback, changing retrieval parameters and merging during post-processing are shown.An Average Precision of 46.8%, when duplicates are scored as irrelevant, is shown to be achievable using this system, with the corresponding word error rate of the recogniser being 20.5%.


international acm sigir conference on research and development in information retrieval | 1999

Improving retrieval on imperfect speech transcriptions (poster abstract)

P. Jourlin; Sue E. Johnson; K. Sparck Jones; Philip C. Woodland

This paper presents the results from adding several forms of query expansion to our retrieval system running on several sets of transcriptions of broadcast news from the 1997 TREC-7 spoken document retrieval track.


international acm sigir conference on research and development in information retrieval | 2000

The Cambridge University Multimedia Document Retrieval demo system (demonstration session)

Andreas Tuerk; Sue E. Johnson; P. Jourlin; K. Sparck Jones; Philip C. Woodland

The CU-MDR Demo [3] is a web based application that allows the user to query a database of automatically generated transcripts of radio broadcasts that are available on-line. The system downloads the audio track of British and American news broadcasts from the Internet once a day and adds them to its archive. The audio, which comes in RealAudio format, is first converted into standard uncompressed format from which a transcription is produced using our large vocabulary broadcast news recognition engine. This yields a collection of text and audio documents which can be searched by the user.


Archive | 1998

Segment generation and clustering in the HTK broadcast news transcription system

Thomas Hain; Sue E. Johnson; Andreas Tuerk; Philip C. Woodland; Steve J. Young

Collaboration


Dive into the Sue E. Johnson's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

P. Jourlin

University of Cambridge

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Thomas Hain

University of Sheffield

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge