Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Neil Kleynhans is active.

Publication


Featured researches published by Neil Kleynhans.


language resources and evaluation | 2015

Efficient data selection for ASR

Neil Kleynhans; Etienne Barnard

Automatic speech recognition (ASR) technology has matured over the past few decades and has made significant impacts in a variety of fields, from assistive technologies to commercial products. However, ASR system development is a resource intensive activity and requires language resources in the form of text annotated audio recordings and pronunciation dictionaries. Unfortunately, many languages found in the developing world fall into the resource-scarce category and due to this resource scarcity the deployment of ASR systems in the developing world is severely inhibited. One approach to assist with resource-scarce ASR system development, is to select “useful” training samples which could reduce the resources needed to collect new corpora. In this work, we propose a new data selection framework which can be used to design a speech recognition corpus. We show for limited data sets, independent of language and bandwidth, the most effective strategy for data selection is frequency-matched selection and that the widely-used maximum entropy methods generally produced the least promising results. In our model, the frequency-matched selection method corresponds to a logarithmic relationship between accuracy and corpus size; we also investigated other model relationships, and found that a hyperbolic relationship (as suggested from simple asymptotic arguments in learning theory) may lead to somewhat better performance under certain conditions.


Procedia Computer Science | 2016

Code-switched English pronunciation modeling for Swahili spoken term detection

Neil Kleynhans; William Hartman; Daniel R. van Niekerk; Charl Johannes van Heerden; Rich Schwartz; Stavros Tsakalidis; Marelie H. Davel

Abstract We investigate modeling strategies for English code-switched words as found in a Swahili spoken term detection system. Code switching, where speakers switch language in a conversation, occurs frequently in multilingual environments, and typically de- teriorates STD performance. Analysis is performed in the context of the IARPA Babel program which focuses on rapid STD system development for under-resourced languages. Our results show that approaches that specifically target the modeling of code-switched words, significantly improve the detection performance of these words.


conference of the international speech communication association | 2016

Improving the Lwazi ASR baseline

Charl Johannes van Heerden; Neil Kleynhans; Marelie H. Davel

We would like to thank Brno University of Technology (BUT) and our gracious hosts – Jan (Honza) Cˇ ernocky´, Martin Karafi´at, Karel Versel´y and team – for support during our visit to BUT, and for access to the BUT computing environment where most of these experiments were conducted.


conference of the international speech communication association | 2011

Efficient harvesting of Internet audio for resource-scarce ASR

Marelie H. Davel; Charl Johannes van Heerden; Neil Kleynhans; Etienne Barnard


Archive | 2011

Phone recognition for Spoken Web Search

Etienne Barnard; Marelie H. Davel; Charl Johannes van Heerden; Neil Kleynhans; Kalika Bali


SLTU | 2010

Pooling ASR data for closely related languages

Charl Johannes van Heerden; Neil Kleynhans; Etienne Barnard; Marelie H. Davel


South African Journal of Science | 2017

Speech recognition for under-resourced languages: Data sharing in hidden Markov model systems

Febe de Wet; Neil Kleynhans; Dirk Van Compernolle; Reza Sahraeian


Archive | 2015

Knowledge-based phoneme mapping between Flemish and Afrikaans

Reza Sahraeian; Neil Kleynhans; Febe de Wet; Dirk Van Compernolle


conference of the international speech communication association | 2017

Rapid development of TTS corpora for four South African languages

Daniel R. van Niekerk; Charl Johannes van Heerden; Marelie H. Davel; Neil Kleynhans; Oddur Kjartansson; Martin Jansche; Linne Ha


Archive | 2016

Code-switched English Pronunciation Modeling for Swahili Spoken Term Detection (Pub Version, Open Access)

Neil Kleynhans; William Hartman; Daniel R. van Niekerk; Charl Johannes van Heerden; Rich Schwartz; Stavros Tsakalidis; Marelie H. Davel

Collaboration


Dive into the Neil Kleynhans's collaboration.

Top Co-Authors

Avatar

Charl Johannes van Heerden

Council of Scientific and Industrial Research

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Febe de Wet

Council of Scientific and Industrial Research

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Reza Sahraeian

Katholieke Universiteit Leuven

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Jaco Badenhorst

Council of Scientific and Industrial Research

View shared research outputs
Researchain Logo
Decentralizing Knowledge