Neil Kleynhans
North-West University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Neil Kleynhans.
language resources and evaluation | 2015
Neil Kleynhans; Etienne Barnard
Automatic speech recognition (ASR) technology has matured over the past few decades and has made significant impacts in a variety of fields, from assistive technologies to commercial products. However, ASR system development is a resource intensive activity and requires language resources in the form of text annotated audio recordings and pronunciation dictionaries. Unfortunately, many languages found in the developing world fall into the resource-scarce category and due to this resource scarcity the deployment of ASR systems in the developing world is severely inhibited. One approach to assist with resource-scarce ASR system development, is to select “useful” training samples which could reduce the resources needed to collect new corpora. In this work, we propose a new data selection framework which can be used to design a speech recognition corpus. We show for limited data sets, independent of language and bandwidth, the most effective strategy for data selection is frequency-matched selection and that the widely-used maximum entropy methods generally produced the least promising results. In our model, the frequency-matched selection method corresponds to a logarithmic relationship between accuracy and corpus size; we also investigated other model relationships, and found that a hyperbolic relationship (as suggested from simple asymptotic arguments in learning theory) may lead to somewhat better performance under certain conditions.
Procedia Computer Science | 2016
Neil Kleynhans; William Hartman; Daniel R. van Niekerk; Charl Johannes van Heerden; Rich Schwartz; Stavros Tsakalidis; Marelie H. Davel
Abstract We investigate modeling strategies for English code-switched words as found in a Swahili spoken term detection system. Code switching, where speakers switch language in a conversation, occurs frequently in multilingual environments, and typically de- teriorates STD performance. Analysis is performed in the context of the IARPA Babel program which focuses on rapid STD system development for under-resourced languages. Our results show that approaches that specifically target the modeling of code-switched words, significantly improve the detection performance of these words.
conference of the international speech communication association | 2016
Charl Johannes van Heerden; Neil Kleynhans; Marelie H. Davel
We would like to thank Brno University of Technology (BUT) and our gracious hosts – Jan (Honza) Cˇ ernocky´, Martin Karafi´at, Karel Versel´y and team – for support during our visit to BUT, and for access to the BUT computing environment where most of these experiments were conducted.
conference of the international speech communication association | 2011
Marelie H. Davel; Charl Johannes van Heerden; Neil Kleynhans; Etienne Barnard
Archive | 2011
Etienne Barnard; Marelie H. Davel; Charl Johannes van Heerden; Neil Kleynhans; Kalika Bali
SLTU | 2010
Charl Johannes van Heerden; Neil Kleynhans; Etienne Barnard; Marelie H. Davel
South African Journal of Science | 2017
Febe de Wet; Neil Kleynhans; Dirk Van Compernolle; Reza Sahraeian
Archive | 2015
Reza Sahraeian; Neil Kleynhans; Febe de Wet; Dirk Van Compernolle
conference of the international speech communication association | 2017
Daniel R. van Niekerk; Charl Johannes van Heerden; Marelie H. Davel; Neil Kleynhans; Oddur Kjartansson; Martin Jansche; Linne Ha
Archive | 2016
Neil Kleynhans; William Hartman; Daniel R. van Niekerk; Charl Johannes van Heerden; Rich Schwartz; Stavros Tsakalidis; Marelie H. Davel