Daniel Bolaños
University of Colorado Boulder
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Daniel Bolaños.
ACM Transactions on Speech and Language Processing | 2011
Daniel Bolaños; Ronald A. Cole; Wayne H. Ward; Eric Borts; Edward Svirsky
We present initial results of FLORA, an accessible computer program that uses speech recognition to provide an accurate measure of childrens oral reading ability. FLORA presents grade-level text passages to children, who read the passages out loud, and computes the number of words correct per minute (WCPM), a standard measure of oral reading fluency. We describe the main components of the FLORA program, including the system architecture and the speech recognition subsystems. We compare results of FLORA to human scoring on 783 recordings of grade level text passages read aloud by first through fourth grade students in classroom settings. On average, FLORA WCPM scores were within 3 to 4 words of human scorers across students in different grade levels and schools.
Speech Communication | 2013
Daniel Bolaños; Ronald A. Cole; Wayne H. Ward; Gerald Tindal; Paula J. Schwanenflugel; Melanie R. Kuhn
We investigated the automatic assessment of expressive childrens oral reading of grade level text passages using a standardized rubric. After a careful review of the reading literature and a close examination of the rubric, we designed a novel set of prosodic and lexical features to characterize fluent expressive reading. A number of complementary sources of information were used to design the features, each of them motivated by research on different components of reading fluency. Features are connected to the childs reading rate, to the presence and number of pauses, filled-pauses and word-repetitions, the correlation between punctuation marks and pauses, the length of word groupings, syllable stress and duration and the location of pitch peaks and contours. The proposed features were evaluated on a corpus of 783 one-minute reading sessions from 313 students reading grade-leveled passages without assistance (cold unassisted reading). Experimental results show that the proposed lexical and prosodic features provide complementary information and are able to capture the characteristics of expressive reading. The results showed that on both the 2-point and the 4-point expressiveness scales, computer-generated ratings of expressiveness agreed with human raters better than the human raters agreed with each other. The results of the study suggest that automatic assessment of expressive oral reading can be combined with automatic measures of word accuracy and reading rate to produce an accurate multidimensional estimate of childrens oral reading ability.
north american chapter of the association for computational linguistics | 2009
Daniel Bolaños; Geoffrey Zweig; Patrick Nguyen
Voice Search applications provide a very convenient and direct access to a broad variety of services and information. However, due to the vast amount of information available and the open nature of the spoken queries, these applications still suffer from recognition errors. This paper explores the utilization of personalization features for the post-processing of recognition results in the form of n-best lists. Personalization is carried out from three different angles: short-term, long-term and Web-based, and a large variety of features are proposed for use in a log-linear classification framework. Experimental results on data obtained from a commercially deployed Voice Search system show that the combination of the proposed features leads to a substantial sentence error rate reduction. In addition, it is shown that personalization features which are very different in nature can successfully complement each other.
Proceedings of the 2nd Workshop on Child, Computer and Interaction | 2009
Daniel Bolaños; Wayne H. Ward; Ronald A. Cole
In this article we present a novel approach to reference verification, the problem of determining if a speakers utterance matches a specific reference (text) string. We will then discuss its application to a reading tracker system for childrens speech. Unlike other reading tracker systems proposed in the literature, that are built over conventional speech recognizers with ad-hoc language models, the reading tracker described here is designed specifically for the task of estimating whether a child has read an expected sequence of words out loud. The tracker is designed to handle in a natural and flexible way the disfluencies that frequently appear in childrens speech while reading out loud, (e.g., partial-words, repetitions, self-corrections, sentence-restarts, etc), and to overcome problems caused by using language models within the reference verification task. Three mechanisms have been introduced for this purpose: the utilization of filler models and the inclusion of forward and backward inter-word transitions in the static decoding network. While this article focuses on the approach used to overcome errors observed in previous systems, the performance of this system will be evaluated on a corpus of childrens speech while reading out loud and compared to the performance of a traditional reading tracker system that is built on top of a speech recognition system. The results of this comparison will be presented at WOCCI 2009.
text speech and dialogue | 2013
Ronald A. Cole; Wayne H. Ward; Daniel Bolaños; Cindy Buchenroth-Martin; Eric Borts
Advances in human language and character animation technologies have enabled a new generation of intelligent tutoring systems that support conversational interaction between young learners and a lifelike computer character that was designed to behave like a sensitive and effective human tutor My Science Tutor is a spoken dialog system in which children learn to construct science explanations through conversations with Marni, the virtual science tutor, in multimedia environments. MyST displays illustrations, silent animations or interactive simulations to the student, while Marni asks open-ended questions like “Whats going here?”. Based on MySTs analysis of the students spoken response, the system decides what the student understands about the science and what the student has not yet explained (or doesnt know), and generates a follow-on question a new prompt, and possibly a new animation, that is designed to scaffold learning and challenge the student to reason about the science. Two large scale evaluations were conducted in which third, fourth and fifth grade students received over 5 hours of tutoring during sixteen 20-minute sessions in four different areas of science. The results revealed that, relative to students who did not receive tutoring, students who used My Science Tutor achieved significant learning gains in standardized tests of science achievement, equivalent to gains achieved by students who received tutoring by expert human tutors. In recent research, we have extended the technologies used in MyST to a develop a new generation of interactive books that use text, speech and dialog technologies to help children learn to read science texts fluently, expressively, and with good comprehension. We will demonstrate these MindStars Books and present initial results of classroom testing.
ACM Transactions on Speech and Language Processing | 2011
Wayne H. Ward; Ronald A. Cole; Daniel Bolaños; Cindy Buchenroth-Martin; Edward Svirsky; Sarel van Vuuren; Timothy J. Weston; Jing Zheng; Lee Becker
Journal of Educational Psychology | 2013
Wayne H. Ward; Ron Cole; Daniel Bolaños; Cindy Buchenroth-Martin; Edward Svirsky; Timothy J. Weston
Journal of Educational Psychology | 2013
Daniel Bolaños; Ron Cole; Wayne H. Ward; Gerald Tindal; Jan Hasbrouck; Paula J. Schwanenflugel
conference of the international speech communication association | 2008
Daniel Bolaños; Wayne H. Ward
national conference on artificial intelligence | 2010
Rodney D. Nielsen; Richard M. Voyles; Daniel Bolaños; Mohammad H. Mahoor; Wilson D. Pace; Katie A. Siek; Wayne H. Ward