Pascal Getreuer
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Pascal Getreuer.
international conference on acoustics, speech, and signal processing | 2017
Pascal Getreuer; Thad Hughes; Richard F. Lyon; Rif A. Saurous
Robust and far-field speech recognition is critical to enable true hands-free communication. In far-field conditions, signals are attenuated due to distance. To improve robustness to loudness variation, we introduce a novel frontend called per-channel energy normalization (PCEN). The key ingredient of PCEN is the use of an automatic gain control based dynamic compression to replace the widely used static (such as log or root) compression. We evaluate PCEN on the keyword spotting task. On our large rerecorded noisy and far-field eval sets, we show that PCEN significantly improves recognition performance. Furthermore, we model PCEN as neural network layers and optimize high-dimensional PCEN parameters jointly with the keyword spotting acoustic model. The trained PCEN frontend demonstrates significant further improvements without increasing model complexity or inference-time cost.
IEEE Transactions on Multimedia | 2018
Pascal Getreuer; Chet N. Gnegy; Richard F. Lyon; Rif A. Saurous
We have implemented a near-ultrasonic communication protocol in the 18.5–20xa0kHz band, which is inaudible to most humans, using commodity smartphone speakers and microphones to transmit and receive signals. The protocol described in this paper is a component of Googles Nearby platform, where near-ultrasound signals are used to establish copresence between nearby devices by transmitting a short token. High-frequency sound does not pass through walls (most energy is reflected), so identified devices are constrained to approximately the same room, “within earshot” of one another. Our protocol has a raw data rate of 94.5 b/s, and we find in real indoor environments that transmission between mobile devices is reliable at 2 m distance and often works at 10 m. We use direct-sequence spread spectrum modulation, which makes it highly robust to multipath, motion, and narrowband noise. We use a 127-chip pseudorandom code, repeating once per data symbol, and modulate its amplitude with orthogonal sine waveforms encoding 4-bit symbol values. We add the orthogonal sines to a constant “pedestal,” which is inefficient in an information-theoretic sense, but makes synchronization easier. We describe a robust and computationally efficient transmitter and receiver implementations and show experiments on real and simulated data.
Archive | 2015
Andrew Ames Bunner; Alan Lee Gardner; Mohammed Waleed Kadous; Brian Patrick Williams; Marc Stogaitis; Nadav Aharony; Brian Duff; Pascal Getreuer; Zhentao Sun; Daniel Estrada Alva; Ami Patel; Benjamin Razon; Richard Daniel Webb; Tony Weber; Thomas Yuchin Chao; Ryan Michael Rifkin; Richard F. Lyon; Liem Tran; Joseph A. Farfel
arXiv: Computer Vision and Pattern Recognition | 2018
Frank Ong; Peyman Milanfar; Pascal Getreuer
international conference on image processing | 2018
Sungjoon Choi; John Isidoro; Pascal Getreuer; Peyman Milanfar
international conference on computational photography | 2018
Pascal Getreuer; Ignacio Garcia-Dorado; John Isidoro; Sungjoon Choi; Frank Ong; Peyman Milanfar
Bulletin of the American Physical Society | 2018
Yohai Bar-Sinai; Michael P. Brenner; Pascal Getreuer; Jason Hickey; Stephan Hoyer; Peyman Milanfar
arXiv: Graphics | 2017
Ignacio Garcia-Dorado; Pascal Getreuer; Madison Le; Robin Debreuil; Alex Kauffmann; Peyman Milanfar
Archive | 2017
Pascal Getreuer; Ignacio Garcia-Dorado; John Isidoro; Sungjoon Choi; Frank Ong; Peyman Milanfar
Archive | 2017
Nadav Aharony; Andrew Ames Bunner; Alan Lee Gardner; Mohammed Waleed Kadous; Brian Patrick Williams; Marc Stogaitis; Brian Duff; Pascal Getreuer; Zhentao Sun; Daniel Estrada Alva; Ami Patel; Benjamin Razon; Richard Daniel Webb; Tony Weber; Thomas Yuchin Chao; Ralph Jacob Cressman; Denise Ho; Liem Tran; Joseph A. Farfel