Publication


Featured research published by Vijay Kumar Peddinti.


Journal of the Acoustical Society of America | 2013

Synchrony capture filterbank: Auditory-inspired signal processing for tracking individual frequency components in speech

Ramdas Kumaresan; Vijay Kumar Peddinti; Peter Cariani

A processing scheme for speech signals is proposed that emulates synchrony capture in the auditory nerve. The role of stimulus-locked spike timing is important for representation of stimulus periodicity, low frequency spectrum, and spatial location. In synchrony capture, dominant single frequency components in each frequency region impress their time structures on temporal firing patterns of auditory nerve fibers with nearby characteristic frequencies (CFs). At low frequencies, for voiced sounds, synchrony capture divides the nerve into discrete CF territories associated with individual harmonics. An adaptive, synchrony capture filterbank (SCFB) consisting of a fixed array of traditional, passive linear (gammatone) filters cascaded with a bank of adaptively tunable, bandpass filter triplets is proposed. Differences in triplet output envelopes steer triplet center frequencies via voltage controlled oscillators (VCOs). The SCFB exhibits some cochlea-like responses, such as two-tone suppression and distortion products, and possesses many desirable properties for processing speech, music, and natural sounds. Strong signal components dominate relatively greater numbers of filter channels, thereby yielding robust encodings of relative component intensities. The VCOs precisely lock onto harmonics most important for formant tracking, pitch perception, and sound separation.


International Conference on Acoustics, Speech, and Signal Processing | 2012

Synchrony capture filterbank (SCFB): An auditory periphery inspired method for tracking sinusoids

Ramdas Kumaresan; Vijay Kumar Peddinti; Peter Cariani

We propose a novel algorithm for tracking multiple sinusoidal signals that is motivated by neural coding in the mammalian peripheral auditory system. A striking feature of auditory nerve activity is the phenomenon of “synchrony capture,” whereby the most intense frequency components in the stimulus dominate the temporal firing patterns of whole subpopulations of auditory nerve fibers (ANFs). A novel adaptive filterbank structure that emulates key aspects of synchrony capture is presented. The proposed filterbank has two components: a fixed bank of traditional gammatone (or equivalent) filters that are cascaded with a bank of adaptively-tunable bandpass filter triplets. The bandpass filters are tuned by using a voltage controlled oscillator (VCO) whose frequency is steered by a frequency discriminator loop (FDL). The resulting filterbank is used to process synthetic signals and speech. It is shown that the VCOs can track the low frequency harmonics in speech that evoke voice pitch at their fundamental (F0). For vowels, the VCOs faithfully track the strongest harmonic present in each formant region.
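The fixed front end mentioned in this abstract, a bank of gammatone filters, can be illustrated with a minimal sketch. It assumes the textbook fourth-order gammatone impulse response and the Glasberg-Moore ERB approximation; the function names, the FIR truncation length, and the channel centre frequencies are illustrative choices, not taken from the paper.

```python
import numpy as np

def erb(fc):
    """Equivalent rectangular bandwidth in Hz (Glasberg-Moore approximation)."""
    return 24.7 * (4.37 * fc / 1000.0 + 1.0)

def gammatone_ir(fc, fs, duration=0.05, order=4):
    """Truncated impulse response of a gammatone filter centred at fc (Hz)."""
    t = np.arange(int(duration * fs)) / fs
    b = 1.019 * erb(fc)  # bandwidth parameter commonly paired with order 4
    g = t ** (order - 1) * np.exp(-2 * np.pi * b * t) * np.cos(2 * np.pi * fc * t)
    return g / np.sqrt(np.sum(g ** 2))  # unit-energy normalisation (assumption)

def gammatone_bank(x, fs, centre_freqs):
    """Filter x through one gammatone channel per centre frequency."""
    return np.array(
        [np.convolve(x, gammatone_ir(fc, fs))[: len(x)] for fc in centre_freqs]
    )

# Hypothetical test signal: a single 1 kHz tone.
fs = 16000
t = np.arange(int(0.2 * fs)) / fs
x = np.sin(2 * np.pi * 1000 * t)
cfs = np.array([250.0, 500.0, 1000.0, 2000.0, 4000.0])

outputs = gammatone_bank(x, fs, cfs)
rms = np.sqrt(np.mean(outputs ** 2, axis=1))
best_cf = cfs[np.argmax(rms)]  # channel with the strongest response
print(best_cf)
```

In the scheme described above, each such fixed channel would then feed an adaptively tuned bandpass filter triplet; that second stage is not shown here.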


International Conference on Acoustics, Speech, and Signal Processing | 2011

Multiple pitch identification using cochlear-like frequency capture and harmonic grouping

Ramdas Kumaresan; Vijay Kumar Peddinti; Peter Cariani

This work addresses the problem of identifying multiple fundamental frequencies in an acoustic signal. An auditory-inspired peripheral signal processing model is proposed that functions in a manner more like a bank of FM receivers rather than a traditional filterbank. Such receivers lock on to a strong signal (synchrony capture, frequency capture) even in the presence of nearby only slightly weaker signal components. Once the individual signal components are resolved, the model subjects them to an instantaneous nonlinearity and then performs harmonic grouping by cross correlating the isolated components. After the harmonically-related components are grouped, their pitches are computed using a standard summary autocorrelation approach.
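The final step of this abstract, pitch computation by summary autocorrelation, can be sketched as follows. This is a generic illustration and not the authors' implementation: it assumes the components have already been resolved into separate channels, uses half-wave rectification as the instantaneous nonlinearity, and omits the cross-correlation grouping stage. The function names and the search range of 60 to 500 Hz are assumptions.

```python
import numpy as np

def summary_autocorrelation(channels, max_lag):
    """Sum the autocorrelations of all channels for lags 0 .. max_lag - 1."""
    sacf = np.zeros(max_lag)
    for ch in channels:
        ch = np.maximum(ch, 0.0)   # half-wave rectification (assumed nonlinearity)
        ch = ch - ch.mean()        # remove the DC offset rectification introduces
        full = np.correlate(ch, ch, mode="full")
        zero_lag = len(ch) - 1
        sacf += full[zero_lag: zero_lag + max_lag]
    return sacf

def estimate_f0(channels, fs, f_min=60.0, f_max=500.0):
    """Pick the lag with the largest summary autocorrelation in the search range."""
    max_lag = int(fs / f_min) + 1
    min_lag = int(fs / f_max)
    sacf = summary_autocorrelation(channels, max_lag)
    lag = min_lag + np.argmax(sacf[min_lag:max_lag])
    return fs / lag

# Hypothetical input: three resolved harmonics of a 200 Hz fundamental.
fs = 8000
t = np.arange(int(0.1 * fs)) / fs
channels = [np.sin(2 * np.pi * k * 200.0 * t) for k in (1, 2, 3)]

f0_est = estimate_f0(channels, fs)
print(f0_est)
```

All three harmonics share the 5 ms period of the fundamental, so their autocorrelations reinforce one another at that lag even though the individual channels also peak at shorter lags.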


Signal Processing | 2016

Bandpass phase shifter and analytic signal generator

Vijay Kumar Peddinti; Ramdas Kumaresan

In this note, a novel tunable bandpass filter/phase shifter implementation (with a Hilbert transformer as a special case) is proposed. The filter can also be used to synthesize a bandpass analytic signal from a real-valued signal. The novelty is the simple, yet elegant implementation that exploits the even and odd symmetry of the in-phase and quadrature carrier modulation.

Highlights: Alternate tunable bandpass phase shifter implementation is proposed. Exploits even and odd symmetry of in-phase and quadrature carrier modulation. Novel, simple yet elegant, architecture to shift the phase by any arbitrary angle. Hilbert transform computation of a bandpass signal as a specific case is presented.
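One generic way to shift the phase of a bandpass signal with in-phase and quadrature carriers is to demodulate to baseband, lowpass filter, and remodulate with a phase offset. The sketch below illustrates that textbook idea only; it is not claimed to be the architecture proposed in the note. The brick-wall FFT lowpass, the function names, and all parameter values are assumptions chosen to keep the example short.

```python
import numpy as np

def ideal_lowpass(x, fs, cutoff):
    """Brick-wall lowpass via the FFT (an idealisation used for brevity)."""
    X = np.fft.rfft(x)
    freqs = np.fft.rfftfreq(len(x), d=1.0 / fs)
    X[freqs > cutoff] = 0.0
    return np.fft.irfft(X, n=len(x))

def phase_shift_bandpass(x, fs, fc, theta, cutoff):
    """Shift the phase of a bandpass signal near fc by theta radians."""
    n = np.arange(len(x))
    w = 2 * np.pi * fc * n / fs
    i_bb = ideal_lowpass(x * np.cos(w), fs, cutoff)    # in-phase baseband part
    q_bb = ideal_lowpass(-x * np.sin(w), fs, cutoff)   # quadrature baseband part
    # Remodulate with carriers advanced by theta.
    return 2.0 * (i_bb * np.cos(w + theta) - q_bb * np.sin(w + theta))

# Hypothetical test: a 1 kHz tone, carrier deliberately detuned to 1.1 kHz.
fs = 16000
n = np.arange(1600)
x = np.cos(2 * np.pi * 1000 * n / fs + 0.3)

# theta = -pi/2 gives the Hilbert transform of the bandpass signal.
x_hilbert = phase_shift_bandpass(x, fs, fc=1100.0, theta=-np.pi / 2, cutoff=500.0)
analytic = x + 1j * x_hilbert

expected = np.sin(2 * np.pi * 1000 * n / fs + 0.3)
max_error = np.max(np.abs(x_hilbert - expected))
print(max_error)
```

Writing the input as a(t) cos(w_c t + phi(t)), the two lowpass outputs are (a/2) cos(phi) and (a/2) sin(phi), so remodulation yields a(t) cos(w_c t + phi(t) + theta). The carrier does not need to match the signal frequency exactly, as long as the baseband components fall below the cutoff.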


International Conference on Acoustics, Speech, and Signal Processing | 2014

Auditory-inspired pitch extraction using a synchrony capture filterbank and phase alignment

Ramdas Kumaresan; Vijay Kumar Peddinti; Peter Cariani

The question of how harmonic sounds produce strong, low pitches at their fundamental frequencies, f0s, has been of theoretical and practical interest to scientists and engineers for many decades. Currently, the best auditory models for f0 pitch, e.g., [1], are based on bandpass filtering (cochlear mechanics), half-wave rectification and low-pass filtering (hair cell transduction and synaptic transmission), channel autocorrelations (all-order interspike interval statistics) aggregated into a summary autocorrelation, and an analysis that determines the most prevalent interspike intervals. As a possible alternative to autocorrelation computations, we propose a model that uses an adaptive Synchrony Capture Filterbank (SCFB) in which groups of filters or channels in a filterbank neighborhood are driven exclusively (captured) by the dominant frequency components that are closest to them. The channel outputs are then adaptively phase aligned with respect to a common time reference to compute a Summary Phase Aligned Function (SPAF), aggregated across all channels, from which f0 can be easily extracted.


Journal of the Acoustical Society of America | 2014

Auditory-inspired pitch extraction using a synchrony capture filterbank for speech signals

Ramdas Kumaresan; Vijay Kumar Peddinti; Peter Cariani

The question of how harmonic sounds produce strong, low pitches at their fundamental frequencies, F0s, has been of theoretical and practical interest to scientists and engineers for many decades. Currently, the best auditory models for F0 pitch [e.g., Meddis and Hewitt, J. Acoust. Soc. Am. 89(6), 2866–2894 (1991)] are based on bandpass filtering (cochlear mechanics), half-wave rectification and low-pass filtering (hair cell transduction and synaptic transmission), channel autocorrelations (all-order interspike interval statistics) aggregated into a summary autocorrelation, and an analysis that determines the most prevalent interspike intervals. As a possible alternative to autocorrelation computations, we propose a model that uses an adaptive Synchrony Capture Filterbank (SCFB) in which groups of filter channels in a spectral neighborhood are driven exclusively (captured) by the dominant frequency components that are closest to them. The channel outputs (for frequencies below 1500 Hz) are then ada...


Journal of the Acoustical Society of America | 2014

Auditory-inspired pitch extraction using a synchrony capture filterbank

Ramdas Kumaresan; Vijay Kumar Peddinti; Peter Cariani

The question of how harmonic sounds in speech and music produce strong, low pitches at their fundamental frequencies, F0s, has been of theoretical and practical interest to scientists and engineers for many decades. Currently, the best auditory models for F0 pitch (e.g., Meddis & Hewitt, 1991) are based on bandpass filtering (cochlear mechanics), half-wave rectification and low-pass filtering (hair cell transduction, synaptic transmission), channel autocorrelations (all-order interspike interval distributions) aggregated into a summary autocorrelation, followed by an analysis that determines the most prevalent interspike intervals. As a possible alternative to explicit autocorrelation computations, we propose a model that uses an adaptive Synchrony Capture Filterbank (SCFB) in which channels in a filterbank neighborhood are driven exclusively (captured) by the dominant frequency components closest to them. Channel outputs are then adaptively phase aligned with respect to a common time reference...


Journal of the Acoustical Society of America | 2011

Synchrony‐capture filterbank: A novel cochlear signal processing model.

Ramdas Kumaresan; Vijay Kumar Peddinti; Peter Cariani

Examination of the representation of low harmonics of complex sounds in the auditory nerve shows a striking feature known as “synchrony capture.” Fibers of an entire cochlear region are driven almost exclusively by one local, dominant harmonic component [Delgutte and Kiang, 1984]. Sharp boundaries characteristic of such synchrony capture are also seen between the different CF regions driven by different dominant, formant‐region harmonics for multiformant vowels. Based on this observation, we propose a model for peripheral processing that is not just a filter bank but behaves more like a set of signal-adaptive receivers. We call this model the synchrony capture filterbank (SCFB). The SCFB consists of a traditional gammatone filterbank, the individual filters of which are each cascaded with a bandpass filter (BPF) triplet. The BPF triplet is an adaptive tone follower and consists of three overlapping bandpass filters whose center frequencies can be changed using feedback. The amplitudes at the outputs of the bandpass filters are fed back to tune the BPF triplet such that it centers itself on the dominant tone in the input signal. The SCFB exhibits the synchrony capture behavior that is observed in real auditory nerve fibers.
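The tone-following behavior of a bandpass filter triplet can be sketched in a few lines. This is a simplified illustration under stated assumptions, not the filter design from the abstract: each bandpass filter is modeled as a one-pole complex resonator, the steering signal is the normalized difference between the upper and lower output envelopes, and the spacing, pole radius, and step size are hypothetical values.

```python
import numpy as np

def triplet_tone_follower(x, fs, fc_init, spacing=50.0, r=0.98, mu=0.2):
    """Track the dominant tone in x with three resonators at fc - spacing, fc, fc + spacing."""
    offsets = np.array([-spacing, 0.0, spacing])  # lower, centre, upper filters
    state = np.zeros(3, dtype=complex)            # one complex state per resonator
    fc = fc_init
    fc_track = np.empty(len(x))
    centre_out = np.empty(len(x))
    for n, sample in enumerate(x):
        # Retune all three poles to the current centre-frequency estimate.
        poles = r * np.exp(2j * np.pi * (fc + offsets) / fs)
        state = poles * state + (1.0 - r) * sample
        env = np.abs(state)                       # output envelopes
        # A larger upper envelope means the tone lies above fc, and vice versa.
        error = (env[2] - env[0]) / (env[2] + env[0] + 1e-12)
        fc += mu * error                          # feedback steers the triplet
        fc_track[n] = fc
        centre_out[n] = 2.0 * state[1].real       # output of the centre filter
    return fc_track, centre_out

# Hypothetical test: a 700 Hz tone, triplet initially centred at 600 Hz.
fs = 8000
t = np.arange(fs) / fs
x = np.cos(2 * np.pi * 700.0 * t)

fc_track, y = triplet_tone_follower(x, fs, fc_init=600.0)
final_fc = fc_track[-500:].mean()
print(final_fc)
```

When the triplet is centered on the tone, the two outer filters respond equally, the error averages to zero, and the center frequency stops moving. The step size `mu` trades tracking speed against jitter in the frequency estimate.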


Journal of the Acoustical Society of America | 2010

Spatiotemporal coding of signals in the auditory periphery.

Ramdas Kumaresan; Vijay Kumar Peddinti; Peter Cariani

Signal representation in the cochlea is often thought to involve either rate‐place profiles or purely temporal, interspike interval codes. Spatio‐temporal coding strategies based on phase‐locking, cochlear delays, and coincidence detectors have also been proposed [Loeb et al., Biol. Cybern. (1983); K. & Shamma, J. Acoust. Soc. Am. 107 (2000); and Carney et al., Acoustica 88, 334–337 (2002)]. In this view, spatiotemporal patterns of spikes locked to relative phases of the traveling wave at specific cochlear places at a given time can convey information about a tone. We propose a general mathematical basis for using such spatial phase/amplitude patterns along the frequency axis to represent an arbitrary (approximately) time and bandwidth‐limited signal. We posit that the spatial pattern of phases and amplitudes corresponds to locations at which (real and/or imaginary parts of) the Fourier transform of the signal crosses certain levels (e.g., zero level). Given these locations, we show that we can accurately...


Archive | 2015

Synchrony capture filterbank (SCFB): Auditory-inspired signal processing for frequency tracking

Vijay Kumar Peddinti

Collaboration


Dive into Vijay Kumar Peddinti's collaborations.

Top Co-Authors

Ramdas Kumaresan

University of Rhode Island
