Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Ee-Leng Tan is active.

Publication


Featured researches published by Ee-Leng Tan.


IEEE Transactions on Audio, Speech, and Language Processing | 2013

Robust SVD-Based Audio Watermarking Scheme With Differential Evolution Optimization

Bai Ying Lei; Ing Yann Soon; Ee-Leng Tan

In this paper, a robust audio watermarking scheme based on singular value decomposition (SVD) and differential evolution (DE) using dither modulation (DM) quantization algorithm is proposed. Two novel SVD-based algorithms, lifting wavelet transform (LWT)-discrete cosine transform (DCT)-SVD and discrete wavelet transform (DWT)-DCT-SVD, are developed for audio copyright protection. In our method, LWTDWT is first applied to decompose the host signal and obtain the corresponding approximate coefficients followed by DCT to take advantage of “energy compaction” property. SVD is further performed to acquire the singular values and enhance the robustness of the scheme. The adaptive DM quantization is adopted to quantize the singular values and embed the watermark. To withstand desynchronization attacks, synchronization code is inserted using audio statistical characteristics. Furthermore, the conflicting problem of robustness and imperceptibility is effectively resolved by the DE optimization. Simulation results demonstrate that both the LWT-DCT-SVD and DWT-DCT-SVD methods not only have good imperceptibility performance, but also resist general signal processing, hybrid and desynchronization attacks. Compared with the previous DWT-DCT, support vector regression (SVR)-DWT-DCT and DWT-SVD methods, our method obtains more robustness against the selected attacks.


Archive | 2011

Audio projection : directional sound and its applications in immersive communication.

Woon-Seng Gan; Ee-Leng Tan; Sen M. Kuo

The parametric loudspeaker provides an effective means of projecting sound in a highly directional manner without using large loudspeaker arrays to form sharp directional beams. It can be augmented with conventional loudspeakers to create a more immersive audio soundscape. Deployment of parametric loudspeakers in many public places where private messaging can make a difference in attracting attention, conveying messages without needing headphones, and creating private listening zones to reduce noise pollution. Digital signal processing plays a significant role in enhancing the aural quality of the parametric loudspeakers, and array processing can help to shape and steer the beam electronically. In addition, other signal processing techniques can also be applied to add more flexibility and improve the performance of parametric loudspeakers. These developments rely heavily on the latest techniques in acoustics and audio signal processing to overcome some of the current limitations in nonlinear acoustics modeling and ultrasonic transducers technology. A useful feature in sound projection is to realize a highaccuracy digital beamsteering capability in air using an array of parametric loudspeakers. An in-depth study into the theoretical model of wave steering capability in parametric array in air can provide some hints on how we can best steer the demodulated signal in an efficient manner. As seen from this article, digital signal processing provides the main engine to achieve directional sound projection, and new digital processing techniques will be devised to provide a better quality, controllable audio beaming, and efficient sound focusing device in the future.


IEEE Transactions on Audio, Speech, and Language Processing | 2014

Linear Estimation Based Primary-Ambient Extraction for Stereo Audio Signals

Jianjun He; Ee-Leng Tan; Woon-Seng Gan

Audio signals for moving pictures and video games are often linear combinations of primary and ambient components. In spatial audio analysis-synthesis, these mixed signals are usually decomposed into primary and ambient components to facilitate flexible spatial rendering and enhancement. Existing approaches such as principal component analysis (PCA) and least squares (LS) are widely used to perform this decomposition from stereo signals. However, the performance of these approaches in primary-ambient extraction (PAE) has not been well studied and no comparative analysis among the existing approaches has been carried out so far. In this paper, we generalize the existing approaches into a linear estimation framework. Under this framework, we propose a series of performance measures to identify the components that contribute to the extraction error. Based on the generalized linear estimation framework and our proposed performance measures, a comparative study and experimental testing of the linear estimation based PAE approaches including existing PCA, LS, and three proposed variant LS approaches are presented.


international conference on acoustics, speech, and signal processing | 2014

A Study on the Frequency-Domain Primary-Ambient Extraction for Stereo Audio Signals

Jianjun He; Woon-Seng Gan; Ee-Leng Tan

Primary-ambient extraction (PAE) has been playing an important role in spatial audio analysis-synthesis. Based on the spatial features, PAE decomposes a signal into primary and ambient components, which are then rendered separately. PAE is performed in subband domain for complex input signals having multiple point-like sound sources. However, the performance of PAE approaches and their key influences for such signals have not been well-studied so far. In this paper, we conducted a study on frequency-domain PAE using principal component analysis (PCA) in the case of multiple sources. We found that the partitioning of the frequency bins is very critical in PAE. Simulation results reveal that the proposed top-down adaptive partitioning method achieves superior performance as compared to the conventional partitioning methods.


international conference on acoustics, speech, and signal processing | 2013

Time-shifted principal component analysis based cue extraction for stereo audio signals

Jianjun He; Ee-Leng Tan; Woon-Seng Gan

In spatial audio analysis-synthesis, one of the key issues is to decompose a signal into cue and ambient components based on their spatial features. Principal component analysis (PCA) has been widely employed in cue extraction. However, the performance of PCA based cue extraction is highly dependent on the assumptions of the input signal model. One of these assumptions is the input signal contains highly correlated cue at zero lag. However, this assumption is often unmet. To overcome this problem, time shifted PCA is proposed in this paper, which involves time-shifting the input signal according to the estimated inter-channel time difference (ITD) of the input signal before cue extraction. From our simulation and listening tests results, the proposed method is found to be superior to the conventional PCA based cue extraction method.


international conference on acoustics, speech, and signal processing | 2012

A psychoacoustic bass enhancement system with improved transient and steady-state performance

Hao Mu; Woon-Seng Gan; Ee-Leng Tan

Psychoacoustic bass (low frequency) enhancement approach has received strong interests from consumer electronics manufacturers who demand good bass quality from small or flat panel speakers. Due to physical limitations and cost constraints, good bass performance is lacking from such speakers. A typical solution is based on the psychoacoustic phenomenon known as the “missing fundamental”, whereby human auditory system can perceive the fundamental frequency from its higher harmonics. This psychoacoustic bass enhancement system generally uses nonlinear devices (NLD) or phase vocoders (PV) to generate harmonics that enhance bass virtually. However, both approaches have their strengths and weaknesses. This paper presents a hybrid system, which combines these two approaches, to overcome their drawbacks and achieve good bass performance. The new approach first separates musical signals into transient and steady-state components, and applies NLD and PV on the separated signals. MUSHRA subjective test is used to evaluate the bass effect and audio quality of the proposed hybrid system against the NLD and PV methods.


IEEE Transactions on Audio, Speech, and Language Processing | 2011

A Comparative Analysis of Preprocessing Methods for the Parametric Loudspeaker Based on the Khokhlov–Zabolotskaya–Kuznetsov Equation for Speech Reproduction

Peifeng Ji; Ee-Leng Tan; Woon-Seng Gan; Jun Yang

Based on the Berktays farfield solution, various preprocessing methods were proposed to reduce the distortion of the highly directional audible signal in the parametric loudspeaker. However, the Berktays farfield solution is an approximated model of nonlinear acoustic propagation. To determine the effectiveness of these methods, we analyze various preprocessing methods theoretically for directional speech reproduction using the Khokhlov-Zabolotskaya-Kuznetsov (KZK) equation, which provides a more accurate model of nonlinear acoustic propagation. In order to reduce the distortion effectively in the parametric loudspeaker with these preprocessing methods, the initial sound pressure level of the carrier frequency is found to be less than 132 dB according to the KZK equation. Unlike the Berktay farfield solution that results in a +12 dB/octave gain slope, different gain slopes are derived using the KZK equation and appropriate equalizers are proposed to improve the frequency response of the parametric loudspeaker. The optimal preprocessing method for directional speech reproduction is established based on the KZK equation, which has a relatively flat frequency response of the desired speech signal and the best total harmonic distortion performance.


IEEE Transactions on Audio, Speech, and Language Processing | 2015

Primary-ambient extraction using ambient spectrum estimation for immersive spatial audio reproduction

Jianjun He; Woon-Seng Gan; Ee-Leng Tan

The diversity of todays playback systems requires a flexible, efficient, and immersive reproduction of sound scenes in digital media. Spatial audio reproduction based on primary-ambient extraction (PAE) fulfills this objective, where accurate extraction of primary and ambient components from sound mixtures in channel-based audio is crucial. Severe extraction error was found in existing PAE approaches when dealing with sound mixtures that contain a relatively strong ambient component, a commonly encountered case in the sound scenes of digital media. In this paper, we propose a novel ambient spectrum estimation (ASE) framework to improve the performance of PAE. The ASE framework exploits the equal magnitude of the uncorrelated ambient components in two channels of a stereo signal, and reformulates the PAE problem into the problem of estimating either ambient phase or magnitude. In particular, we take advantage of the sparse characteristic of the primary components to derive sparse solutions for ASE based PAE, together with an approximate solution that can significantly reduce the computational cost. Our objective and subjective experimental results demonstrate that the proposed ASE approaches significantly outperform existing approaches, especially when the ambient component is relatively strong.


IEEE Signal Processing Letters | 2015

Primary-Ambient Extraction Using Ambient Phase Estimation with a Sparsity Constraint

Jianjun He; Woon-Seng Gan; Ee-Leng Tan

Spatial audio reproduction addresses the growing commercial need to recreate an immersive listening experience of digital media content, such as movies and games. Primary-ambient extraction (PAE) is one of the key approaches to facilitate flexible and optimal rendering in spatial audio reproduction. Existing approaches, such as principal component analysis and time-frequency masking, often suffer from severe extraction error. This problem is more evident when the sound scene contains a relatively strong ambient component, which is frequently encountered in digital media. In this Letter, we propose a novel PAE approach by estimating the ambient phase with a sparsity constraint (APES). This approach exploits the equal magnitude of the uncorrelated ambient components in the two channels of a stereo signal and reformulates the PAE problem as an ambient phase estimation problem, which is then solved using the criterion that the primary component is sparse. Our experimental results demonstrate that the proposed approach significantly outperforms existing approaches, especially when the ambient component is relatively strong.


Journal of the Acoustical Society of America | 2013

Hybrid immersive three-dimensional sound reproduction system with steerable parametric loudspeakers

Chuang Shi; Ee-Leng Tan; Woon-Seng Gan

A loudspeaker must be both dispersive and directive to accurately reproduce spatial audio from digital media. To address this problem, an audio system that has a unique combination of conventional and parametric loudspeakers has previously been proposed and proved to be effective in producing an immersive 3D soundscape. However, this system has two drawbacks: (1) There is only one fixed sweet spot, and (2) only one listener within the sweet spot can enjoy the complete experience. Therefore, a hybrid 3D sound reproduction system combining conventional loudspeakers with a pair of steerable parametric loudspeakers is proposed in this paper. By using this new combination of conventional and steerable parametric loudspeakers, the sweet spot can be steered towards the listeners head position. Thus, the listener no longer needs to keep his head stationary while watching movies or playing games, which resulting in a more relaxing and pleasant experience. Furthermore, a dual-beamsteering method is proposed ...

Collaboration


Dive into the Ee-Leng Tan's collaboration.

Top Co-Authors

Avatar

Woon-Seng Gan

Nanyang Technological University

View shared research outputs
Top Co-Authors

Avatar

Jianjun He

Nanyang Technological University

View shared research outputs
Top Co-Authors

Avatar

Kaushik Sunder

Nanyang Technological University

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Peifeng Ji

Nanyang Technological University

View shared research outputs
Top Co-Authors

Avatar

Jun Yang

Chinese Academy of Sciences

View shared research outputs
Top Co-Authors

Avatar

Hao Mu

Nanyang Technological University

View shared research outputs
Top Co-Authors

Avatar

Bai Ying Lei

Nanyang Technological University

View shared research outputs
Top Co-Authors

Avatar

C.Y. Chia

Nanyang Technological University

View shared research outputs
Top Co-Authors

Avatar

Chiu-Hao Chen

Nanyang Technological University

View shared research outputs
Researchain Logo
Decentralizing Knowledge