Scott Fischaber | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Scott Fischaber is active.

Explore More

Publication

Featured researches published by Scott Fischaber.

Journal of Systems Architecture | 2007

Rapid implementation and optimisation of DSP systems on FPGA-centric heterogeneous platforms

John McAllister; Roger F. Woods; Scott Fischaber; Eoin Malins

The emergence of programmable logic devices as single-chip heterogeneous processing platforms for digital signal processing applications poses challenges concerning rapid implementation and high level optimisation of algorithms on these platforms. This paper describes Abhainn, a rapid implementation methodology and toolsuite for translating an algorithmic expression of the system to a working solution on FPGA-centric embedded platforms. Two particular focuses for Abhainn are the automated but configurable realisation of inter-processor communication fabrics, and the establishment of novel dedicated hardware component design methodologies allowing algorithm level transformation for system optimisation. This paper outlines the approaches employed in these areas and demonstrates their effectiveness on high end signal processing beamforming applications.

2010 VI Southern Programmable Logic Conference (SPL) | 2010

Acceleration of HMM-based speech recognition system by parallel FPGA Gaussian calculation

Richard Veitch; Louis-Marie Aubert; Roger F. Woods; Scott Fischaber

An FPGA-based custom core which computes the Gaussian calculation portion of a Hidden Markov Model (HMM) based speech recognition system, is presented. The work is part of the development of a custom embedded system which will provide speaker independend, large vocabulary continuos speech recognition and is currently presented as a hardware/software codesign. By de-coupling the Gaussian calculation from the backend search, calculation of Gaussian results is performed with minimal communication between backend search software and an FPGA based Gaussian core. Several implementations have been investigated in order to minimize memory bandwidth and FPGA resource requirements and are presented. The system has been implemented using an Alpha Data XCR-5T1, reconfigurable computer housing a Virtex 5 SX95T FPGA and has achieved better than real-time performance at 133MHz. The core has been tested and is capable of calculating a full set of Gaussian results from 3825 acoustic models in 5.3ms which coupled with a backend search of 5000 words has provided over 80% accuracy.

International Journal of Reconfigurable Computing | 2011

FPGA implementation of a pipelined Gaussian calculation for HMM-based large vocabulary speech recognition

Richard Veitch; Louis-Marie Aubert; Roger F. Woods; Scott Fischaber

A scalable large vocabulary, speaker independent speech recognition system is being developed using Hidden Markov Models (HMMs) for acoustic modeling and a Weighted Finite State Transducer (WFST) to compile sentence, word, and phoneme models. The system comprises a software backend search and an FPGA-based Gaussian calculation which are covered here. In this paper, we present an efficient pipelined design implemented both as an embedded peripheral and as a scalable, parallel hardware accelerator. Both architectures have been implemented on an Alpha Data XRC-5T1, reconfigurable computer housing a Virtex 5 SX95T FPGA. The core has been tested and is capable of calculating a full set of Gaussian results from 3825 acoustic models in 9.03 ms which coupled with a backend search of 5000 words has provided an accuracy of over 80%. Parallel implementations have been designed with up to 32 cores and have been successfully implemented with a clock frequency of 133 MHz.

IEEE Transactions on Computers | 2013

Optimization of Weighted Finite State Transducer for Speech Recognition

Louis-Marie Aubert; Roger F. Woods; Scott Fischaber; Richard Veitch

There is considerable interest in creating embedded, speech recognition hardware using the weighted finite state transducer (WFST) technique but there are performance and memory usage challenges. Two system optimization techniques are presented to address this; one approach improves token propagation by removing the WFST epsilon input arcs; another one-pass, adaptive pruning algorithm gives a dramatic reduction in active nodes to be computed. Results for memory and bandwidth are given for a 5,000 word vocabulary giving a better practical performance than conventional WFST; this is then exploited in an adaptive pruning algorithm that reduces the active nodes from 30,000 down to 4,000 with only a 2 percent sacrifice in speech recognition accuracy; these optimizations lead to a more simplified design with deterministic performance.

international conference on embedded computer systems architectures modeling and simulation | 2005

Rapid implementation and optimisation of DSP systems on SoPC heterogeneous platforms

John McAllister; Roger F. Woods; Darren Gerard Reilly; Scott Fischaber; R. Hasson

The emergence of programmable logic devices as processing platforms for digital signal processing applications poses challenges concerning rapid implementation and high level optimization of algorithms on these platforms. This paper describes Abhainn, a rapid implementation methodology and toolsuite for translating an algorithmic expression of the system to a working implementation on a heterogeneous multiprocessor/field programmable gate array platform, or a standalone system on programmable chip solution. Two particular focuses for Abhainn are the automated but configurable realisation of inter-processor communuication fabrics, and the establishment of novel dedicated hardware component design methodologies allowing algorithm level transformation for system optimization. This paper outlines the approaches employed in both these particular instances.

international conference / workshop on embedded computer systems: architectures, modeling and simulation | 2008

Memory-Centric Hardware Synthesis from Dataflow Models

Scott Fischaber; John McAllister; Roger F. Woods

Generation of hardware architectures directly from dataflow representations is increasingly being considered as research moves toward system level design methodologies. Creation of networks of IP cores to implement actor functionality is a common approach to the problem, but often the memory sub-systems produced using these techniques are inefficiently utilised. This paper explores some of the issues in terms of memory organisation and accesses when developing systems from these high level representations. Using a template matching design study, challenges such as modelling memory reuse and minimising buffer requirements are examined, yielding results with significantly less memory requirements and costly off-chip memory accesses.

embedded systems for real-time multimedia | 2006

Muir Hardware Synthesis for Multimedia Applications

Scott Fischaber; John McAllister; Roger F. Woods; Eoin Malins

Muir hardware synthesis process used in the Abhainn design flow for optimisation and implementation of applications on FPGA-centric platforms and specifically its use for multimedia applications. Demonstrated are transformations available at the algorithmic level for an MPEG-2 encoder to reduce the memory usage and increase the throughput of the system. Also examined is the effect manipulations of the algorithm have on the final hardware architecture. Implementation of the encoder shows that results derived from a system level design approach are comparable with those derived from hand-coded implementations

international conference on neural networks and signal processing | 2008

Design methodology for a block motion estimation IP core

Richard Turner; Roger F. Woods; Scott Fischaber; John McAllister

The paper describes the design of a parameterizable core for motion estimation. Using a high level strategy targeted at memory requirements, a core has been developed for H.261, H.263 and MPEG-2 video compression standards which works efficiently across a range of search mechanisms, window sizes and error metrics. The core can perform motion estimation at up to 150 frame/s and performs well across the parameter range, demonstrating the design quality.

field-programmable technology | 2005

FPGA core network implementation and optimization: a case study

Scott Fischaber; R. Hasson; John McAllister; Roger F. Woods

It is becoming increasingly important to realize and optimize hardware functionality from high level descriptions. This paper explores the Muir approach, which integrates core based reuse and optimization principles in a heterogeneous system context. This is discussed in the context of a fixed beamformer system to highlight the rapid implementation power of such an approach

signal processing systems | 2007