Godwin Enemali
University of Edinburgh
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Godwin Enemali.
international parallel and distributed processing symposium | 2017
Adewale Adetomi; Godwin Enemali; Tughrul Arslan
The number of IPs running concurrently on an FPGA has increased in recent years. Communication among these IPs has necessitated the introduction of the network on chip (NoC) for low-power, high-performance, and scalable on-chip networking. While NoCs are superior to traditional shared buses, there is an attendant resource overhead incurred by the NoC links, routers and network adapters. We present CELOC, a Clock-Enabled Low-Overhead Communication technique. It is a network access technique that uses the clock buffers of an FPGA as serial communication links in order to reduce the overhead contributed by the NoC links. This technique involves toggling the clock enables of clock buffers to transmit communication signals from one circuit to another. A demonstrator based on a Xilinx 7 series FPGA showed that a single link can achieve a bandwidth of 6.5 Gbps at 100 MHz.
international symposium on neural networks | 2015
Shiwei Wang; Thomas Jacob Koickal; Godwin Enemali; Luiz Carlos Gouveia; Lei Wang; Alister Hamilton
This paper presents the design and simulation results of a silicon cochlea system that has closely similar behavior as the real cochlea. A cochlea filter-bank based on the improved three-stage filter cascade structure is used to model the frequency decomposition function of the basilar membrane; a filter tuning block is designed to model the adaptive response of the cochlea; besides, an asynchronous event-triggered spike codec is employed as the system interface with bank-end spiking neural networks. As shown in the simulation results, the system has biologically faithful frequency response, impulse response, and active adaptation behavior; also the system outputs multiple band-pass channels of spikes from which the original sound input can be recovered. The proposed silicon cochlea is feasible for analog VLSI implementation so that it not only emulates the way that sounds are preprocessed in human ears but also is able match the compact physical size of a real cochlea.
international parallel and distributed processing symposium | 2017
Godwin Enemali; Adewale Adetomi; Tughrul Arslan
The use of reconfigurable chips such as FPGAs in embedded systems for many runtime applications is limited by large reconfiguration time. Techniques to circumvent this limitation relies on hardware task reuse which preserve certain circuits on the chip. However, the frequent addition and removal of circuits while preserving others on the chip will inevitably lead to fragmentation of its area, in an ongoing manner. In this paper, we present a fragmentation-aware replacement policy (FAReP) for reusing tasks on reconfigurable chips. FAReP aims not only at circumventing reconfiguration time, but also offering some defragmentation of the chip area at no extra reconfiguration cost. Our results show that FAReP leads to a reduced task rejection ratio, at least a 13% reduction in average unused chip area and up to 29% of reconfiguration time could be avoided compared to state of the art techniques based on task reuse.
field programmable custom computing machines | 2017
Adewale Adetomi; Godwin Enemali; Tughrul Arslan
The ability to relocate hardware tasks in FPGAs is an attractive task management technique, especially in reconfigurable operating systems. A method of relocation involves the modification of the location address of the task while it is being configured. However, the use of encryption to protect bitstreams requires that decryption is done on-chip before relocation. This usually results in a significant resource overhead, arising from the introduced decryption circuit. This paper presents Advance Task Address Loading (ATAL), a unique solution that involves loading the unencrypted task address ahead of the encrypted tasks configuration frame data. We have developed a software named Splixbit, which processes the bitstream offline, and a corresponding hardware configuration controller that configures the bitstream on the FPGA. Our results confirmed the possibility of avoiding on-chip dedicated decryption circuit in relocating encrypted partial bitstreams.
IEEE Transactions on Circuits and Systems Ii-express Briefs | 2018
Godwin Enemali; Adewale Adetomi; Gopalakrishnan Seetharaman; Tughrul Arslan
Runtime relocation of circuits on field-programmable gate arrays (FPGAs) has been proposed for achieving many desirable features including fault tolerance, defragmentation, and system load balancing. However, the changes in the architectural composition of FPGAs have made relocation more challenging mainly because FPGAs have become more heterogeneous. Previous and state-of-the-art circuit relocation systems on FPGAs have relied only on direct bitstream relocation which requires the source and destination resource layouts to be the same, as well as access to the design bitstream for manipulation. Hence, their efficiency on modern heterogeneous chips greatly reduces, and mostly cannot be applied to encrypted bitstreams of intellectual property blocks. In this brief, we present a circuit relocator which augments direct bitstream relocation with a functionality-based relocation scheme. We demonstrate the feasibility of the proposed technique using a CORDIC application and show that an average of over 2.6-fold increase in the number of relocations can be obtained compared to only direct bitstream relocation at the expense of a small memory overhead and manageable relocation time for this case study.
international symposium on circuits and systems | 2017
Godwin Enemali; Adewale Adetomi; Tughrul Arslan
Reconfigurable hardware such as FPGAs offer promising platform for the development of embedded autonomous systems. This is due to their unique combination of high performance and flexibility. However, state-of-the-art FPGAs have large reconfiguration time, which often leads to missed deadlines in real-time systems. They also suffer from considerable fragmentation during runtime placement, leading to poor chip area utilization. In this paper, we present a novel hardware-placement management circuit to address these limitations by offering circuit reuse and a low-cost defragmentation. Its implementation occupies only 1852 FPGA slices. Our results showed that over 70% of configuration time was circumvented compared to state-of-the-art techniques. In addition, up to 56% improvement in reuse efficiency was observed.
international new circuits and systems conference | 2017
Godwin Enemali; Shiwei Wang; Alister Hamilton
This paper presents the design and experimental results of an FPGA-based cochlea filter tuning control block that emulates the active nonlinear behaviour of outer hair cells (OHC) in the cochlea. The tuning block uses data measured from physiological experiments which are mapped into adaptive voltage bias of a CMOS cochlea-mimicking filter channel to tune it. It extracts signal level from inputs to the cochlea channel and feedforwards the corresponding voltage bias to the tuning stage of the filter channel. A level-dependent control of the filter gain and quality factor was observed in the measurement results. The tuning control block nonlinearly compressed wide range of audio input distinctively into about 20 dB, which is the active gain inherent in the sample cochlea channel used. Based on this work, we will be able to build an intelligent audio front-end for future machine hearing systems that is able to adaptively process sound and cope with wide dynamic sound input like the cochlea does.
international conference on emerging security technologies | 2017
Adewale Adetomi; Godwin Enemali; Tughrul Arslan
The trend in computing is towards the use of FPGAs to improve performance at reduced costs. An indication of this is the adoption of FPGAs for data centre and server application acceleration by notable technological giants like Microsoft, Amazon, and Baidu. The continued protection of Intellectual Properties (IPs) on the FPGA has thus become both more important and challenging. To facilitate IP security, FPGA vendors have provided bitstream authentication and encryption. However, advancements in FPGA programming technology have engendered a bitstream manipulation technique like partial bitstream relocation (PBR), which is promising in terms of reducing bitstream storage cost and facilitating adaptability. Meanwhile, encrypted bitstreams are not amenable to PBR. In this paper, we present three methods for performing encrypted PBR with varying overheads of resources and time. These methods ensure that PBR can be applied to bitstreams without losing the protection of IPs.
field programmable logic and applications | 2017
Adewale Adetomi; Godwin Enemali; Tughrul Arslan
The parallelism of hardware and the dynamic reconfigurability of FPGAs enable multiple hardware tasks to run concurrently, and also time-share resources by being swapped in and out of the device during runtime. More than ever before, these capabilities are being employed in systems with high-reliability requirements. To improve reliability, a method often used is circuit relocation. However, the static nature of conventional FPGA communication interconnects is a bane to flexible runtime relocation. This paper employs a novel network architecture to enable dynamic communication and thus improve the flexibility of circuit relocation. By using the clock infrastructure of the FPGA as the physical network links for tasks in a 4-node star network, we have shown that dynamic communication between relocatable circuits can be achieved without incurring any overheads of time and resources, save for only 32 slices used for the Network Interface.
adaptive hardware and systems | 2017
Godwin Enemali; Adewale Adetomi; Tughrul Arslan
The addition of hard blocks such as Block RAMs and Digital Signal Processors, have proven to be good means of improving various performance metrics in FPGAs. This however places stricter constraints on runtime relocation of hardware tasks and hence reduces their application in dealing with permanent faults. In this paper, we present a strategy that enhances the utilization of heterogeneous reconfigurable FPGAs by minimizing resources which are tied down as unusable areas. Our results show that the strategy leads to a 9.4% reduction in task rejections and improved placement quality compared to state of the art techniques. The complete implementation occupies only 1712 LUTs and 1645 Flip Flops on the Xilinxs xc7z100ffg900-2. Based on this strategy, more task relocations can be obtained which enhances the capacity of FPGAs to deal with permanent faults.