Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Laura Conde-Canencia is active.

Publication


Featured researches published by Laura Conde-Canencia.


IEEE Transactions on Circuits and Systems I-regular Papers | 2013

Design of a GF(64)-LDPC Decoder Based on the EMS Algorithm

Emmanuel Boutillon; Laura Conde-Canencia; Ali Chamas Al Ghouwayel

This paper presents the architecture, performance and implementation results of a serial GF(64)-LDPC decoder based on a reduced-complexity version of the Extended Min-Sum algorithm. The main contributions of this work correspond to the variable node processing, the codeword decision and the elementary check node processing. Post-synthesis area results show that the decoder area is less than 20% of a Virtex 4 FPGA for a decoding throughput of 2.95 Mbps. The implemented decoder presents performance at less than 0.7 dB from the Belief Propagation algorithm for different code lengths and rates. Moreover, the proposed architecture can be easily adapted to decode very high Galois Field orders, such as GF(4096) or higher, by slightly modifying a marginal part of the design.


global communications conference | 2009

Conflict Resolution by Matrix Reordering for DVB-T2 LDPC Decoders

Cédric Marchand; Jean-Baptiste Dore; Laura Conde-Canencia; Emmanuel Boutillon

Layered decoding is known to provide efficient and high-throughput implementation of LDPC decoders. However, the implementation of the layered architecture is not always straightforward because of the memory access conflicts in the a-posteriori information memory. In this paper, we focus our attention on a particular type of conflict introduced by the existence of multiple diagonal matrices in the DVB-T2 parity check matrix structure. We illustrate how the reordering of the matrix reduces the number of conflicts, at the cost of limiting the level of parallelism. We then propose a parity extending process to solve the remaining conflicts. Fixed point simulation results show coherent performance without modifying the layered architecture.


international symposium on turbo codes and iterative information processing | 2010

Simplified check node processing in nonbinary LDPC decoders

Emmanuel Boutillon; Laura Conde-Canencia

This paper deals with low-complexity algorithms for the check node processing in nonbinary LDPC decoders. After a review of the state-of-the-art, we focus on an original solution to significantly reduce the order of complexity of the Extended Min-Sum decoder at the elementary check node level. The main originality of the so-called Bubble Check algorithm is the two-dimensional strategy for the check node processing, which leads to a reduction of the number of comparisons. The simulation results obtained for the Bubble Check show that this complexity reduction does not introduce any performance loss and that it is even possible to further reduce the number of comparisons. This motivated the search of a simplified architecture and led to the L-Bubble Check, which is the main contribution of the paper. The implementation of a forward/backward check node as a systolic architecture of L-Bubble elementary checks is also described. Finally, some FPGA synthesis results of a whole GF(64)-LDPC decoder implementation are presented.


signal processing systems | 2009

Conflict resolution for pipelined layered LDPC decoders

Cédric Marchand; Jean-Baptiste Dore; Laura Conde-Canencia; Emmanuel Boutillon

Many of the current LDPC implementations of DVB-S2, T2 or WiMAX standard use the so-called layered architecture combined with pipeline. However, the pipeline process may introduce memory access conflicts. The resolution of these conflicts requires careful scheduling combined with dedicated hardware and/or idle cycle insertion. In this paper, based on the DVB-T2 example, we explain explicitly how the scheduling can solve most of the pipeline conflicts. The two contributions of the paper are 1) how to split the matrix to relax the pipeline conflicts at a cost of a reduced maximum available parallelism 2) how to project the problem of the research of an efficient scheduling to the well-known “Travelling Salesman Problem” and use a genetic algorithm to solve it.


signal processing systems | 2010

Architecture and finite precision optimization for layered LDPC decoders

Cédric Marchand; Laura Conde-Canencia; Emmanuel Boutillon

Layered decoding is known to provide efficient and high-throughput implementation of LDPC decoders. However, two main issues affect performance and area of practical implementations: quantization and memory. Quantization can strongly degrade performance and memory area can constitute up to 70% of the total area of the decoder implementation. This is the case of the DVB-S2,-T2 and -C2 decoders when considering long frames. This paper is then dedicated to the optimization of these decoders. We first focus on the reduction of the number of quantization bits and propose solutions based on the efficient saturation of the channel values, the extrinsic messages and the a posteriori probabilities (APP). We reduce from 6 to 5 the number of quantization bits for the channel and the extrinsic messages and from 8 to 6 the APPs, without introducing any performance loss. We then consider the optimization of the size of the extrinsic memory considering a multiple code rates decoder. The paper finally presents an optimized fixed-point architecture of a DVB-S2 layered decoder and its implementation on an FPGA device.


IEEE Transactions on Circuits and Systems Ii-express Briefs | 2017

A Novel Architecture For Elementary Check Node Processing In Non-Binary LDPC Decoders

Oussama Abassi; Laura Conde-Canencia; Ali Chamas Al Ghouwayel; Emmanuel Boutillon

This brief presents an efficient architecture design for elementary-check-node processing in nonbinary low-density parity-check decoders based on the extended min-sum algorithm. This architecture relies on a simplified version of the bubble check algorithm and is implemented by the means of first-in–first-out. The adoption of this new design at the check node level results in a high-rate low-cost full-pipelined processor. A proof-of-concept implementation of this processor shows that the proposed architecture halves the occupied the field-programmable gate array (FPGA) surface and doubles the maximum frequency without modifying the input/output behavior of the previous one.


international symposium on circuits and systems | 2015

Algorithm and implementation of an associative memory for oriented edge detection using improved clustered neural networks

Robin Danilo; Hooman Jarollahi; Vincent Gripon; Philippe Coussy; Laura Conde-Canencia; Warren J. Gross

Associative memories are capable of retrieving previously stored patterns given parts of them. This feature makes them good candidates for pattern detection in images. Clustered Neural Networks is a recently-introduced family of associative memories that allows a fast pattern retrieval when implemented in hardware. In this paper, we propose a new pattern retrieval algorithm that results in a dramatically lower error rate compared to that of the conventional approach when used in oriented edge detection process. This function plays an important role in image processing. Furthermore, we present the corresponding hardware architecture and implementation of the new approach in comparison with a conventional architecture in literature, and show that the proposed architecture does not significantly affect hardware complexity.


ACM Journal on Emerging Technologies in Computing Systems | 2015

Fully Binary Neural Network Model and Optimized Hardware Architectures for Associative Memories

Philippe Coussy; Cyrille Chavet; Hugues Nono Wouafo; Laura Conde-Canencia

Brain processes information through a complex hierarchical associative memory organization that is distributed across a complex neural network. The GBNN associative memory model has recently been proposed as a new class of recurrent clustered neural network that presents higher efficiency than the classical models. In this article, we propose computational simplifications and architectural optimizations of the original GBNN. This work leads to significant complexity and area reduction without affecting neither memorizing nor retrieving performance. The obtained results open new perspectives in the design of neuromorphic hardware to support large-scale general-purpose neural algorithms.


signal processing systems | 2013

High-speed conflict-free layered LDPC decoder for the DVB-S2, -T2 AND -C2 standards

Cédric Marchand; Laura Conde-Canencia; Emmanuel Boutillon

Layered decoding is known to provide efficient and high-throughput implementation of LDPC decoders. However, the implementation of layered architecture is not always straightforward because of memory update conflicts in the a posteriori information memory. In this paper, we focus our attention on a particular type of conflict that is due to multiple-diagonal sub-matrices in the DVB-S2, -T2 and -C2 parity-check matrices. We propose an original solution that combines repetition of the concerned layers and the write disable of the a posteriori information memory. The implementation of this solution on an FPGA-based LDPC decoder led to an average air throughput of 200 Mbit/s with a parallelism of 45 and a clock frequency of 300 MHz. Increasing the parallelism to 120 led to an average air throughput of 720 Mbit/s with a clock frequency of 400 MHz on CMOS technology.


great lakes symposium on vlsi | 2015

Restricted Clustered Neural Network for Storing Real Data

Robin Danilo; Philippe Coussy; Laura Conde-Canencia; Vincent Gripon; Warren J. Gross

Associative memories are an alternative to classical indexed memories that are capable of retrieving a message previously stored when an incomplete version of this message is presented. Recently a new model of associative memory based on binary neurons and binary links has been proposed. This model named Clustered Neural Network (CNN) offers large storage diversity (number of messages stored) and fast message retrieval when implemented in hardware. The performance of this model drops when the stored message distribution is non-uniform. In this paper, we enhance the CNN model to support non-uniform message distribution by adding features of Restricted Boltzmann Machines. In addition, we present a fully parallel hardware design of the model. The proposed implementation multiplies the performance (diversity) of Clustered Neural Networks by a factor of 3 with an increase of complexity of 40%.

Collaboration


Dive into the Laura Conde-Canencia's collaboration.

Top Co-Authors

Avatar

Emmanuel Boutillon

Centre national de la recherche scientifique

View shared research outputs
Top Co-Authors

Avatar

Cédric Marchand

Centre national de la recherche scientifique

View shared research outputs
Top Co-Authors

Avatar

Oussama Abassi

Centre national de la recherche scientifique

View shared research outputs
Top Co-Authors

Avatar

Ali Chamas Al Ghouwayel

Lebanese International University

View shared research outputs
Top Co-Authors

Avatar

Philippe Coussy

European University of Brittany

View shared research outputs
Top Co-Authors

Avatar

Hassan Harb

Centre national de la recherche scientifique

View shared research outputs
Top Co-Authors

Avatar

Yvan Eustache

Centre national de la recherche scientifique

View shared research outputs
Top Co-Authors

Avatar

Mohammad M. Mansour

American University of Beirut

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge