HomeProjectsPeoplePublicatons
Search:
   
 

 

Design and Applications of a Reconfigurable Computing System for High Performance Digital Signal Processing

Chen Chang, 2005 Ph.D. Thesis

Advisor:  Robert W. Brodersen

Abstract:

The FPGA (Field Programmable Gate Array) strikes a middle ground in cost/power/performance between ASIC (Application Specific Integrated Circuit) and DSP (Digital Signal Processor), while providing field programmability via reconfiguration of internal interconnect and logic functions. However, most current FPGA-based solutions are highly specialized hardware systems designed for narrowly focused dedicated applications. These systems are typically programmed with low-level hardware description languages (HDL) inherited from ASIC design methodologies. The lack of high-level, user-friendly programming models and modular scalable reusable hardware system architectures is the key factor impeding wider adoption of large-scale FPGA-based system.

In this thesis, we have demonstrated the practical application of a new generation of highend reconfigurable computer (HERC) systems that were built with FPGAs as the sole processing element. Through the design and construction of two generations of the Berkeley Emulation Engine (BEE) systems, a high-level, user-friendly software programming model based on synchronous data flow was developed to provide an efficient and productive design methodology for a wide range of high-performance DSP applications and emulation of wireless communications systems.

To date, multi-antenna radio astronomy signal processing has provided the largest-scale application of the BEE2 system. We have successfully demonstrated an 800MHz, billionchannel spectrometer using the BEE2 system on a single antenna, as well as a four-antenna 150MHz imaging correlator. Each BEE2 module provided sustained performance up to 1 trillion integer operations per second, and the BEE2 served as the basic building block for larger-scale systems with one to hundreds of modules.

In terms of computational throughput per chip, the FPGAs in the BEE2 system outperform a 720MHz (130nm) DSP by a factor of 10 to 34, a 1GHz (90nm) DSP by a factor of 7 to 25, and the latest Pentium-4 by a factor of 4 to 13. In terms of power efficiency, the XC2VP70 FPGA delivers 72% to 106% more throughput on 16-bit operations compared to DSPs, and more than 1100% more throughput on 4-bit operations. When compared to microprocessors, the FPGA proved over 100 times more power-efficient. Similarly, the compute throughput per unit chip cost of FPGAs is 20% to 307% more than the 1 GHz DSP, and 50% to 505% more than the3.8GHz Pentium-4 processor.

Combining DSP-like ease of programming, ASIC-like computational efficiency, and the scalability of computer clusters, the BEE2 system is becoming a preferred solution for a variety of high-performance digital signal processing applications. With continued evolution of hardware architecture and improvements in alternative programming models, future generations of HERC systems will serve as general-purpose computing platforms for many additional application domains, including scientific computing and computational biology.