[Crib-list] Computational Research in Boston Seminar -- Friday, April 1, 2005

Shirley Entzminger daisymae at math.mit.edu
Wed Mar 30 10:24:14 EST 2005


COMPUTATIONAL RESEARCH IN BOSTON SEMINAR


DATE:  		Friday, April 1, 2005
LOCATION:  	Building 32, Room 124 (Stata Center)
TIME:  		12:30 PM

TOPIC:		FFTW: The "Fastest Fourier Transform in the West"

SPEAKER:	Steven G. Johnson
 		Massachusetts Institute of Technology

ABSTRACT:

Fast Fourier transforms (FFTs) lie at the heart of many problems in 
scientific computing and engineering, from spectral methods for partial 
differential equations, to digital signal processing, to large-number 
arithmetic for cryptography.  However, despite nearly 40 years of 
scrutiny, they have proved surprisingly resistant to efficient 
implementation -- FFTs never achieve the peak CPU speed except for very 
small problems, and their historical performance has doubled every ~30 
months over 40 years, considerably slower than Moore's 18-month doubling 
time followed by peak CPU throughput.

In this talk, we outline the historical progression of the understanding 
of fast Fourier transforms and their implementation, from Gauss in 1805 to 
Danielson and Lanczos in 1942 to the digital age that dawned with Cooley 
and Tukey in 1965, to our recent developments of self-optimizing, 
auto-generated FFTs at MIT.  For much of the FFT's history, the focus was 
on minimizing the number of arithmetic operations, from Cooley and Tukey's 
erroneous assertion that radix 3 was optimal to Winograd's demonstration 
that only O(N) multiplications were required for an FFT of length N. 
Unfortunately, modern architectures have divorced performance from simple 
heuristics like operation counts, and the past strategies no longer 
succeed; the question of the fastest algorithm, therefore, has had an 
ever-changing answer that is increasingly difficult to predict.  Instead, 
we believe that a more stable solution may be possible by changing the 
question: instead of asking what is the best algorithm, one should ask 
what is the smallest collection of simple algorithmic fragments whose 
compositions span the optimal algorithm on as many computer architectures 
as possible.  We describe an implementation of this strategy---combining 
self-optimization, code generation, and recursive cache-oblivious 
algorithms---in the context of FFTW, our free and widely used fast Fourier 
transform (FFT) library (www.fftw.org).


DATE:  		Friday, April 1, 2005

LOCATION:	Building 32, Room 124
 		(Stata Center)

TIME:		12:30 PM

Pizza and beverages will be provided.



More information about the CRiB-list mailing list