Parallel Methods in Computational Genomics

Mr. Ananth Kalyanaraman
Wednesday, April 19, 2006
1:30PM - CSB-232

Abstract


Analyzing molecular sequence data is a key computational step in the process of making biological discovery at a molecular level. Due to rapid advancements in sequencing technologies, sequence databases have grown phenomenally in both size and complexity. This has rendered the task of analyzing them on a serial computer ineffective due to both time and memory constraints. The focus of this talk will be on the design and development of parallel algorithms and software for analyzing very large scale sequence data. I will describe a parallel clustering algorithm that we developed for the following two applications: (i) clustering Expressed Sequence Tags (ESTs) which are a type of DNA sequence data used in gene-related studies, and (ii) clustering genomic data for assembling the maize genome. The novelty of our approach lies in its space and time efficiency and its capability to exploit the vast computing power and memory easily available through distributed memory parallel computers. This work has significantly enhanced the problem size reach while also drastically reducing the time to solution.

Short Bio


Ananth Kalyanaraman is a doctoral candidate in Computer Engineering at Iowa State University. He received his M.S. in Computer Science from Iowa State University in 2002, and B.E. in Computer Science and Engineering from Visvesvaraya National Institute of Technology, Nagpur, India in 1998. His research interests include bioinformatics and computational biology, parallel computing, and string algorithms. Ananth is a recipient of IBM and Pioneer Hi-Bred graduate research fellowships. Recently, two of his publications were recognized with best paper awards (in conferences IPDPS'06 and CSB'05). He is a member of ACM, IEEE, and SIAM. More information can be found at his homepage: http://www.public.iastate.edu/~ananthk