ABOUT ME

I am a Ph.D student studying at School of Electircal Engineering and Computer Science, Washington State Unversity in Pullman, and I received my B.S and M.S degree on computer science from Hohai University (2004) and Tongji University (2007), respectively. Before coming to Washington State University, I worked as a research assistant in National Engineering and Technology HPC center, Tongji Branch, Shanghai. Currently, I am working with Dr. Ananth on Computational Biology and High Perfomance Computing.

- Curriculum Vitae:

Research

- Protein families identification in large scale metagenomic data

Metagenomics is the study of environmental microbial communities using state-of-the-art genomic tools. Recent advancements in high-throughput technologies have enabled the accumulation of large volumes of metagenomic data that was until a couple of years back was deemed impractical for generation. A primary bottleneck, however, is in the lack of scalable algorithms and open source software for large-scale data processing. We design and implement a novel parallel approach to identify protein families from large-scale metagenomic data. More...

- Organellar genome sequencing and targeted resequencing

Traditional application of the Polymerase Chain Reaction (PCR) technology has been limited to amplifying DNA targets of small sizes ~1Kbp). A recently developed approach called ASAP proposes the use of PCR to amplify arbitrarily long DNA targets by exploiting the sequence-level conservation expected from their evolutionarily related counterparts. This approach is useful for sequencing organellar genomes and in targeted resequencing of arbitrary genomic loci. More...

Publications

[2008] C. Wu, A. Kalyanaraman. An efficient parallel approach for identifying protein families in large-scale metagenomic data sets. Proc. ACM/IEEE Supercomputing Conference (SC'08), Austin, TX, November 15-21. (Acceptance Ratio: 21.2% = 59/277). paper: ; presentation:

[2007] C. Wu, A. Kalyanaraman, A. Dhingra. An efficient computational framework for amplifying arbitrarily long conserved DNA sequences. Proc. LSS Computational Systems Bioinformatics (CSB'07), San Diego, CA, August 13-17. (extended abstract). paper:

 More...

Recent Updates