PAG-XVIII  Plant & Animal Genomes XVIII Conference

January 9-13, 2010
Town & Country Convention Center
San Diego, CA



W109 : Cacao Genome Sequencing


T. cacao Transcriptome Sequencing

Keithanne Mockaitis1 , David Kuhn2 , Mohammed Mohiuddin3 , Chinnappa Kodira3 , Dorrie Main4 , Niina Haiminen5 , Greg May6 , Chris Saski7 , Brian Scheffler8 , Raymond J. Schnell2 , Tim Harkins9 , Juan Carlos Motamayor10 , Howard Shapiro10

1  Indiana University Center for Genomics and Bioinformatics, Bloomington IN 47405 USA
2  USDA-ARS Subtropical Horticulture Research Station, Miami FL 33158 USA
3  454 Life Sciences, Branford CT 06405 USA
4  Washington State University, Pullman WA 99164 USA
5  IBM Watson Research Center, Yorktown Heights NY 10598 USA
6  National Center for Genome Resources, Santa Fe NM 87505 USA
7  Clemson University Genomics Institute, Clemson SC 29634 USA
8  USDA-ARS Genomics and Bioinformatics Research Unit, Stoneville MS 38776 USA
9  Roche Applied Science, Indianapolis IN 46250 USA
10  Mars, Inc., McLean VA 22101 USA

To compliment the T. cacao genome sequencing initiative and to build a reference set of expressed genes for functional studies, a broad and state-of-the-art approach to transcriptome sequencing is underway. Using newly optimized methods, transcriptome sequencing libraries were prepared from RNA of organ-specific collections of Matina1-6, the chosen cacao genome sequence reference. De novo assemblies were generated using a newly released version of the 454 Newbler Assembler for cDNA analysis.
Analysis of 1.5 M reads from 454 GS-FLX Titanium sequencing of a Matina1-6 leaf transcriptome library produced an assembly of 22,172 isotigs, representing individual transcripts of the reference tree. Earlier 159,320 cloned EST sequences derived from a variety of genotypes and tissue collections were assembled into 45,056 putative unigenes using CAP3 (Huang and Madan, 1999). Deep surveys of RNA samples from Matina1-6 and more highly polymorphic accessions were generated from Illumina GAII sequencing reads mapped to the above assemblies. Findings from these projects, additional 454 and Illumina mRNA sequencing results, as well as protein-coding functional annotation, will be presented to overview current collaborative efforts engaged in cacao gene discovery.