January 9-13, 2010
Town & Country Convention Center
San Diego, CA
F Alex Feltus1,2 , Barbara Blackmon1 , Stephen P Ficklin1 , Niina Haiminen3 , David N Kuhn4 , Greg D May5 , Juan-Carlos Motamayor6 , Chris A Saski1 , Ray J Schnell4 , Margaret E Staton1
The Clemson University Genomics Institute (CUGI) is performing a proof of concept pilot assembly of a subset of the minimum tiling path (MTP) derived from the cacao physical map that contains genes of interest to the cacao community. BAC clones have been selected for sequencing that cover this pilot 3 Mbp region of the MTP. MTP BAC clones are being sequenced using pooled BAC shotgun 454/Roche reads, pooled BAC shotgun Illumina/Solexa reads, and a traditional BAC-by-BAC Sanger sequenced shotgun library approach. Once sequencing is complete, assembly of the MTP will be performed on these reads with varying levels of coverage and read-type mixtures in order to determine the optimal read depth and flavors required for efficient assembly. In addition, we will use the BAC-end sequences distributed every ~20 Kbp within the 3 Mbp segment, genetic marker sequences, and RNA-seq clusters to scaffold and merge contigs. In parallel, CUGI is performing assembly simulations with in silico generated data that mimics 3 MBp of the cacao genome generated by IBM scientists to evaluate various assemblies from Sanger, 454, and Solexa data to determine the needed pool size, appropriate level of barcoding, error tolerance, and most efficient mix of read types. Information gathered in the successful assembly of the pilot region will guide the overall Theobroma cacao genome sequencing strategy.