Workshop: Databases, Gene Systematics, and Nomenclature
W22_05.html
To fully explore the potential of microarray expression data, that data must be seamlessly integrated with other relevant information. In the Arabidopsis Functional Genomics Consortium (AFGC) microarray project, our approach has been to produce cDNA microarrays using publicly available cDNA clones. For the majority of these clones,only partial EST sequence data is available. These EST sequences are therefore the primary link between the expression data and other information, such as genomic position and gene identity. I will discuss our strategy for organising the sequence data by clustering and assembly of ESTs and mRNAs and mapping of assemblies to genomic sequence and how integrated large-scale expression and sequence data can be used for discovery.