January 11-15, 2003
Town & Country Convention Center
San Diego, CA
Workshop: Bioinformatics - Genomic Computing Techniques and Applications
The EBI's Sequence Database Group provides several databases and tools for the analysing and handling of the enormous amount of publicly available sequence data. The EMBL Nucleotide Sequence Database, EMBL bank, constitutes the primary nucleotide sequence resource of Europe. It is produced by an international collaboration with GenBank (USA) and the DNA Database of Japan (DDBJ). The CDSs of the sequences in EMBL bank are deposited in TrEMBLnew. The data is cleaned from redundancy, annotated automatically and moved to the TrEMBL database. From here the entries go through a thorough manual scientific annotation and checking procedure before they are finally accepted into the high-quality, non-redundant protein sequence database, SWISS-PROT. InterPro is an integrated documentation resource for protein families, domains and functional sites. It is formed by the collaboration between SWISS-PROT and TrEMBL, and pattern databases Pfam, PRINTS, PROSITE, ProDom, SMART and TIGRFAMs. It is an excellent tool for predicting the structure and function of novel protein sequences. CluSTr is a database that offers automatic classification of SWISS-PROT and TrEMBL proteins into groups of related proteins. It is a useful resource for proteome and whole genome analysis. The Proteome Analysis database, compiled using InterPro, CluSTr and GO Slim, provides statistics and analyses of the predicted proteomes of fully sequenced organisms. IntEnz is the Integrated relational enzyme database supported by the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (NC-IUBMB). It will serve as the master copy of the Enzyme Nomenclature. All of these resources can be accessed at http://www.ebi.ac.uk