PAG-IX: InterPro, CluSTr and Proteomes - The Integrated Resource of Protein Families, Domains and Sites

PAG-IX   Plant & Animal Genome IX Conference

Town & Country Hotel, San Diego, CA, January 13-17, 2001.


Computer: Demonstrations
C01_04.html

INTERPRO, CLUSTR AND PROTEOMES - THE INTEGRATED RESOURCE OF PROTEIN FAMILIES, DOMAINS AND SITES

WOLFGANG FLEISCHMANN, Rolf Apweiler

The European Bioinformatics Institute

InterPro is a collaboration of the databases SWISS-PROT + TrEMBL, PROSITE, PRINTS, Pfam and ProDom. As of October 2000, InterPro contains 6804 diagnostic protein signatures linked to 3205 manually annotated entries. With this amout of patterns, we gain consistently 42 ˜ 13 percent coverage for the completely sequence proteomes. This enabled us to provide a series of analysis pages for all the completely sequenced organisms at www.ebi.ac.uk/proteome/. Tightly coupled to InterPro is the new database Clusters of SWISS-PROT + TrEMBL proteins CluSTr. It uses the only algorithm that guarantees to find the most closely related protein, which is of course slower than the more commonly used BLAST and FastA tools. Nevertheless, we finished clustering all plant proteins and the completed eukaryotic proteomes. InterPro, CluSTr and the proteome analysis pages are freely available at the EBI.


Return to Previous Page or Intl-PAG Homepage