SCOP - Structural Classification Of Proteins
Database Description
The fundamental unit of classification in the SCOP database is the protein domain. A domain is defined as an evolutionary unit observed in nature either in isolation or in more than one context in multidomain proteins. The protein domains are classified hierarchically into families, superfamilies, folds and classes, whose meaning has been discussed before [1]. The first official SCOP release nine years ago comprised 3179 protein domains grouped into 498 families, 366 superfamilies and 279 folds [1]. The seven main classes in the latest release (1.65) contain 40452 domains organised into 2327 families, 1294 superfamilies and 800 folds. These domains correspond to 20619 entries in the Protein Data Bank (PDB) [4] and one literature reference to a structure with unpublished coordinates. Statistics of the current and previous releases, summaries and full histories of changes and other information are available from the SCOP website (http://scop.mrc-lmb.cam.ac.uk/scop/) together with parseable files encoding all SCOP data [2]. The sequences and PDB-style structures of SCOP domains are available from the ASTRAL compendium (http://astral.stanford.edu/) [5], and hidden Markov models of SCOP domains are available from the SUPERFAMILY database (http://supfam.mrc-lmb.cam.ac.uk/SUPERFAMILY/)[6].
References
2. Lo Conte, L., Brenner, S. E., Hubbard, T. J. P., Chothia, C., and Murzin, A.G. (2002) SCOP database in 2002: refinements accommodate structural genomics. Nucl. Acids. Res. 30, 264-267.
3. Andreeva, A., Howorth, D., Brenner, S. E., Hubbard, T. J. P., Chothia, C., and Murzin, A.G. (2004) SCOP database in 2004: refinements integrate structure and sequence family data. Nucl. Acids. Res. 32, [??-??].
4. Westbrook,J., Feng,Z., Jain,S., Bhat,T.N., Thanki,N., Ravichandran,V., Gilliland,G.L., Bluhm,W., Weissig,H., Greer,D.S., Bourne,P.E. and Berman,H.M. (2002) The Protein Data Bank: unifying the archive. Nucleic Acids Res. 30, 245-248.
5. Chandonia, J.M., Hon, G., Walker, N.S., Lo Conte, L., Koehl, P., Levitt, M., and Brenner, S.E. (2004) The ASTRAL compendium in 2004. Nucleic Acids. Res. 32, [xx-xx].
6. Madera, M. Vogel, C., Kummerfeld, S.K., Chothia, C. and Gough, J. (2004) The SUPERFAMILY database in 2004: additions and improvements. Nucleic Acids. Res. 32, [xx-xx].