NAR Molecular Biology Database Collection entry number 775
Apweiler R.1, Bairoch A.2 and Wu C.H.3,4
1 EMBL Outstation - The European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
2Swiss Institute of Bioinformatics and Department of Structural Biology and Bioinformatics, Centre Medical Universitaire, 1 rue Michel Servet, 1211 Geneva 4, Switzerland
3Department of Biochemistry and Molecular Biology, and 4Protein Information Resource, Georgetown University Medical Center, Suite 1200, 3300 Whitehaven Street NW, Washington, DC 20007, USA

Database Description

The UniProt archive (UniParc), part of the UniProt databases, is an archival protein sequence collection from all major publicly accessible resources. New and revised protein sequences are added daily into UniParc while not deleting the previous versions. A UniParc sequence version is provided and incremented each time the underlying sequence changes, making it possible to observe the history of sequence changes in all source databases. To avoid redundancy, each unique sequence is assigned a unique identifier and is stored only once. The basic information stored with each UniParc entry is the identifier, the sequence, cyclic redundancy check number (CRC64), source database(s) with accession and version numbers, and a time stamp; all other information must be retrieved from the source databases. Each source database accession number is tagged with its status in that database, indicating if the sequence still exists or has been deleted at that source.


UniProt is mainly supported by the National Institutes of Health (NIH) grant 1 U01 HG02712-01. Minor support for the EBI's involvement in UniProt comes from the two European Union contracts BioBabel (QLRT-2000-00981) and TEMBLOR (QLRI-2001-00015) and from the NIH grant 1R01HGO2273-01. Swiss-Prot activities at the SIB are supported by the Swiss Federal Government through the Federal Office of Education and Science. PIR activities are also supported by the National Science Foundation (NSF) grants DBI-0138188 and ITR-0205470.


Wu, C., Bairoch, A., Apweiler, R., Natale, D.A., Barker, W.C., Boeckmann, B., Ferro, S., Gasteiger, E., Huang, H., Lopez, R., Magrane, M., Martin, M.J., Mazumder, R., O'Donovan, C., Redaschi, N. (2006). The Universal Protein Resource (UniProt): an expanding universe of protein information. Database issue. Nucleic Acids Res. 34: Database issue (in press).
2. Leinonen, R., Diez, F.G., Binns, D., Fleischmann, W., Lopez, R., Apweiler, R. (2004) UniProt Archive. Bioinformatics 2004 Mar 25. http://bioinformatics.oupjournals.org/cgi/content/abstract/bth191v1

Go to the abstract in the NAR 2006 Database Issue.
Oxford University Press is not responsible for the content of external internet sites