NAR Molecular Biology Database Collection entry number 218
Letunic, Ivica; Doerks, Tobias; Bork, Peer
1EMBL, Meyerhofstrasse 1, 69012 Heidelberg, Germany

Database Description

SMART (Simple Modular Architecture Research Tool, is a web-based resource used for the identification and annotation of protein domains and the analysis of domain architectures [1]. The current release has added more than 200 original hand-curated domain models. This brings the total to more than 600 domain families represented among nuclear, signalling and extracellular proteins. Extensive annotation for each domain family is available, providing information on function, subcellular localization, phyletic distribution and tertiary structure. Annotation now includes links to OMIM in cases where a human disease is associated with one or more mutations in a particular domain. A non-redundant sequence database is searched weekly for occurrences of SMART domains and several intrinsic features (transmembrane regions, coiled coils, signal peptides and internal repeats), and results are stored in a relational database. We have included new analysis methods and updated others. Internal protein repeats and transmembrane regions are detected using Prospero [2] and TMHMM2 [3], respectively. Improvements in the web interface now allow easy searches for proteins that contain user-defined combinations of domains or intrinsic features within a specified phyletic range. New advanced queries provide direct access to the SMART database using SQL, so users are no longer restricted to using simple AND-NOT logic. Protein lists are easily displayed or retrieved in FASTA format (with optional filtering for the domain of interest). Schematic representations of proteins use dynamically generated single images, which enables easy inclusion of SMART output in users' documents. SMART now provides multiple sequence alignments coloured by consensus, thereby highlighting patterns of residue conservation. SMART is now mirrored at In conclusion, SMART provides a unique combination of powerful and accurate analytical tools with simple visualization of results.


1. Schultz, J., et al., SMART: a web-based tool for the study of genetically mobile domains. Nucleic Acids Res, 2000. 28(1): p. 231-4.
2. Mott, R., Accurate formula for P-values of gapped local sequence and profile alignments. J Mol Biol, 2000. 300(3): p. 649-59.
3. Krogh, A., et al., Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J Mol Biol, 2001. 305(3): p. 567-80.
4. Goodstadt, L. and C.P. Ponting, CHROMA: consensus-based colouring of multiple alignments for publication. Bioinformatics, 2001, 17(9):845-846.

Go to the article in the NAR Database issue.
Oxford University Press is not responsible for the content of external internet sites