Skip Navigation

ModBase


NAR Molecular Biology Database Collection entry number 331
Pieper, U., Eswar, N., Stuart, A.C., Braberg, H., Madhusudhan, M.S., Davis, F., Rossi, A., Marti-Renom, M.A., Sali, A.
Departments of Biopharmaceutical Sciences and Pharmaceutical Chemistry, and California Institute for Quantitative Biomedical Research, University of California, San Francisco, 600 16th St, San Francisco, CA 94143-2240

Database Description

MODBASE (http://salilab.org/modbase) is a relational database of annotated comparative protein structure models for all available protein sequences matched to at least one known protein structure. The models are calculated by MODPIPE, an automated modeling pipeline that relies on PSI-BLAST, IMPALA and MODELLER. MODBASE uses the MySQL relational database management system for flexible and efficient querying. It is updated regularly to reflect the growth of the protein sequence and structure databases, as well as improvements in the software for calculating the models. For ease of access, MODBASE is organized into different datasets. The largest dataset contains models for domains in 415,937 out of 733,239 unique protein sequences in the complete TrEMBL database (March 2002); only models based on significant alignments (PSI-BLAST E-value < 10-4) and models assessed to have the correct fold are included. Other datasets include models for target selection and structure-based annotation by the New York Structural Genomics Research Consortium, models for prediction of genes in the Drosophila melanogaster genome, models for structure determination of several ribosomal particles and models calculated by the MODWEB comparative modeling Web server.

Subcategory: Protein structure

Go to the abstract in the NAR 2014 Database Issue.
Oxford University Press is not responsible for the content of external internet sites