ChEBI - Chemical Entities of Biological Interest
2Laboratoire Statistique et Génome, Université d'Evry Val d'Essonne, France
3Department of Genetics, University of Cambridge, Cambridge, UK
Database Description
- Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (http://www.iubmb.org/) (NC-IUBMB)
All data in the database is non-proprietary or is derived from a non-proprietary source. Each data item is fully traceable and explicitly referenced to the original source.
ChEBI shows the following data fields:- ChEBI identifer - the unique identifer
- ChEBI name - the name recommended for use in biological databases
- ChEBI ASCII name - the ChEBI name with any special characters rendered in ASCII format
- Definition - a short verbal definition is included in some entries, especially relevant to classes
- Structure - graphical representation(s) of a molecular structure and associated molfile(s), IUPAC International Chemical Identifier (InChI) and SMILES strings
- Formula - Molecular formula
- Parents and children, according to ChEBI Ontology
- IUPAC name - a name generated according to the recommendations of IUPAC
- Synonyms - other names together with an indication of their source
- Database links - manually curated cross-references to other non-proprietary databases
- Registry numbers - CAS Registry Number, Beilstein Registry Number, Gmelin Registry Number (if available)
- Comment(s) - A free-text comment may be added to some terms, especially in cases where confusing terminology has been historically used. A comment may relate to a single term or to the entry as a whole.
The ChEBI terms have been automatically mapped to the UniProt (http://www.ebi.uniprot.org/) database, and for each ChEBI entity the list of UniProt entries containing the relevant terms are available. ChEBI can be accessed through the WWW (http://www.ebi.ac.uk/chebi/) or may be downloaded by anonymous FTP (ftp://ftp.ebi.ac.uk/pub/databases/chebi/).
Recent Developments
ChEBI Ontology consists of four sub-ontologies:
- Molecular Structure, in which molecular entities or parts thereof are classified according to structure;
- Biological Role, which classifies entities on the basis of their role within a biological context (e.g. antibiotic, coenzyme, hormone);
- Application, which classifies entities, where appropriate, on the basis of their intended use by humans (e.g. pesticide, drug, fuel);
- Subatomic Particle, which classifies particles smaller than atoms.
The ontology uses the following relationship types:
is a
is part of
is conjugate acid of and is conjugate base of
is tautomer of
is enantiomer of
has functional parent
has parent hydride
is substituent group from
Web Services: the main aim of ChEBI Web Services (http://www.ebi.ac.uk/chebi/webServices.do) is to provide programmatic access to the ChEBI database in order to aid our users in integrating ChEBI into their applications. ChEBI provides Simple Object Access Protocol (SOAP) access to its database. Both Java and perl clients are available through which you can access the data.To obtain ‘lightweight’ ontology objects, the Ontology Lookup Service (http://www.ebi.ac.uk/ontology-lookup/) [2] can be used as alternative Web Services.
SourceForge forum (http://sourceforge.net/projects/chebi/) has been established and is used to report bugs, discuss annotation issues and to request the new ChEBI terms or entries.
PubChem deposition: the structures of molecular entities from ChEBI are deposited to PubChem (http://pubchem.ncbi.nlm.nih.gov/)