NAR Molecular Biology Database Collection entry number 1679
Lin Yang1, Tianyin Zhou1, Iris Dror1,2, Anthony Mathelier3, Wyeth W. Wasserman3, Raluca Gordân4, and Remo Rohs1
1 Molecular and Computational Biology Program, University of Southern California, Los Angeles, CA 90089, USA 2 Department of Biology, Technion – Israel Institute of Technology, Technion City, Haifa 32000, Israel 3 Centre for Molecular Medicine and Therapeutics, University of British Columbia, Vancouver, BC, Canada 4 Institute for Genome Sciences & Policy, Duke University, Durham, NC 27708, USA

Database Description

TFBSshape provides DNA shape features for transcription factor binding sites (TFBSs) that in addtion to sequence features, usually in the form of position weight matrices (PWMs), characterize DNA binding specificities of transcription factors (TFs) from 23 different species. The DNA shape features were predicted using our recently published high-throughput method (1) and visualized in the form of heat maps. Users can download these DNA shape feature data and perform further analysis, for example, to identify which DNA shape feature(s) at which position(s) in the target DNA contribute to the DNA binding specificity of a TF. Users can also compare DNA shape profiles, both qualitatively and quantitatively, between two TF datasets from the database, or between a set of uploaded sequences and a TF from the database. TFBSshape collects data from the open-access motif databases JASPAR (2) and UniPROBE (3). However, TFBSshape is fundamentally different from the two databases in that it provides DNA shape profiles for TFBSs in addition to sequence information. The JASPAR database provides TFBSs for a significant portion of its TF collection. For these TFs, TFBSshape uses our high-throuput method to predict DNA shape features for the TFBSs. On the other hand, the UniPROBE provides only protein binding microarray (PBM) probes and PWMs derived from the PBM experiments for the TFs, instead of specific TFBSs. In this case, TFBSshape first uses FIMO (4) to scan the PBM probes for TFBSs given the probe sequences and UniPROBE PWMs as input. DNA shape feature are then predicted for these TFBSs found by FIMO. Since DNA shape has been shown to play an important role in protein-DNA recognition (5), providing DNA shape feature data for TFBSs derived from motif databases will help to advance our understanding of the mechanisms underlying TF-DNA interactions.


This work was supported by the USC-Technion Visiting Fellows Program, an Alfred P. Sloan Research Fellowship [to R.R.], the National Institutes of Health (NIH) [U01GM103804 and R01HG003008; in part to R.R.], and a PhRMA Foundation Research Starter Grant [to R.G.].


1. Zhou, T., Yang, L., Lu, Y., Dror, I., Dantas Machado, A.C., Ghane, T., Di Felice, R. and Rohs, R. (2013) DNAshape: a method for the high-throughput prediction of DNA structural features on a genomic scale. Nucleic Acids Res. 41, W56-62. 2. Mathelier, A., Zhao, X., Zhang, A., Parcy, F., Worsley-Hunt, R., Arenillas, D., Buchman, S., Chen, C.-Y., Chou, A., Ienasescu, H., Lim, J., Shyr, C., Tan, G., Zhou, M., Lenhardt, B., Sandelin, A. and Wasserman, W.W. (2014) JASPAR 2014: An extensively expanded and updated open-access database of transcription factor binding profiles. Nucleic Acids Res. (Database Issue 2014). 3. Robasky, K. and Bulyk, M.L. (2011) UniPROBE, update 2011: expanded content and search tools in the online database of protein-binding microarray data on protein-DNA interactions. Nucleic Acids Res. 39, D124-128. 4. Grant, C.E., Bailey, T.L. and Noble, W.S. (2011) FIMO: scanning for occurrences of a given motif. Bioinformatics 27, 1017-1018. 5. Rohs, R., West, S.M., Sosinsky, A., Liu, P., Mann, R.S. and Honig, B. (2009) The role of DNA shape in protein-DNA recognition. Nature 461, 1248-1253.

Go to the abstract in the NAR 2014 Database Issue.
Oxford University Press is not responsible for the content of external internet sites