{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,3]],"date-time":"2026-03-03T16:26:05Z","timestamp":1772555165681,"version":"3.50.1"},"reference-count":46,"publisher":"Oxford University Press (OUP)","issue":"2","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2015,1,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Intrinsically disordered regions are key for the function of numerous proteins. Due to the difficulties in experimental disorder characterization, many computational predictors have been developed with various disorder flavors. Their performance is generally measured on small sets mainly from experimentally solved structures, e.g. Protein Data Bank (PDB) chains. MobiDB has only recently started to collect disorder annotations from multiple experimental structures.<\/jats:p>\n               <jats:p>Results: MobiDB annotates disorder for UniProt sequences, allowing us to conduct the first large-scale assessment of fast disorder predictors on 25 833 different sequences with X-ray crystallographic structures. In addition to a comprehensive ranking of predictors, this analysis produced the following interesting observations. (i) The predictors cluster according to their disorder definition, with a consensus giving more confidence. (ii) Previous assessments appear over-reliant on data annotated at the PDB chain level and performance is lower on entire UniProt sequences. (iii) Long disordered regions are harder to predict. (iv) Depending on the structural and functional types of the proteins, differences in prediction performance of up to 10% are observed.<\/jats:p>\n               <jats:p>Availability: The datasets are available from Web site at URL: https:\/\/summer-heart-0930.chufeiyun1688.workers.dev:443\/http\/mobidb.bio.unipd.it\/lsd .<\/jats:p>\n               <jats:p>Contact: \u00a0silvio.tosatto@unipd.it<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btu625","type":"journal-article","created":{"date-parts":[[2014,9,23]],"date-time":"2014-09-23T02:41:35Z","timestamp":1411440095000},"page":"201-208","source":"Crossref","is-referenced-by-count":143,"title":["Comprehensive large-scale assessment of intrinsic protein disorder"],"prefix":"10.1093","volume":"31","author":[{"given":"Ian","family":"Walsh","sequence":"first","affiliation":[{"name":"1 Department of Biomedical Sciences, 2 Department of Information Engineering, University of Padua, Via Gradenigo 6, 35121 Padova, Italy and 3 Institute for Advanced Simulation, Forschungszentrum Juelich, Wilhelm-Johnen-Str., 52425 Juelich, Germany"}]},{"given":"Manuel","family":"Giollo","sequence":"additional","affiliation":[{"name":"1 Department of Biomedical Sciences, 2 Department of Information Engineering, University of Padua, Via Gradenigo 6, 35121 Padova, Italy and 3 Institute for Advanced Simulation, Forschungszentrum Juelich, Wilhelm-Johnen-Str., 52425 Juelich, Germany"},{"name":"1 Department of Biomedical Sciences, 2 Department of Information Engineering, University of Padua, Via Gradenigo 6, 35121 Padova, Italy and 3 Institute for Advanced Simulation, Forschungszentrum Juelich, Wilhelm-Johnen-Str., 52425 Juelich, Germany"}]},{"given":"Tom\u00e1s","family":"Di Domenico","sequence":"additional","affiliation":[{"name":"1 Department of Biomedical Sciences, 2 Department of Information Engineering, University of Padua, Via Gradenigo 6, 35121 Padova, Italy and 3 Institute for Advanced Simulation, Forschungszentrum Juelich, Wilhelm-Johnen-Str., 52425 Juelich, Germany"}]},{"given":"Carlo","family":"Ferrari","sequence":"additional","affiliation":[{"name":"1 Department of Biomedical Sciences, 2 Department of Information Engineering, University of Padua, Via Gradenigo 6, 35121 Padova, Italy and 3 Institute for Advanced Simulation, Forschungszentrum Juelich, Wilhelm-Johnen-Str., 52425 Juelich, Germany"}]},{"given":"Olav","family":"Zimmermann","sequence":"additional","affiliation":[{"name":"1 Department of Biomedical Sciences, 2 Department of Information Engineering, University of Padua, Via Gradenigo 6, 35121 Padova, Italy and 3 Institute for Advanced Simulation, Forschungszentrum Juelich, Wilhelm-Johnen-Str., 52425 Juelich, Germany"}]},{"given":"Silvio C. E.","family":"Tosatto","sequence":"additional","affiliation":[{"name":"1 Department of Biomedical Sciences, 2 Department of Information Engineering, University of Padua, Via Gradenigo 6, 35121 Padova, Italy and 3 Institute for Advanced Simulation, Forschungszentrum Juelich, Wilhelm-Johnen-Str., 52425 Juelich, Germany"}]}],"member":"286","published-online":{"date-parts":[[2014,9,21]]},"reference":[{"key":"2023020116141780000_btu625-B1","doi-asserted-by":"crossref","first-page":"459","DOI":"10.1093\/protein\/gzg063","article-title":"Simple consensus procedures are effective and sufficient in secondary structure prediction","volume":"16","author":"Albrecht","year":"2003","journal-title":"Protein Eng."},{"key":"2023020116141780000_btu625-B2","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1038\/75556","article-title":"Gene Ontology: tool for the unification of biology. The Gene Ontology consortium","volume":"25","author":"Ashburner","year":"2000","journal-title":"Nat. Genet."},{"key":"2023020116141780000_btu625-B3","doi-asserted-by":"crossref","first-page":"R14","DOI":"10.1186\/gb-2011-12-2-r14","article-title":"Bringing order to protein disorder through comparative genomics and genetic interactions","volume":"12","author":"Bellay","year":"2011","journal-title":"Genome Biol."},{"key":"2023020116141780000_btu625-B4","doi-asserted-by":"crossref","first-page":"2080","DOI":"10.1093\/bioinformatics\/bts327","article-title":"MobiDB: a comprehensive database of intrinsic protein disorder annotations","volume":"28","author":"Di Domenico","year":"2012","journal-title":"Bioinformatics"},{"key":"2023020116141780000_btu625-B5","doi-asserted-by":"crossref","first-page":"2985","DOI":"10.1021\/pr060171o","article-title":"Disorder and sequence repeats in hub proteins and their implications for network evolution","volume":"5","author":"Doszt\u00e1nyi","year":"2006","journal-title":"J. Proteome Res."},{"key":"2023020116141780000_btu625-B6","doi-asserted-by":"crossref","first-page":"827","DOI":"10.1016\/j.jmb.2005.01.071","article-title":"The pairwise energy content estimated from amino acid composition discriminates between folded and intrinsically unstructured proteins","volume":"347","author":"Doszt\u00e1nyi","year":"2005","journal-title":"J. Mol. Biol."},{"key":"2023020116141780000_btu625-B7","doi-asserted-by":"crossref","first-page":"6573","DOI":"10.1021\/bi012159+","article-title":"Intrinsic disorder and protein function","volume":"41","author":"Dunker","year":"2002","journal-title":"Biochemistry"},{"key":"2023020116141780000_btu625-B8","first-page":"161","article-title":"Intrinsic protein disorder in complete genomes","volume":"11","author":"Dunker","year":"2000","journal-title":"Genome Inform. Workshop Genome Inform."},{"key":"2023020116141780000_btu625-B9","doi-asserted-by":"crossref","first-page":"805","DOI":"10.1038\/nbt0901-805","article-title":"The protein trinity\u2014linking function and disorder","volume":"19","author":"Dunker","year":"2001","journal-title":"Nat. Biotechnol."},{"key":"2023020116141780000_btu625-B10","doi-asserted-by":"crossref","first-page":"88","DOI":"10.1186\/1471-2105-14-88","article-title":"DNdisorder: predicting protein disorder using boosting and deep networks","volume":"14","author":"Eickholt","year":"2013","journal-title":"BMC Bioinformatics"},{"key":"2023020116141780000_btu625-B11","doi-asserted-by":"crossref","first-page":"e177","DOI":"10.1371\/journal.pcbi.0020177","article-title":"Prediction of amyloidogenic and disordered regions in protein chains","volume":"2","author":"Galzitskaya","year":"2006","journal-title":"PLoS Comput. Biol."},{"key":"2023020116141780000_btu625-B12","doi-asserted-by":"crossref","first-page":"2046","DOI":"10.1093\/bioinformatics\/btm302","article-title":"POODLE-L: a two-level SVM prediction system for reliably predicting long disordered regions","volume":"23","author":"Hirose","year":"2007","journal-title":"Bioinformatics"},{"key":"2023020116141780000_btu625-B13","doi-asserted-by":"crossref","first-page":"W460","DOI":"10.1093\/nar\/gkm363","article-title":"PrDOS: prediction of disordered protein regions from amino acid sequence","volume":"35","author":"Ishida","year":"2007","journal-title":"Nucleic Acids Res."},{"key":"2023020116141780000_btu625-B14","doi-asserted-by":"crossref","first-page":"1344","DOI":"10.1093\/bioinformatics\/btn195","article-title":"Prediction of disordered regions in proteins based on the meta approach","volume":"24","author":"Ishida","year":"2008","journal-title":"Bioinformatics"},{"key":"2023020116141780000_btu625-B15","doi-asserted-by":"crossref","first-page":"161","DOI":"10.1016\/S0968-0004(01)02039-4","article-title":"Getting the most from PSI-BLAST","volume":"27","author":"Jones","year":"2002","journal-title":"Trends Biochem. Sci."},{"key":"2023020116141780000_btu625-B16","doi-asserted-by":"crossref","first-page":"3701","DOI":"10.1093\/nar\/gkg519","article-title":"GlobPlot: exploring protein sequences for globularity and disorder","volume":"31","author":"Linding","year":"2003","journal-title":"Nucleic Acids Res."},{"key":"2023020116141780000_btu625-B17","doi-asserted-by":"crossref","first-page":"1453","DOI":"10.1016\/j.str.2003.10.002","article-title":"Protein disorder prediction: implications for structural proteomics","volume":"11","author":"Linding","year":"2003","journal-title":"Structure"},{"key":"2023020116141780000_btu625-B18","doi-asserted-by":"crossref","first-page":"2916","DOI":"10.1093\/bioinformatics\/btq537","article-title":"MOBI: a web server to define and visualize structural mobility in NMR protein ensembles","volume":"26","author":"Martin","year":"2010","journal-title":"Bioinformatics"},{"key":"2023020116141780000_btu625-B19","doi-asserted-by":"crossref","first-page":"i489","DOI":"10.1093\/bioinformatics\/btq373","article-title":"Improved sequence-based prediction of disordered regions with multilayer fusion of multiple information sources","volume":"26","author":"Mizianty","year":"2010","journal-title":"Bioinformatics"},{"key":"2023020116141780000_btu625-B20","doi-asserted-by":"crossref","first-page":"245","DOI":"10.1186\/1471-2105-12-245","article-title":"In-silico prediction of disorder content using hybrid sequence representation","volume":"12","author":"Mizianty","year":"2011","journal-title":"BMC Bioinformatics"},{"key":"2023020116141780000_btu625-B21","doi-asserted-by":"crossref","first-page":"e1000497","DOI":"10.1371\/journal.pcbi.1000497","article-title":"Influence of sequence changes and environment on intrinsically disordered proteins","volume":"5","author":"Mohan","year":"2009","journal-title":"PLoS Comput. Biol."},{"key":"2023020116141780000_btu625-B22","doi-asserted-by":"crossref","first-page":"127","DOI":"10.1002\/prot.24391","article-title":"Assessment of protein disorder region predictions in CASP10","volume":"82","author":"Monastyrskyy","year":"2014","journal-title":"Proteins"},{"key":"2023020116141780000_btu625-B23","doi-asserted-by":"crossref","first-page":"208","DOI":"10.1186\/1471-2105-7-208","article-title":"Length-dependent prediction of protein intrinsic disorder","volume":"7","author":"Peng","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023020116141780000_btu625-B24","doi-asserted-by":"crossref","first-page":"3435","DOI":"10.1093\/bioinformatics\/bti537","article-title":"FoldIndex: a simple tool to predict whether a given protein sequence is intrinsically unfolded","volume":"21","author":"Prilusky","year":"2005","journal-title":"Bioinformatics"},{"key":"2023020116141780000_btu625-B25","doi-asserted-by":"crossref","first-page":"38","DOI":"10.1002\/1097-0134(20010101)42:1<38::AID-PROT50>3.0.CO;2-3","article-title":"Sequence complexity of disordered protein","volume":"42","author":"Romero","year":"2001","journal-title":"Proteins"},{"key":"2023020116141780000_btu625-B26","doi-asserted-by":"crossref","first-page":"D475","DOI":"10.1093\/nar\/gks1200","article-title":"The RCSB Protein Data Bank: new resources for research and education","volume":"41","author":"Rose","year":"2013","journal-title":"Nucleic Acids Res."},{"key":"2023020116141780000_btu625-B27","doi-asserted-by":"crossref","first-page":"204","DOI":"10.1006\/jsbi.2001.4336","article-title":"Review: protein secondary structure prediction continues to rise","volume":"134","author":"Rost","year":"2001","journal-title":"J. Struct. Biol."},{"key":"2023020116141780000_btu625-B28","doi-asserted-by":"crossref","first-page":"e4433","DOI":"10.1371\/journal.pone.0004433","article-title":"Improved disorder prediction by combination of orthogonal approaches","volume":"4","author":"Schlessinger","year":"2009","journal-title":"PloS One"},{"key":"2023020116141780000_btu625-B29","doi-asserted-by":"crossref","first-page":"412","DOI":"10.1016\/j.sbi.2011.03.014","article-title":"Protein disorder\u2014a breakthrough invention of evolution?","volume":"21","author":"Schlessinger","year":"2011","journal-title":"Curr. Opin. Struct. Biol."},{"key":"2023020116141780000_btu625-B30","doi-asserted-by":"crossref","first-page":"D786","DOI":"10.1093\/nar\/gkl893","article-title":"DisProt: the database of disordered proteins","volume":"35","author":"Sickmeier","year":"2007","journal-title":"Nucleic Acids Res."},{"key":"2023020116141780000_btu625-B31","doi-asserted-by":"crossref","first-page":"D71","DOI":"10.1093\/nar\/gkr981","article-title":"Reorganizing the protein space at the Universal Protein Resource (UniProt)","volume":"40","author":"The UniProt Consortium","year":"2012","journal-title":"Nucleic Acids Res."},{"key":"2023020116141780000_btu625-B32","doi-asserted-by":"crossref","first-page":"328","DOI":"10.1002\/bies.200800151","article-title":"Close encounters of the third kind: disordered domains and the interactions of proteins","volume":"31","author":"Tompa","year":"2009","journal-title":"BioEssays"},{"key":"2023020116141780000_btu625-B33","doi-asserted-by":"crossref","first-page":"527","DOI":"10.1016\/S0968-0004(02)02169-2","article-title":"Intrinsically unstructured proteins","volume":"27","author":"Tompa","year":"2002","journal-title":"Trends Biochem. Sci."},{"key":"2023020116141780000_btu625-B34","doi-asserted-by":"crossref","first-page":"D483","DOI":"10.1093\/nar\/gks1258","article-title":"SIFTS: structure integration with function, taxonomy and sequences resource","volume":"41","author":"Velankar","year":"2013","journal-title":"Nucleic Acids Res."},{"key":"2023020116141780000_btu625-B35","doi-asserted-by":"crossref","first-page":"573","DOI":"10.1002\/prot.10437","article-title":"Flavors of protein disorder","volume":"52","author":"Vucetic","year":"2003","journal-title":"Proteins"},{"key":"2023020116141780000_btu625-B36","doi-asserted-by":"crossref","first-page":"W190","DOI":"10.1093\/nar\/gkr411","article-title":"CSpritz: accurate prediction of protein disorder segments with annotation for homology, secondary structure and linear motifs","volume":"39","author":"Walsh","year":"2011","journal-title":"Nucleic Acids Res."},{"key":"2023020116141780000_btu625-B37","doi-asserted-by":"crossref","first-page":"503","DOI":"10.1093\/bioinformatics\/btr682","article-title":"ESpritz: accurate and fast prediction of protein disorder","volume":"28","author":"Walsh","year":"2012","journal-title":"Bioinformatics"},{"key":"2023020116141780000_btu625-B38","doi-asserted-by":"crossref","first-page":"635","DOI":"10.1016\/j.jmb.2004.02.002","article-title":"Prediction and functional analysis of native disorder in proteins from the three kingdoms of life","volume":"337","author":"Ward","year":"2004","journal-title":"J. Mol. Biol."},{"key":"2023020116141780000_btu625-B39","doi-asserted-by":"crossref","first-page":"269","DOI":"10.1016\/0097-8485(94)85023-2","article-title":"Non-globular domains in protein sequences: automated segmentation using complexity measures","volume":"18","author":"Wootton","year":"1994","journal-title":"Comput. Chem."},{"key":"2023020116141780000_btu625-B40","doi-asserted-by":"crossref","first-page":"321","DOI":"10.1006\/jmbi.1999.3110","article-title":"Intrinsically unstructured proteins: re-assessing the protein structure-function paradigm","volume":"293","author":"Wright","year":"1999","journal-title":"J. Mol. Biol."},{"key":"2023020116141780000_btu625-B41","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1016\/j.sbi.2008.12.003","article-title":"Linking folding and binding","volume":"19","author":"Wright","year":"2009","journal-title":"Curr. Opin. Struct. Biol."},{"key":"2023020116141780000_btu625-B42","doi-asserted-by":"crossref","first-page":"1688","DOI":"10.1039\/b905913j","article-title":"Analysis of structured and intrinsically disordered regions of transmembrane proteins","volume":"5","author":"Xue","year":"2009","journal-title":"Mol. Biosyst."},{"key":"2023020116141780000_btu625-B43","doi-asserted-by":"crossref","first-page":"996","DOI":"10.1016\/j.bbapap.2010.01.011","article-title":"PONDR-FIT: a meta-predictor of intrinsically disordered amino acids","volume":"1804","author":"Xue","year":"2010","journal-title":"Biochim. Biophys. Acta"},{"key":"2023020116141780000_btu625-B44","doi-asserted-by":"crossref","first-page":"6880","DOI":"10.1021\/cr4005692","article-title":"Structural disorder in viral proteins","volume":"114","author":"Xue","year":"2014","journal-title":"Chem. Rev."},{"key":"2023020116141780000_btu625-B45","doi-asserted-by":"crossref","first-page":"3369","DOI":"10.1093\/bioinformatics\/bti534","article-title":"RONN: the bio-basis function neural network technique applied to the detection of natively disordered regions in proteins","volume":"21","author":"Yang","year":"2005","journal-title":"Bioinformatics"},{"key":"2023020116141780000_btu625-B46","doi-asserted-by":"crossref","first-page":"220","DOI":"10.1002\/(SICI)1097-0134(19990201)34:2<220::AID-PROT7>3.0.CO;2-K","article-title":"A modified definition of Sov, a segment-based measure for protein secondary structure prediction assessment","volume":"34","author":"Zemla","year":"1999","journal-title":"Proteins"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/summer-heart-0930.chufeiyun1688.workers.dev:443\/https\/academic.oup.com\/bioinformatics\/article-pdf\/31\/2\/201\/49011587\/bioinformatics_31_2_201.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/summer-heart-0930.chufeiyun1688.workers.dev:443\/https\/academic.oup.com\/bioinformatics\/article-pdf\/31\/2\/201\/49011587\/bioinformatics_31_2_201.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,2]],"date-time":"2023-02-02T00:23:02Z","timestamp":1675297382000},"score":1,"resource":{"primary":{"URL":"https:\/\/summer-heart-0930.chufeiyun1688.workers.dev:443\/https\/academic.oup.com\/bioinformatics\/article\/31\/2\/201\/2365844"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,9,21]]},"references-count":46,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2015,1,15]]}},"URL":"https:\/\/summer-heart-0930.chufeiyun1688.workers.dev:443\/https\/doi.org\/10.1093\/bioinformatics\/btu625","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2015,1,15]]},"published":{"date-parts":[[2014,9,21]]}}}