Mostrar el registro sencillo del ítem

dc.contributor.authorVillegas Morcillo, Amelia Otilia 
dc.contributor.authorGómez García, Ángel Manuel 
dc.contributor.authorSánchez Calle, Victoria Eugenia 
dc.date.accessioned2021-06-16T07:33:13Z
dc.date.available2021-06-16T07:33:13Z
dc.date.issued2020-08-14
dc.identifier.citationAmelia Villegas-Morcillo, Stavros Makrodimitris, Roeland C H J van Ham, Angel M Gomez, Victoria Sanchez, Marcel J T Reinders, Unsupervised protein embeddings outperform hand-crafted sequence and structure features at predicting molecular function, Bioinformatics, Volume 37, Issue 2, 15 January 2021, Pages 162–170, [https://doi.org/10.1093/bioinformatics/btaa701]es_ES
dc.identifier.urihttp://hdl.handle.net/10481/69200
dc.descriptionThis work was supported by Keygene N.V., a crop innovation company in the Netherlands and by the Spanish MINECO/FEDER Project TEC201680141-P with the associated FPI grant BES-2017-079792.es_ES
dc.descriptionThe authors thank Dr. Elvin Isufi and Chirag Raman for their valuable comments and feedback.es_ES
dc.description.abstractMotivation: Protein function prediction is a difficult bioinformatics problem. Many recent methods use deep neural networks to learn complex sequence representations and predict function from these. Deep supervised models require a lot of labeled training data which are not available for this task. However, a very large amount of protein sequences without functional labels is available. Results: We applied an existing deep sequence model that had been pretrained in an unsupervised setting on the supervised task of protein molecular function prediction. We found that this complex feature representation is effective for this task, outperforming hand-crafted features such as one-hot encoding of amino acids, k-mer counts, secondary structure and backbone angles. Also, it partly negates the need for complex prediction models, as a two-layer perceptron was enough to achieve competitive performance in the third Critical Assessment of Functional Annotation benchmark. We also show that combining this sequence representation with protein 3D structure information does not lead to performance improvement, hinting that 3D structure is also potentially learned during the unsupervised pretraining.es_ES
dc.description.sponsorshipKeygene N.V., a crop innovation company in the Netherlandses_ES
dc.description.sponsorshipSpanish MINECO/FEDER TEC201680141-Pes_ES
dc.description.sponsorshipFPI grant BES-2017-079792es_ES
dc.language.isoenges_ES
dc.publisherOxford University Presses_ES
dc.rightsAtribución-NoComercial 3.0 España*
dc.rights.urihttp://creativecommons.org/licenses/by-nc/3.0/es/*
dc.titleUnsupervised protein embeddings outperform hand-crafted sequence and structure features at predicting molecular functiones_ES
dc.typeinfo:eu-repo/semantics/articlees_ES
dc.rights.accessRightsinfo:eu-repo/semantics/openAccesses_ES
dc.identifier.doi10.1093/bioinformatics/btaa701
dc.type.hasVersioninfo:eu-repo/semantics/publishedVersiones_ES


Ficheros en el ítem

[PDF]

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem

Atribución-NoComercial 3.0 España
Excepto si se señala otra cosa, la licencia del ítem se describe como Atribución-NoComercial 3.0 España