Romanian
Albanian
Arabic
Armenian
Azerbaijani
Belarusian
Bengali
Bosnian
Catalan
Czech
Danish
Deutsch
Dutch
English
Estonian
Finnish
Français
Greek
Haitian Creole
Hebrew
Hindi
Hungarian
Icelandic
Indonesian
Irish
Italian
Japanese
Korean
Latvian
Lithuanian
Macedonian
Mongolian
Norwegian
Persian
Polish
Portuguese
Romanian
Russian
Serbian
Slovak
Slovenian
Spanish
Swahili
Swedish
Turkish
Ukrainian
Vietnamese
Български
中文(简体)
中文(繁體)
International Journal of Molecular Sciences 2018-Jan

Assessing the Performances of Protein Function Prediction Algorithms from the Perspectives of Identification Accuracy and False Discovery Rate.

Numai utilizatorii înregistrați pot traduce articole
Log In / Înregistrare
Linkul este salvat în clipboard
Chun Yan Yu
Xiao Xu Li
Hong Yang
Ying Hong Li
Wei Wei Xue
Yu Zong Chen
Lin Tao
Feng Zhu

Cuvinte cheie

Abstract

The function of a protein is of great interest in the cutting-edge research of biological mechanisms, disease development and drug/target discovery. Besides experimental explorations, a variety of computational methods have been designed to predict protein function. Among these in silico methods, the prediction of BLAST is based on protein sequence similarity, while that of machine learning is also based on the sequence, but without the consideration of their similarity. This unique characteristic of machine learning makes it a good complement to BLAST and many other approaches in predicting the function of remotely relevant proteins and the homologous proteins of distinct function. However, the identification accuracies of these in silico methods and their false discovery rate have not yet been assessed so far, which greatly limits the usage of these algorithms. Herein, a comprehensive comparison of the performances among four popular prediction algorithms (BLAST, SVM, PNN and KNN) was conducted. In particular, the performance of these methods was systematically assessed by four standard statistical indexes based on the independent test datasets of 93 functional protein families defined by UniProtKB keywords. Moreover, the false discovery rates of these algorithms were evaluated by scanning the genomes of four representative model organisms (Homo sapiens, Arabidopsis thaliana, Saccharomyces cerevisiae and Mycobacterium tuberculosis). As a result, the substantially higher sensitivity of SVM and BLAST was observed compared with that of PNN and KNN. However, the machine learning algorithms (PNN, KNN and SVM) were found capable of substantially reducing the false discovery rate (SVM < PNN < KNN). In sum, this study comprehensively assessed the performance of four popular algorithms applied to protein function prediction, which could facilitate the selection of the most appropriate method in the related biomedical research.

Alăturați-vă paginii
noastre de facebook

Cea mai completă bază de date cu plante medicinale susținută de știință

  • Funcționează în 55 de limbi
  • Cure pe bază de plante susținute de știință
  • Recunoașterea ierburilor după imagine
  • Harta GPS interactivă - etichetați ierburile în locație (în curând)
  • Citiți publicațiile științifice legate de căutarea dvs.
  • Căutați plante medicinale după efectele lor
  • Organizați-vă interesele și rămâneți la curent cu noutățile de cercetare, studiile clinice și brevetele

Tastați un simptom sau o boală și citiți despre plante care ar putea ajuta, tastați o plantă și vedeți boli și simptome împotriva cărora este folosit.
* Toate informațiile se bazează pe cercetări științifice publicate

Google Play badgeApp Store badge