Extremely-randomized-tree-based prediction of N6-methyladenosine sites in saccharomyces cerevisiae

Research output: Contribution to journalArticlepeer-review

24 Scopus citations

Abstract

N6-methyladenosine (m6A) is one of the most common post-transcriptional modifications in RNA, which has been related to several biological processes. The accurate prediction of m6A sites from RNA sequences is one of the challenging tasks in computational biology. Several computational methods utilizing machine-learning algorithms have been proposed that accelerate in silico screening of m6A sites, thereby drastically reducing the experimental time and labor costs involved. In this study, we proposed a novel computational predictor termed ERT-m6Apred, for the accurate prediction of 6mA sites. To identify the feature encodings with more discriminative capability, we applied a two-step feature selection technique on seven different feature encodings and identified the corresponding optimal feature set. Subsequently, performance comparison of the corresponding optimal feature set-based extremely randomized tree model revealed that Pseudo k-tuple composition encoding, which includes 14 physicochemical properties significantly outperformed other encodings. Moreover, ERT-m6Apred achieved an accuracy of 78.84% during cross-validation analysis, which is comparatively better than recently reported predictors. In summary, ERT-m6Apred predicts Saccharomyces cerevisiae m6A sites with higher accuracy, thus facilitating biological hypothesis generation and experimental validations.

Original languageEnglish
Pages (from-to)26-33
Number of pages8
JournalCurrent Genomics
Volume21
Issue number1
DOIs
StatePublished - 2020
Externally publishedYes

Keywords

  • And RNA sequences
  • Cross-validation
  • Extremely randomized tree
  • Feature optimization
  • N-methyladenosine sites

Fingerprint

Dive into the research topics of 'Extremely-randomized-tree-based prediction of N6-methyladenosine sites in saccharomyces cerevisiae'. Together they form a unique fingerprint.

Cite this