Skip to main navigation Skip to search Skip to main content

Iterative feature representations improve N4-methylcytosine site prediction

  • Tianjin University
  • Fujian Medical University
  • Ajou University
  • University of Electronic Science and Technology of China
  • Guangzhou University

Research output: Contribution to journalArticlepeer-review

Abstract

Motivation: Accurate identification of N4-methylcytosine (4mC) modifications in a genome wide can provide insights into their biological functions and mechanisms. Machine learning recently have become effective approaches for computational identification of 4mC sites in genome. Unfortunately, existing methods cannot achieve satisfactory performance, owing to the lack of effective DNA feature representations that are capable to capture the characteristics of 4mC modifications. Results: In this work, we developed a new predictor named 4mcPred-IFL, aiming to identify 4mC sites. To represent and capture discriminative features, we proposed an iterative feature representation algorithm that enables to learn informative features from several sequential models in a supervised iterative mode. Our analysis results showed that the feature representations learnt by our algorithm can capture the discriminative distribution characteristics between 4mC sites and non-4mC sites, enlarging the decision margin between the positives and negatives in feature space. Additionally, by evaluating and comparing our predictor with the state-of-the-art predictors on benchmark datasets, we demonstrate that our predictor can identify 4mC sites more accurately. Availability and implementation: The user-friendly webserver that implements the proposed 4mcPred-IFL is well established, and is freely accessible at http://server.malab.cn/4mcPred-IFL. Supplementary information: Supplementary data are available at Bioinformatics online.

Original languageEnglish
Pages (from-to)4930-4937
Number of pages8
JournalBioinformatics
Volume35
Issue number23
DOIs
StatePublished - 1 Dec 2019
Externally publishedYes

Fingerprint

Dive into the research topics of 'Iterative feature representations improve N4-methylcytosine site prediction'. Together they form a unique fingerprint.

Cite this