Concepedia

Publication | Open Access

BERMP: a cross-species classifier for predicting m<sup>6</sup>A sites by integrating a deep learning algorithm and a random forest approach

104

Citations

25

References

2018

Year

Abstract

N<sup>6</sup>-methyladenosine (m<sup>6</sup>A) is a prevalent RNA methylation modification involved in several biological processes. Hundreds or thousands of m<sup>6</sup>A sites identified from different species using high-throughput experiments provides a rich resource to construct <i>in-silico</i> approaches for identifying m<sup>6</sup>A sites. The existing m<sup>6</sup>A predictors are developed using conventional machine-learning (ML) algorithms and most are species-centric. In this paper, we develop a novel cross-species deep-learning classifier based on bidirectional Gated Recurrent Unit (BGRU) for the prediction of m<sup>6</sup>A sites. In comparison with conventional ML approaches, BGRU achieves outstanding performance for the <i>Mammalia</i> dataset that contains over fifty thousand m<sup>6</sup>A sites but inferior for the <i>Saccharomyces cerevisiae</i> dataset that covers around a thousand positives. The accuracy of BGRU is sensitive to the data size and the sensitivity is compensated by the integration of a random forest classifier with a novel encoding of enhanced nucleic acid content. The integrated approach dubbed as BGRU-based Ensemble RNA Methylation site Predictor (BERMP) has competitive performance in both cross-validation test and independent test. BERMP also outperforms existing m<sup>6</sup>A predictors for different species. Therefore, BERMP is a novel multi-species tool for identifying m<sup>6</sup>A sites with high confidence. This classifier is freely available at http://www.bioinfogo.org/bermp.

References

YearCitations

Page 1