Publication | Open Access
BERMP: a cross-species classifier for predicting m<sup>6</sup>A sites by integrating a deep learning algorithm and a random forest approach
104
Citations
25
References
2018
Year
N<sup>6</sup>-methyladenosine (m<sup>6</sup>A) is a prevalent RNA methylation modification involved in several biological processes. Hundreds or thousands of m<sup>6</sup>A sites identified from different species using high-throughput experiments provides a rich resource to construct <i>in-silico</i> approaches for identifying m<sup>6</sup>A sites. The existing m<sup>6</sup>A predictors are developed using conventional machine-learning (ML) algorithms and most are species-centric. In this paper, we develop a novel cross-species deep-learning classifier based on bidirectional Gated Recurrent Unit (BGRU) for the prediction of m<sup>6</sup>A sites. In comparison with conventional ML approaches, BGRU achieves outstanding performance for the <i>Mammalia</i> dataset that contains over fifty thousand m<sup>6</sup>A sites but inferior for the <i>Saccharomyces cerevisiae</i> dataset that covers around a thousand positives. The accuracy of BGRU is sensitive to the data size and the sensitivity is compensated by the integration of a random forest classifier with a novel encoding of enhanced nucleic acid content. The integrated approach dubbed as BGRU-based Ensemble RNA Methylation site Predictor (BERMP) has competitive performance in both cross-validation test and independent test. BERMP also outperforms existing m<sup>6</sup>A predictors for different species. Therefore, BERMP is a novel multi-species tool for identifying m<sup>6</sup>A sites with high confidence. This classifier is freely available at http://www.bioinfogo.org/bermp.
| Year | Citations | |
|---|---|---|
Page 1
Page 1