Trans-m5C: A transformer-based model for predicting 5-methylcytosine (m5C) sites

Haitao Fu, Zewen Ding, Wen Wang*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

5-Methylcytosine (m5C) plays a pivotal role in various RNA metabolic processes, including RNA localization, stability, and translation. Current high-throughput sequencing technologies for m5C site identification are resource-intensive in terms of cost, labor, and time. As such, there is a pressing need for efficient computational approaches. Many existing computational methods rely on intricate hand-crafted features, requiring unavailable features, often leading to suboptimal prediction accuracy. Addressing these challenges, we introduce a novel deep-learning method, Trans-m5C. We first categorize m5C sites into NSUN2-dependent and NSUN6-dependent types for independent feature extraction. Subsequently, meticulously crafted transformer neural networks are employed to distill global features. The prediction of m5C sites is then accomplished using a discriminator built from a multi-layer perceptron. A rigorous evaluation for the performance of Trans-m5C on experimentally validated m5C data from human and mouse species reveals that our method offers a competitive edge over both baseline and existing methodologies.

Original languageEnglish
Pages (from-to)178-186
Number of pages9
JournalMethods
Volume234
Early online date30 Dec 2024
DOIs
Publication statusPublished - Feb 2025

Keywords / Materials (for Non-textual outputs)

  • 5-Methylcytosine/chemistry
  • Humans
  • Animals
  • Mice
  • Neural Networks, Computer
  • Computational Biology/methods
  • Deep Learning
  • High-Throughput Nucleotide Sequencing/methods
  • RNA/genetics

Fingerprint

Dive into the research topics of 'Trans-m5C: A transformer-based model for predicting 5-methylcytosine (m5C) sites'. Together they form a unique fingerprint.

Cite this