Abstract
5-Methylcytosine (m5C) plays a pivotal role in various RNA metabolic processes, including RNA localization, stability, and translation. Current high-throughput sequencing technologies for m5C site identification are resource-intensive in terms of cost, labor, and time. As such, there is a pressing need for efficient computational approaches. Many existing computational methods rely on intricate hand-crafted features, requiring unavailable features, often leading to suboptimal prediction accuracy. Addressing these challenges, we introduce a novel deep-learning method, Trans-m5C. We first categorize m5C sites into NSUN2-dependent and NSUN6-dependent types for independent feature extraction. Subsequently, meticulously crafted transformer neural networks are employed to distill global features. The prediction of m5C sites is then accomplished using a discriminator built from a multi-layer perceptron. A rigorous evaluation for the performance of Trans-m5C on experimentally validated m5C data from human and mouse species reveals that our method offers a competitive edge over both baseline and existing methodologies.
Original language | English |
---|---|
Pages (from-to) | 178-186 |
Number of pages | 9 |
Journal | Methods |
Volume | 234 |
Early online date | 30 Dec 2024 |
DOIs | |
Publication status | Published - Feb 2025 |
Keywords / Materials (for Non-textual outputs)
- 5-Methylcytosine/chemistry
- Humans
- Animals
- Mice
- Neural Networks, Computer
- Computational Biology/methods
- Deep Learning
- High-Throughput Nucleotide Sequencing/methods
- RNA/genetics