MethylBERT:‬ ‭A‬ ‭Transformer-based‬ ‭model‬ ‭for‬ ‭read-level‬ ‭DNA‬ ‭methylation‬ pattern‬ ‭identification‬ ‭and‬ ‭tumour‬ ‭deconvolution‬

Pavlo Lutsik, Yunhee Yeong,Karl Rohr

crossref(2024)

引用 0|浏览0
暂无评分
摘要
Abstract DNA methylation (DNAm) is a key epigenetic mark that shows profound alterations in cancer. Read-level methylomes enable more in-depth DNAm analysis due to the broad coverage and preservation of rare cell-type signals, compared to array-based data such as 450K/EPIC array. Here, we propose MethylBERT, a novel Transformer-based model for read-level methylation pattern classification. MethylBERT identifies tumour-derived sequence reads based on their methylation patterns and genomic sequence. Based on the calculated classification probability, the method estimates tumour cell fractions within bulk samples and provides an assessment of the model precision. In our evaluation, MethylBERT outperforms existing deconvolution methods and demonstrates high accuracy regardless of methylation pattern complexity, read length and read coverage. Moreover, we show its potential for accurate non-invasive early cancer diagnostics using liquid biopsy samples. MethylBERT represents a significant advancement in read-level methylome analysis. It will increase the accuracy of tumour deconvolution and enhance circulating tumour DNA studies.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要