Journal Article DKFZ-2025-00169

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
MethylBERT enables read-level DNA methylation pattern identification and tumour deconvolution using a Transformer-based model.

 ;  ;  ;  ;  ;

2025
Springer Nature [London]

Nature Communications 16(1), 788 () [10.1038/s41467-025-55920-z]
 GO

This record in other databases:  

Please use a persistent id in citations: doi:

Abstract: DNA methylation (DNAm) is a key epigenetic mark that shows profound alterations in cancer. Read-level methylomes enable more in-depth analyses, due to their broad genomic coverage and preservation of rare cell-type signals, compared to summarized data such as 450K/EPIC microarrays. Here, we propose MethylBERT, a Transformer-based model for read-level methylation pattern classification. MethylBERT identifies tumour-derived sequence reads based on their methylation patterns and local genomic sequence, and estimates tumour cell fractions within bulk samples. In our evaluation, MethylBERT outperforms existing deconvolution methods and demonstrates high accuracy regardless of methylation pattern complexity, read length and read coverage. Moreover, we show its applicability to cell-type deconvolution as well as non-invasive early cancer diagnostics using liquid biopsy samples. MethylBERT represents a significant advancement in read-level methylome analysis and enables accurate tumour purity estimation. The broad applicability of MethylBERT will enhance studies on both tumour and non-cancerous bulk methylomes.

Keyword(s): DNA Methylation (MeSH) ; Humans (MeSH) ; Neoplasms: genetics (MeSH) ; Algorithms (MeSH) ; Epigenesis, Genetic (MeSH) ; Epigenome (MeSH) ; Software (MeSH) ; Sequence Analysis, DNA: methods (MeSH) ; Epigenomics: methods (MeSH)

Classification:

Note: #EA:B370#LA:B370# / #DKFZ-MOST-Ca191#

Contributing Institute(s):
  1. Epigenomik (B370)
Research Program(s):
  1. 312 - Funktionelle und strukturelle Genomforschung (POF4-312) (POF4-312)

Appears in the scientific report 2025
Database coverage:
Medline ; Creative Commons Attribution CC BY (No Version) ; DOAJ ; OpenAccess ; Article Processing Charges ; BIOSIS Previews ; Biological Abstracts ; Clarivate Analytics Master Journal List ; Current Contents - Agriculture, Biology and Environmental Sciences ; Current Contents - Life Sciences ; Current Contents - Physical, Chemical and Earth Sciences ; DOAJ Seal ; Essential Science Indicators ; Fees ; IF >= 15 ; JCR ; SCOPUS ; Science Citation Index Expanded ; Web of Science Core Collection ; Zoological Record
Click to display QR Code for this record

The record appears in these collections:
Document types > Articles > Journal Article
Public records
Publication Charges
Publications database
Open Access

 Record created 2025-01-20, last modified 2026-02-05


OpenAccess:
Download fulltext PDF Download fulltext PDF (PDFA)
Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)