Learning to See Peaks: Attention-Based Feature Extraction for Automated Chromatographic Peak Detection

Walter, Daniel; Voltmer, Dominik; Großkopf, Tobias; Weydanz, Birgit; Marr, Carsten; Helbig, Mathias; Bonfiglio, Juan Jose

doi:10.1021/acsomega.6c01862

Journal Article

DKFZ-2026-01326

Walter, D. ; Helbig, M. ; Weydanz, B. ; Voltmer, D. ; Bonfiglio, J. J. ; Marr, C. (DKFZ*) ; Großkopf, T.

2026
ACS Publications Washington, DC

ACS omega nn, nn (2026) [10.1021/acsomega.6c01862]

Abstract: Reliable peak detection remains a bottleneck in size-exclusion chromatography (SEC) as overlapping signals, drifting baselines, and analyst variability limit reproducibility. As SEC is a routine release and comparability assay and its interpretation depends on peak morphology and context, machine learning methods are well-suited to improve reproducibility at scale. We present the Peak Feature Extractor 1 (PFE-1), a one-dimensional encoder-only transformer trained on millions of synthetic chromatograms generated by a simulator statistically calibrated to routine SEC data from antibodies and related large-molecule species. PFE-1 outputs probabilistic region and event predictions that are aggregated through a transparent rule-based procedure into interpretable peak boxes. We evaluate PFE-1 on synthetic benchmarks and on a curated real SEC benchmark, reporting window-level precision/recall/F1 and box-level agreement via an intensity-weighted box loss aligned with routine process annotations. Across these evaluations, PFE-1 outperforms convolutional and derivative-based baselines, with the largest gains observed under more challenging overlap and morphology conditions. On synthetic data, PFE-1 achieves substantially higher box-level agreement than both baselines; on the curated real SEC benchmark, it likewise achieves the strongest box-level agreement while requiring no sample-specific inputs (e.g., expected peak windows). We provide a reproducible and extensible SEC-specific framework for chromatographic peak detection that supports a more consistent peak interpretation in routine analytical workflows.
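The abstract states that probabilistic region predictions are aggregated through a rule-based procedure into peak boxes but does not give the rules. The following is only a minimal illustrative sketch of one such aggregation, assuming per-point region probabilities, a fixed probability threshold, and a minimum box width. The function name `probabilities_to_boxes` and the parameters `threshold` and `min_width` are hypothetical and are not taken from the paper.

```python
from typing import List, Tuple


def probabilities_to_boxes(
    region_prob: List[float],
    threshold: float = 0.5,   # assumed cutoff, not from the paper
    min_width: int = 3,       # assumed minimum box width in points
) -> List[Tuple[int, int]]:
    """Group consecutive points with probability >= threshold into
    (start, end) index pairs, both inclusive. Runs shorter than
    min_width points are discarded."""
    boxes: List[Tuple[int, int]] = []
    start = None  # index where the current run began, if any
    for i, p in enumerate(region_prob):
        if p >= threshold:
            if start is None:
                start = i
        elif start is not None:
            # The run ended at i - 1; keep it only if it is wide enough.
            if i - start >= min_width:
                boxes.append((start, i - 1))
            start = None
    # Close a run that extends to the end of the trace.
    if start is not None and len(region_prob) - start >= min_width:
        boxes.append((start, len(region_prob) - 1))
    return boxes


probs = [0.1, 0.2, 0.8, 0.9, 0.95, 0.7, 0.3, 0.1, 0.6, 0.2, 0.9, 0.9, 0.9]
boxes = probabilities_to_boxes(probs)
print(boxes)  # [(2, 5), (10, 12)]
```

In this toy trace, the single point above the threshold at index 8 is dropped because it is narrower than `min_width`; the actual PFE-1 procedure may combine region and event predictions in a different way.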

Classification:

ddc:660

Note: #DKTKZFB9# / epub

Contributing Institute(s):

DKTK Koordinierungsstelle München (MU01)

Research Program(s):

899 - without Topic (POF4-899)

Appears in the scientific report 2026

Database coverage:
Medline ; Article Processing Charges ; Clarivate Analytics Master Journal List ; Current Contents - Physical, Chemical and Earth Sciences ; DOAJ Seal ; Essential Science Indicators ; Fees ; IF < 5 ; JCR ; PubMed Central ; SCOPUS ; Science Citation Index Expanded ; Web of Science Core Collection

The record appears in these collections:
Document types > Articles > Journal Article
Public records
Publications database

Record created 2026-06-03, last modified 2026-06-03
