001     301753
005     20250604113441.0
024 7 _ |a 10.1371/journal.pone.0322887
|2 doi
024 7 _ |a pmid:40455868
|2 pmid
037 _ _ |a DKFZ-2025-01137
041 _ _ |a English
082 _ _ |a 610
100 1 _ |a Stolte, Marieke
|0 0009-0002-0711-6789
|b 0
245 _ _ |a Simulation study to evaluate when Plasmode simulation is superior to parametric simulation in comparing classification methods on high-dimensional data.
260 _ _ |a San Francisco, California, US
|c 2025
|b PLOS
336 7 _ |a article
|2 DRIVER
336 7 _ |a Output Types/Journal article
|2 DataCite
336 7 _ |a Journal Article
|b journal
|m journal
|0 PUB:(DE-HGF)16
|s 1748954505_28541
|2 PUB:(DE-HGF)
336 7 _ |a ARTICLE
|2 BibTeX
336 7 _ |a JOURNAL_ARTICLE
|2 ORCID
336 7 _ |a Journal Article
|0 0
|2 EndNote
520 _ _ |a Simulation studies, especially neutral comparison studies, are crucial for evaluating and comparing statistical methods as they investigate whether methods work as intended and can guide an appropriate method choice. Typically, the term simulation refers to parametric simulation, i.e. computer experiments using pseudo-random numbers. For these, the full data-generating process (DGP) and outcome-generating model (OGM) are known within the simulation. However, the specification of realistic DGPs might be difficult in practice leading to oversimplified assumptions. The problem is more severe for higher-dimensional data as the number of parameters to specify typically increases with the number of variables in the data. Plasmode simulation, which is a combination of resampling covariates from a real-life dataset from the DGP of interest together with a specified OGM is often claimed to solve this problem since no explicit specification of the DGP is necessary. However, this claim is not well supported by empirical results. Here, parametric and Plasmode simulations are compared in the context of a method comparison study for binary classification methods. We focus on studies conducted with some specific data type or application in mind whose true, unknown data-generating mechanism is mimicked. The performance of Plasmode and parametric comparison studies for estimating classifier performance is compared as well as their ability to reproduce the true method ranking. The influence of misspecifications of the DGP on the results of parametric simulation and of misspecifications of the OGM on the results of parametric and Plasmode simulation are investigated. Moreover, different resampling strategies are compared for Plasmode comparison studies. The study finds that misspecifications of the DGP and OGM negatively influence the ability of the comparison studies to estimate the classification performances and method rankings. The best choice of the resampling strategy in Plasmode simulation depends on the concrete scenario.
536 _ _ |a 313 - Krebsrisikofaktoren und Prävention (POF4-313)
|0 G:(DE-HGF)POF4-313
|c POF4-313
|f POF IV
|x 0
588 _ _ |a Dataset connected to CrossRef, PubMed, , Journals: inrepo02.dkfz.de
650 _ 2 |a Computer Simulation
|2 MeSH
650 _ 2 |a Models, Statistical
|2 MeSH
650 _ 2 |a Humans
|2 MeSH
650 _ 2 |a Algorithms
|2 MeSH
700 1 _ |a Schreck, Nicholas
|0 P:(DE-He78)0d054b6843ace36d1c965b6cb938d1c9
|b 1
|u dkfz
700 1 _ |a Slynko, Alla
|b 2
700 1 _ |a Saadati, Maral
|0 P:(DE-He78)609d3f1c1420bf59b2332eeab889cb74
|b 3
|u dkfz
700 1 _ |a Benner, Axel
|0 P:(DE-He78)e15dfa1260625c69d6690a197392a994
|b 4
|u dkfz
700 1 _ |a Rahnenführer, Jörg
|b 5
700 1 _ |a Bommert, Andrea
|b 6
700 1 _ |a data”, topic group “High-dimensional
|b 7
|e Collaboration Author
773 _ _ |a 10.1371/journal.pone.0322887
|g Vol. 20, no. 6, p. e0322887 -
|0 PERI:(DE-600)2267670-3
|n 6
|p e0322887 -
|t PLOS ONE
|v 20
|y 2025
|x 1932-6203
909 C O |o oai:inrepo02.dkfz.de:301753
|p VDB
910 1 _ |a Deutsches Krebsforschungszentrum
|0 I:(DE-588b)2036810-0
|k DKFZ
|b 1
|6 P:(DE-He78)0d054b6843ace36d1c965b6cb938d1c9
910 1 _ |a Deutsches Krebsforschungszentrum
|0 I:(DE-588b)2036810-0
|k DKFZ
|b 3
|6 P:(DE-He78)609d3f1c1420bf59b2332eeab889cb74
910 1 _ |a Deutsches Krebsforschungszentrum
|0 I:(DE-588b)2036810-0
|k DKFZ
|b 4
|6 P:(DE-He78)e15dfa1260625c69d6690a197392a994
913 1 _ |a DE-HGF
|b Gesundheit
|l Krebsforschung
|1 G:(DE-HGF)POF4-310
|0 G:(DE-HGF)POF4-313
|3 G:(DE-HGF)POF4
|2 G:(DE-HGF)POF4-300
|4 G:(DE-HGF)POF
|v Krebsrisikofaktoren und Prävention
|x 0
914 1 _ |y 2025
915 _ _ |a JCR
|0 StatID:(DE-HGF)0100
|2 StatID
|b PLOS ONE : 2022
|d 2024-12-16
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)0200
|2 StatID
|b SCOPUS
|d 2024-12-16
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)0300
|2 StatID
|b Medline
|d 2024-12-16
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)0501
|2 StatID
|b DOAJ Seal
|d 2024-02-08T09:37:46Z
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)0500
|2 StatID
|b DOAJ
|d 2024-02-08T09:37:46Z
915 _ _ |a Peer Review
|0 StatID:(DE-HGF)0030
|2 StatID
|b DOAJ : Anonymous peer review
|d 2024-02-08T09:37:46Z
915 _ _ |a Creative Commons Attribution CC BY (No Version)
|0 LIC:(DE-HGF)CCBYNV
|2 V:(DE-HGF)
|b DOAJ
|d 2024-02-08T09:37:46Z
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)0600
|2 StatID
|b Ebsco Academic Search
|d 2024-12-16
915 _ _ |a Peer Review
|0 StatID:(DE-HGF)0030
|2 StatID
|b ASC
|d 2024-12-16
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)0199
|2 StatID
|b Clarivate Analytics Master Journal List
|d 2024-12-16
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)1040
|2 StatID
|b Zoological Record
|d 2024-12-16
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)1050
|2 StatID
|b BIOSIS Previews
|d 2024-12-16
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)0160
|2 StatID
|b Essential Science Indicators
|d 2024-12-16
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)1190
|2 StatID
|b Biological Abstracts
|d 2024-12-16
915 _ _ |a WoS
|0 StatID:(DE-HGF)0113
|2 StatID
|b Science Citation Index Expanded
|d 2024-12-16
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)0150
|2 StatID
|b Web of Science Core Collection
|d 2024-12-16
915 _ _ |a IF < 5
|0 StatID:(DE-HGF)9900
|2 StatID
|d 2024-12-16
915 _ _ |a Article Processing Charges
|0 StatID:(DE-HGF)0561
|2 StatID
|d 2024-12-16
915 _ _ |a Fees
|0 StatID:(DE-HGF)0700
|2 StatID
|d 2024-12-16
920 1 _ |0 I:(DE-He78)C060-20160331
|k C060
|l C060 Biostatistik
|x 0
980 _ _ |a journal
980 _ _ |a VDB
980 _ _ |a I:(DE-He78)C060-20160331
980 _ _ |a UNRESTRICTED


LibraryCollectionCLSMajorCLSMinorLanguageAuthor
Marc 21