Journal Article DKFZ-2025-00872

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
Identifying similar populations across independent single cell studies without data integration.

 ;  ;  ;  ;  ;  ;

2025
Oxford University Press Oxford

NAR: genomics and bioinformatics 7(2), lqaf042 () [10.1093/nargab/lqaf042]
 GO

This record in other databases:

Please use a persistent id in citations: doi:

Abstract: Supervised and unsupervised methods have emerged to address the complexity of single cell data analysis in the context of large pools of independent studies. Here, we present ClusterFoldSimilarity (CFS), a novel statistical method design to quantify the similarity between cell groups across any number of independent datasets, without the need for data correction or integration. By bypassing these processes, CFS avoids the introduction of artifacts and loss of information, offering a simple, efficient, and scalable solution. This method match groups of cells that exhibit conserved phenotypes across datasets, including different tissues and species, and in a multimodal scenario, including single-cell RNA-Seq, ATAC-Seq, single-cell proteomics, or, more broadly, data exhibiting differential abundance effects among groups of cells. Additionally, CFS performs feature selection, obtaining cross-dataset markers of the similar phenotypes observed, providing an inherent interpretability of relationships between cell populations. To showcase the effectiveness of our methodology, we generated single-nuclei RNA-Seq data from the motor cortex and spinal cord of adult mice. By using CFS, we identified three distinct sub-populations of astrocytes conserved on both tissues. CFS includes various visualization methods for the interpretation of the similarity scores and similar cell populations.

Keyword(s): Single-Cell Analysis: methods (MeSH) ; Animals (MeSH) ; Mice (MeSH) ; Spinal Cord: cytology (MeSH) ; Spinal Cord: metabolism (MeSH) ; Motor Cortex: cytology (MeSH) ; Motor Cortex: metabolism (MeSH) ; Astrocytes: metabolism (MeSH) ; Astrocytes: cytology (MeSH) ; RNA-Seq (MeSH) ; Cluster Analysis (MeSH)

Classification:

Note: #LA:B330#

Contributing Institute(s):
  1. Angewandte Bioinformatik (B330)
  2. DKTK HD zentral (HD01)
Research Program(s):
  1. 312 - Funktionelle und strukturelle Genomforschung (POF4-312) (POF4-312)

Appears in the scientific report 2025
Database coverage:
Medline ; DOAJ ; Article Processing Charges ; BIOSIS Previews ; Biological Abstracts ; Clarivate Analytics Master Journal List ; DOAJ Seal ; Emerging Sources Citation Index ; Fees ; SCOPUS ; Web of Science Core Collection
Click to display QR Code for this record

The record appears in these collections:
Document types > Articles > Journal Article
Public records
Publication Charges
Publications database

 Record created 2025-04-28, last modified 2025-05-09


Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)