Journal Article (Review Article) DKFZ-2025-01826

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
A practical guide to identifying associations between tandem repeats and complex human traits using consensus genotypes from multiple tools.

 ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;

2025
Nature Publishing Group Basingstoke

Nature protocols nn, nn () [10.1038/s41596-025-01231-y]
 GO

This record in other databases:  

Please use a persistent id in citations: doi:

Abstract: Tandem repeats (TRs) are highly variable loci in the human genome that are linked to various human phenotypes. Accurate and reliable genotyping of TRs is important in understanding population TR variation dynamics and their effects in TR-trait association studies. In this protocol, we describe how to generate high-quality consensus TR genotypes for population genomics studies. In particular, we detail steps to: (i) perform TR genotyping from short-read whole-genome sequencing data by using the HipSTR, GangSTR, adVNTR and ExpansionHunter tools, (ii) perform quality control checks on TR genotypes by using TRTools and (iii) integrate TR genotypes from different tools by using EnsembleTR. We further discuss how to visualize and investigate TR variation patterns to identify population-specific expansions and perform TR-trait association analyses. We demonstrate the utility of these steps by analyzing a small dataset from the 1000 Genomes Project. In addition, we recapitulate a previously identified association between TR length and gene expression in the African population and provide a generalized discussion on TR analysis and its relevance to identifying complex traits. The expected time for installing the necessary software for each section is ~10 min. The expected run time on the user's desired dataset can vary from hours to days depending on factors such as the size of the data, input parameters and the capacity of the computing infrastructure.

Classification:

Note: #LA:B330# / epub

Contributing Institute(s):
  1. Angewandte Bioinformatik (B330)
Research Program(s):
  1. 312 - Funktionelle und strukturelle Genomforschung (POF4-312) (POF4-312)

Appears in the scientific report 2025
Database coverage:
Medline ; Clarivate Analytics Master Journal List ; Current Contents - Life Sciences ; DEAL Nature ; Essential Science Indicators ; IF >= 10 ; JCR ; NationallizenzNationallizenz ; SCOPUS ; Science Citation Index Expanded ; Web of Science Core Collection
Click to display QR Code for this record

The record appears in these collections:
Document types > Articles > Journal Article
Public records
Publications database

 Record created 2025-09-02, last modified 2025-09-07



Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)