Human Population Genetics and Genomics ISSN 2770-5005

Human Population Genetics and Genomics 2024;4(1):0001 | https://doi.org/10.47248/hpgg2404010001

Original Research Open Access

GRUPS-rs, a high-performance ancient DNA genetic relatedness estimation software relying on pedigree simulations

Maël Lefeuvre 1 , Michael David Martin 2 , Flora Jay 3 , Marie-Claude Marsolier 1,4 , Céline Bon 1

  • UMR 7206 Eco-Anthropologie (EA), CNRS, Muséum National d’Histoire Naturelle, Université Paris Cité, 75116 Paris, France
  • Department of Natural History, NTNU University Museum, Norwegian University of Science and Technology (NTNU), 7491 Trondheim, Norway
  • UMR9015 - Laboratoire Interdisciplinaire des Sciences du Numérique (LISN), CNRS, INRIA, Université Paris-Saclay, 91400 Orsay, France
  • UMR 9198 - Institut de Biologie Intégrative de la Cellule (I2BC), CNRS, CEA, Université Paris-Saclay, 91190 Gif-sur-Yvette, France

Correspondence: Maël Lefeuvre; Céline Bon

Academic Editor(s): Maanasa Raghavan

Received: Sep 28, 2023 | Accepted: Dec 18, 2023 | Published: Jan 4, 2024

This article belongs to the Special Issue

Cite this article: Lefeuvre M, Martin M, Jay F, Marsolier M, Bon C. GRUPS-rs, a high-performance ancient DNA genetic relatedness estimation software relying on pedigree simulations. Hum Popul Genet Genom 2024; 4(1):0001. https://doi.org/10.47248/hpgg2404010001

Abstract

Background: The study of fine-grain genetic kinship ties (parents, siblings, cousins, etc.) from ancient remains is nowgaining significant interest within the field of paleogenetics, as a means of deciphering the social organization of past societies. However, kinship analyses are in practice often quite difficult to apply within paleogenetic studies, and may carry a high degree of uncertainty in the results they provide, especially when applied on low coverage and/or highly degraded samples, or when studying poorly characterized populations. To overcome these challenges, most of the available kinship estimation methods either refrain from inferring ties beyond the second degree (e.g., half-siblings), and/or rely on the use of a cohort of individuals to obtain a satisfactory statistical significance. Thus, the current state of the art remains intrinsically limited when attempting to estimate kinship on a small number of individuals, or when trying to detect more distant relationships (e.g., cousins).

Methods: Here, we present GRUPS-rs: an update and complete reimplementation of GRUPS (Get Relatedness Using Pedigree Simulations), an ancient DNA kinship estimation software based on the methods originally developed in (Martin et al. 2017). GRUPS-rs both computes an estimate of relatedness from randomly sampled pseudo-haploidized variant calls, and leverages high-definition pedigree simulations to bypass the use of a cohort of individuals.

Results: We highlight that GRUPS and GRUPS-rs are especially suitable to perform kinship analysis on a restricted number of ancient samples, and can provide a sufficient statistical significance to estimate genetic relatedness past the second degree, while taking into account user-defined contamination and sequencing error estimates. Importantly, GRUPS-rs offers an estimated 14000-fold speed-up in runtime performance compared to its predecessor — allowing the joint estimation of kinship between dozens of individuals in a matter of minutes — and is now bundled with a user-friendly Shiny interface, in which users can interactively visualize their results.

Conclusions: The GRUPS kinship estimation method is now fully operational in its “GRUPS-rs” implementation, whose use is particularly recommended when analyzing a restricted number of low coverage DNA samples.

Keywords

kinship estimation, genetic relatedness, ancient DNA, paleogenomics, pedigree simulations

Share this article

About Us Journals Join Us Submit Fees Contact