Link to bioRxiv paper: http://biorxiv.org/cgi/content/short/2023.04.17.537144v1?rss=1
Authors: Rohrlach, A. B., Tuke, J., Popli, D., Haak, W.
Abstract: Robust and reliable estimates of how individuals are biologically related to each other are a key source of information when reconstructing pedigrees. In combination with contextual data, reconstructed pedigrees can be used to infer possible kinship practices in prehistoric populations. However, standard methods to estimate biological relatedness from genome sequence data cannot be applied to low coverage sequence data, such as are common in ancient DNA (aDNA) studies. Critically, a statistically robust method for assessing and quantifying the confidence of a classification of a specific degree of relatedness for a pair of individuals, using low coverage genome data, is lacking. In this paper we present the R-package BREADR (Biological RElatedness from Ancient DNA in R), which leverages the so-called pairwise mismatch rate, calculated on optimally-thinned genome-wide pseudo-haploid sequence data, to estimate genetic relatedness up to the second degree, assuming an underlying binomial distribution. BREADR also returns a posterior probability for each degree of relatedness, from identical twins/same individual, first-degree, second-degree or "unrelated" pairs, allowing researchers to quantify and report the uncertainty, even for particularly low-coverage data. We show that this method accurately recovers degrees of relatedness for sequence data with coverage as low as 0.04X using simulated data, and then compare the performance of BREADR on empirical data from Bronze Age Iberian human sequence data. The BREADR package is designed for pseudo-haploid genotype data, common in aDNA studies.
Copy rights belong to original authors. Visit the link for more info
Podcast created by Paper Player, LLC