combine@alvar.ug

Low-coverage sequencing cost-effectively detects known and novel variation in underrepresented populations

Low-coverage sequencing cost-effectively detects known and novel variation in underrepresented populations

Show simple record

dc.contributor.author Alicia R. Martin
dc.contributor.author Elizabeth G. Atkinson
dc.contributor.author Sinéad B. Chapman
dc.contributor.author Anne Stevenson
dc.contributor.author Rocky E. Stroud
dc.contributor.author Tamrat Abebe
dc.contributor.author Dickens Akena
dc.contributor.author Melkam Alemayehu
dc.contributor.author Fred K. Ashaba
dc.contributor.author Lukoye Atwoli
dc.contributor.author Tera Bowers
dc.contributor.author Lori B. Chibnik
dc.contributor.author Mark J. Daly
dc.contributor.author Timothy DeSmet
dc.contributor.author Sheila Dodge
dc.contributor.author Abebaw Fekadu
dc.contributor.author Steven Ferriera
dc.contributor.author Bizu Gelaye
dc.contributor.author Stella Gichuru
dc.contributor.author Wilfred Injera
dc.contributor.author Roxanne James
dc.contributor.author Symon M. Kariuki
dc.contributor.author Gabriel Kigen
dc.contributor.author Karestan C. Koenen
dc.contributor.author Edith Kwobah
dc.contributor.author Joseph Kyebuzibwa
dc.contributor.author Lerato Majara
dc.contributor.author Henry Musinguzi
dc.contributor.author Rehema M. Mwema
dc.contributor.author Benjamin M. Neale
dc.contributor.author Carter P. Newman
dc.contributor.author Charles R. Newton
dc.contributor.author Joseph K. Pickrell
dc.contributor.author Raj Ramesar
dc.contributor.author Welelta Shiferaw
dc.contributor.author Dan J. Stein
dc.contributor.author Solomon Teferra
dc.contributor.author Celia van der Merwe
dc.contributor.author Zukiswa Zingela
dc.date.accessioned 2021-01-11T13:51:53Z
dc.date.available 2021-01-11T13:51:53Z
dc.date.issued 2020
dc.identifier.uri https://combine.alvar.ug/handle/1/49677
dc.description.abstract Background: Genetic studies of biomedical phenotypes in underrepresented populations identify disproportionate numbers of novel associations. However, current genomics infrastructure--including most genotyping arrays and sequenced reference panels--best serves populations of European descent. A critical step for facilitating genetic studies in underrepresented populations is to ensure that genetic technologies accurately capture variation in all populations. Here, we quantify the accuracy of low-coverage sequencing in diverse African populations. Results: We sequenced the whole genomes of 91 individuals to high-coverage (>20X) from the Neuropsychiatric Genetics of African Population-Psychosis (NeuroGAP-Psychosis) study, in which participants were recruited from Ethiopia, Kenya, South Africa, and Uganda. We empirically tested two data generation strategies, GWAS arrays versus low-coverage sequencing, by calculating the concordance of imputed variants from these technologies with those from deep whole genome sequencing data. We show that low-coverage sequencing at a depth of ≥4X captures variants of all frequencies more accurately than all commonly used GWAS arrays investigated and at a comparable cost. Lower depths of sequencing (0.5-1X) performed comparable to commonly used low-density GWAS arrays. Low-coverage sequencing is also sensitive to novel variation, with 4X sequencing detecting 45% of singletons and 95% of common variants identified in high-coverage African whole genomes. Conclusion: These results indicate that low-coverage sequencing approaches surmount the problems induced by the ascertainment of common genotyping arrays, including those that capture variation most common in Europeans and Africans. Low-coverage sequencing effectively identifies novel variation (particularly in underrepresented populations), and presents opportunities to enhance variant discovery at a similar cost to traditional approaches.
dc.publisher Cold Spring Harbor Laboratory
dc.title Low-coverage sequencing cost-effectively detects known and novel variation in underrepresented populations
dc.type Preprint
dc.identifier.doi 10.1101/2020.04.27.064832
dc.identifier.mag 3021321134
dc.identifier.lens 096-823-652-642-595
dc.subject.lens-fields Genome-wide association study
dc.subject.lens-fields Genome
dc.subject.lens-fields Concordance
dc.subject.lens-fields Genomics
dc.subject.lens-fields Whole genome sequencing
dc.subject.lens-fields European descent
dc.subject.lens-fields Genotyping
dc.subject.lens-fields Computational biology
dc.subject.lens-fields Biology


This record appears in the collections of the following institution(s)

Show simple record

Search Entire Database


Browse

My Account