combine@alvar.ug

Phylogenomics from Whole Genome Sequences Using aTRAM

Phylogenomics from Whole Genome Sequences Using aTRAM

Show simple record

dc.contributor.author Allen, Julie M.
dc.contributor.author Boyd, Bret
dc.contributor.author Nam-Phuong Nguyen
dc.contributor.author Vachaspati, Pranjal
dc.contributor.author Warnow, Tandy
dc.contributor.author Huang, Daisie I.
dc.contributor.author Grady, Patrick G. S.
dc.contributor.author Bell, Kayce C.
dc.contributor.author Cronk, Quentin C. B.
dc.contributor.author Mugisha, Lawrence
dc.contributor.author Pittendrigh, Barry R.
dc.contributor.author Soledad Leonardi, M.
dc.contributor.author Reed, David L.
dc.contributor.author Johnson, Kevin P.
dc.date.accessioned 2021-01-01T21:58:00Z
dc.date.available 2021-01-01T21:58:00Z
dc.date.issued 2017
dc.identifier.issn 1063-5157
dc.identifier.uri http://combine.alvar.ug/handle/1/48115
dc.description.abstract Novel sequencing technologies are rapidly expanding the size of data sets that can be applied to phylogenetic studies. Currently the most commonly used phylogenomic approaches involve some form of genome reduction. While these approaches make assembling phylogenomic data sets more economical for organisms with large genomes, they reduce the genomic coverage and thereby the long-term utility of the data. Currently, for organisms with moderate to small genomes (< 1000 Mbp) it is feasible to sequence the entire genome at modest coverage (10-30x). Computational challenges for handling these large data sets can be alleviated by assembling targeted reads, rather than assembling the entire genome, to produce a phylogenomic data matrix. Here we demonstrate the use of automated Target Restricted Assembly Method (aTRAM) to assemble 1107 single-copy ortholog genes from whole genome sequencing of sucking lice ( Anoplura) and out-groups. We developed a pipeline to extract exon sequences from the aTRAM assemblies by annotating them with respect to the original target protein. We aligned these protein sequences with the inferred amino acids and then performed phylogenetic analyses on both the concatenated matrix of genes and on each gene separately in a coalescent analysis. Finally, we tested the limits of successful assembly in aTRAM by assembling 100 genes from close-to distantly related taxa at high to low levels of coverage. Both the concatenated analysis and the coalescent-based analysis produced the same tree topology, which was consistent with previously published results and resolved weakly supported nodes. These results demonstrate that this approach is successful at developing phylogenomic data sets from raw genome sequencing reads. Further, we found that with coverages above 5-10x, aTRAM was successful at assembling 80-90% of the contigs for both close and distantly related taxa. As sequencing costs continue to decline, we expect full genome sequencing will become more feasible for a wider array of organisms, and aTRAM will enable mining of these genomic data sets for an extensive variety of applications, including phylogenomics.
dc.description.sponsorship National Science Foundation (NSF)National Science Foundation (NSF) [DEB-0612938, DEB-1050706, DEB-1239788, DEB-1342604]
dc.description.sponsorship NSF XSEDENational Science Foundation (NSF) [DEB-160002, DBI-1461364, ABI-1458652]
dc.description.sponsorship NSFNational Science Foundation (NSF) [DEB1310824, OCI-0725070, ACI-1238993]
dc.description.sponsorship state of Illinois
dc.language English
dc.publisher OXFORD UNIV PRESS
dc.relation.ispartof Systematic Biology
dc.subject Atram
dc.subject Gene Assembly
dc.subject Genome Sequencing
dc.subject Phylogenomics
dc.title Phylogenomics from Whole Genome Sequences Using aTRAM
dc.type Article
dc.identifier.isi 000408335700007
dc.identifier.doi 10.1093/sysbio/syw105
dc.identifier.pmid 28123117
dc.publisher.city OXFORD
dc.publisher.address GREAT CLARENDON ST, OXFORD OX2 6DP, ENGLAND
dc.identifier.eissn 1076-836X
dc.identifier.volume 66
dc.identifier.issue 5
dc.identifier.spage 786
dc.identifier.epage 798
dc.subject.wc Evolutionary Biology
dc.subject.sc Evolutionary Biology
dc.description.oa Bronze
dc.description.pages 13
dc.subject.kwp Ultraconserved Elements
dc.subject.kwp Read Alignment
dc.subject.kwp Tree
dc.subject.kwp Endosymbiont
dc.subject.kwp Enrichment
dc.subject.kwp Thousands
dc.subject.kwp Nuclear
dc.subject.kwp Resolve
dc.subject.kwp Taxa
dc.subject.kwp Aves
dc.description.affiliation Univ Illinois, Illinois Nat Hist Survey, Urbana, IL 61801 USA
dc.description.affiliation Univ Florida, Florida Museum Nat Hist, Gainesville, FL 32611 USA
dc.description.affiliation Univ Illinois, Carl R Woese Inst Genom Biol, Urbana, IL 61801 USA
dc.description.affiliation Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
dc.description.affiliation Univ British Columbia, Biodivers Res Ctr, Vancouver, BC V6T 1Z4, Canada
dc.description.affiliation Univ New Mexico, Dept Biol, Albuquerque, NM 87131 USA
dc.description.affiliation Univ New Mexico, Museum Southwestern Biol, Albuquerque, NM 87131 USA
dc.description.affiliation CEHA, Kampala, Uganda
dc.description.affiliation Makerere Univ, Anim Resources & Biosecur COVAB, Coll Vet Med, Kampala, Uganda
dc.description.affiliation Michigan State Univ, Dept Entomol, E Lansing, MI 48823 USA
dc.description.affiliation Ctr Nacl Patagen, Inst Biol Organismos Marinos, Puerto Madryn, Argentina
dc.description.affiliation Univ Illinois, Dept Bioengn, Urbana, IL 61801 USA
dc.description.email juliema@illinois.edu
dc.description.corr Allen, JM (corresponding author), Univ Illinois, Illinois Nat Hist Survey, Urbana, IL 61801 USA.
dc.description.orcid Mugisha, Lawrence/0000-0003-4932-3356
dc.description.orcid Cronk, Quentin/0000-0002-4027-7368
dc.description.orcid Nguyen, Nam-phuong/0000-0002-3932-063X
dc.description.orcid Leonardi, Maria Soledad/0000-0002-1736-7031


This record appears in the collections of the following institution(s)

Show simple record

Search Entire Database


Browse

My Account