Synteny identifies reliable orthologs for phylogenomics and comparative genomics of the brassicaceae

Large genomic data sets are becoming the new normal in phylogenetic research, but the identification of true orthologous genes and the exclusion of problematic paralogs is still challenging when applying commonly used sequencing methods such as target enrichment. Here, we compared conventional ortho...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Walden, Nora (VerfasserIn) , Schranz, M. Eric (VerfasserIn)
Dokumenttyp: Article (Journal)
Sprache:Englisch
Veröffentlicht: 28 February 2023
In: Genome biology and evolution
Year: 2023, Jahrgang: 15, Heft: 3, Pages: 1-14
ISSN:1759-6653
DOI:10.1093/gbe/evad034
Online-Zugang:Verlag, lizenzpflichtig, Volltext: https://doi.org/10.1093/gbe/evad034
Volltext
Verfasserangaben:Nora Walden and Michael Eric Schranz
Beschreibung
Zusammenfassung:Large genomic data sets are becoming the new normal in phylogenetic research, but the identification of true orthologous genes and the exclusion of problematic paralogs is still challenging when applying commonly used sequencing methods such as target enrichment. Here, we compared conventional ortholog detection using OrthoFinder with ortholog detection through genomic synteny in a data set of 11 representative diploid Brassicaceae whole-genome sequences spanning the entire phylogenetic space. Then, we evaluated the resulting gene sets regarding gene number, functional annotation, and gene and species tree resolution. Finally, we used the syntenic gene sets for comparative genomics and ancestral genome analysis. The use of synteny resulted in considerably more orthologs and also allowed us to reliably identify paralogs. Surprisingly, we did not detect notable differences between species trees reconstructed from syntenic orthologs when compared with other gene sets, including the Angiosperms353 set and a Brassicaceae-specific target enrichment gene set. However, the synteny data set comprised a multitude of gene functions, strongly suggesting that this method of marker selection for phylogenomics is suitable for studies that value downstream gene function analysis, gene interaction, and network studies. Finally, we present the first ancestral genome reconstruction for the Core Brassicaceae which predating the Brassicaceae lineage diversification ∼25 million years ago.
Beschreibung:Gesehen am 19.04.2023
Beschreibung:Online Resource
ISSN:1759-6653
DOI:10.1093/gbe/evad034