Bridging the micro- and macroevolutionary levels in phylogenomics: Hyb-Seq solves relationships from populations to species and above
Tamara Villaverde, Elliot M. Gardner, Lisa Pokorny, Sanna Olsson, Mario Rincon-Barrado, Norman J. Wickett, Julia Molero, Ricarda Riina, Matthew G. Johnson and Isabel Sanmartın
https://nph.onlinelibrary.wiley.com/doi/10.1111/nph.15312
Richard Bačák
Introduction
Microevolution
Macroevolution
High-throughput sequencing (HTS)
Aims
Fig. 1 Morphology and geographic distribution of Euphorbia balsamifera. (a) Euphorbia balsamifera ssp. balsamifera from the coast of Western Sahara. (b) Cyathium of ssp. balsamifera from Tenerife (Canary Islands). (c) Euphorbia balsamifera ssp. sepium from inland populations in Western Sahara near the border with Mauritania. (d) Fruit and long narrow leaves typical of ssp. sepium. (e) Specimen of E. balsamifera ssp. adenensis from Oman. (f) Cyathium and apical leaves of ssp. adenensis. (g, h) Geographic distribution of Euphorbia balsamifera in the Canary Islands, North Africa, and Arabian Peninsula based on herbarium specimens: ssp. balsamifera (circles), ssp. sepium (triangles), and ssp. adenensis (diamonds); (h) Detailed distribution of ssp. balsamifera in the Canary Islands. Fuchsia, green and orange symbols represent sampled DNA specimens of each taxon included in the phylogeny shown in Fig. 3(a), whereas grey symbols (all shapes) represent herbarium records without genomic data. Photos by: (a–d) R. Riina; (e, f) J. J. Morawetz.
Methods
Sampling
Probe design and sequence capture
Methods
Data processing and phylogenetic analysis of targeted nuclear loci
Methods
Data processing and phylogenetic analysis of chloroplast (skimmed) data
Divergence time estimation
Results - Gene-capture success
Results - Phylogenetic estimation of gene tree-species tree from nuclear data
a, Maximum likelihood tree (296 concatenated exon loci,121 samples 486,878 bp) estimated in IQ-TREE
b, Species tree inferred with a multispecies coalescent (MSC) approach in ASTRAL-IIusing the 296-exon supermatrix ;SVDQUARTETS obtained the same tree exceptforthe position of E.noxia
c, Maximum likelihood tree obtained from 66 exon loci (63133 bp) from the chloroplast genome
Results - Phylogenetic estimation of gene tree-species tree and Divergence time estimation
Maximum clade credibility (MCC) tree showing Bayesian estimates of divergence times
Analysis of plastid data
Results - Divergence time estimation
Discussion
Hyb-Seq as a tool to bridge the micro- and macro evolutionary gap phylogenomics
Solving relationships within an ancient continental disjunct lineage
The sweet tabaiba or there and back again: phylogeographical history of the Macaronesian Euphorbia balsamifera
Mario Rincón-Barrado1,2,* , Tamara Villaverde3, , Manolo F. Perez4 , Isabel Sanmartín1,†,*, and Ricarda Riina1,†,*
2024
Bego
https://academic.oup.com/aob/article/133/5-6/883/7514051
Introduction
E. balsamifera, E. adenensis & E. sepium
Populations of E. balsamifera sampled
Rand Flora pattern
Introduction
Aims
(1) Test the persistence of relict populations of E. balsamifera in the continent (north-west Africa) vs. climatic extinction in north-west Africa and a posterior back-colonization to the continent hypothesis, by building a more robust population-level phylogeny of E. balsamifera and allies
(2) Discriminate among alternative colonization and diversification scenarios of E. balsamifera in the Canary Islands (the CIH and SSH models and the possibility of back-colonization), using CNNs and a large SNP dataset extracted from the target loci.
(3) Assessing the impact of degraded DNA in the Hyb-Seq technique and how this can affect the reconstruction of relationships at both micro- and macro-evolutionary levels.
Methods
Sampling
32 samples of E. balsamifera collected in silica gel from seven localities in the islands of Fuerteventura, Lanzarote, Tenerife, La Gomera, La Palma and El Hierro, and 15 samples collected from four localities along the coasts of Morocco and Western Sahara
Data processing
HybPiper 1 pipeline → final dataset of 298 gene matrices with exon information and a concatenated total length of 511 697 bp
298 matrices were analysed using two approaches:
1) Trees analysed with ASTRAL to estimate a species-level phylogenetic tree under the multispecies coalescent model
2) Phylogenetic tree was constructed using IQ-TREE
Convolutional neural network analysis
To test alternative colonization models of the Canary Islands → They considered each island as a single population
Results
Discusion
Back-colonization of Africa rather than relicts of mainland populations
(1) By increasing taxon sampling (adding field-collected samples from the continent and from the eastern islands), they recovered a position that was partly congruent with the IQ-TREE analysis of Villaverde et al. (2018): the continental samples from north-west Africa were placed within a clade that included individuals from Lanzarote and Fuerteventura. Moreover, this western–eastern topology was supported by all our analyses [IQ-TREE (Fig. 3), ASTRAL (Fig. 4), BEAST (Fig. 6) and SVDQuartets (Fig. 7)], with high support.
Therefore, their results reject the hypothesis that the north-west African populations of E. balsamifera are climatic relicts
Discussion
Macro- and micro-evolution in ‘sweet tabaiba’
(2) Phylogenetic relationships at the species level (Figs 3 and 4) were identical to those recovered by Villaverde et al. (2018).
The phylogenetic trees generated from the Hyb-Seq DNA sequences show a clear east/west division among the Canarian populations of E. balsamifera → This phylogeographical pattern fits better the CIH model, which predicts dispersal from the island of Tenerife towards the east and west of the archipelago
Discussion
Effect of the quality of herbarium material on Hyb-Seq efficiency at different evolutionary levels
(3) DNA sequences generated with Hyb-Seq from old herbarium samples might appear closely related and converge in the phylogenetic tree, not because of sequence similarity but because of the type and amount of damage sustained.
Artefacts, caused by low-quality DNA, are not always present and depend on the evolutionary level being tackled. For example, the herbarium samples of E. balsamifera, E. sepium and E. adenensis exhibited lower percentages of capture success. Despite that, all herbarium samples were correctly positioned within their respective species clades, and with high clade support in both studies.
In Villaverde et al. (2018), using only herbarium samples from the continental populations, obtained contradictory results between the multispecies coalescent analyses (ASTRAL and SVDQuartets) and the IQ-TREE concatenated approach. Here, they obtained same results
Conclusions
The study supports the efficiency of the Hyb-Seq technique to provide phylogenetic resolution at the micro- and macroevolutionary levels using freshly collected material and herbarium material, although the latter should be used with caution.
Also, their results support the Rand Flora hypothesis for the disjunct distribution of the Canarian E. balsamifera relative to its closest species (E. adenensis) on the opposite side of the Sahara Desert.
Phylogenetic relationships and the identification of
allopolyploidy in circumpolar Silene sect. Physolychnis
Anne‐Sophie Quatela, Patrik Cangren, Paola de Lima Ferreira, Yannick Woudstra, Andreas Zsoldos‐Skahjem, Christine D. Bacon, Hugo J. de Boer, Bengt Oxelman, 2025
https://doi.org/10.1002/ajb2.70051
Eliška Krtilová
Introduction
Aims of the study
Materials and methods
Sampling
Laboratory procedure
Quality control
Materials and methods
Bioinformatics
Materials and methods
Genetic distances between phased haplotypes
Phylogenomic analysis
Results
Results
Conclusion
Limitations
From museum drawer to tree: Historical DNA
phylogenomics clarifies the systematics of
rare dung beetles (Coleoptera: Scarabaeinae)
from museum collections
Fernando Lopes, Nicole Gunter, Conrad P. D. T. Gillett, Giulio Montanaro, Michele Rossini, Federica Losacco, Gimo M. Daniel, Nicolas Straube, Sergei Tarasov
Karolina Mahlerová
https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0309596
Introduction
Nesosisyphus rotundatus (1944): Holotype, potentially extinct.
Onychothecus (1985): Rare genus with no prior molecular data.
Helictopleurus spp. (2003–2010): Endemic Madagascar taxa.
Recover full phylogeny of this group + compare extraction methods
Broad coverage of biogeographical regions and evolutionary lineages.
Combines historical, archival, and fresh/alcohol-preserved specimens.
Materials and methods
Materials and methods
Data analysis pipeline
Demultiplexed, read clean
quality control
Assembly of reads into contigs
Extract UCE loci from contigs (Phyluce + Scarab 3Kv1 probe set)
extracting loci from GenBank
MAFFT alignment
Trimming of alignment
Symmetry test to remove gene trees that would violate phylogenetic assumption p ≤ 0.05
Dataset construction and defining completeness threshold (Partitioned and unpartitioned variants run for comparison)
ML species tree
Results
(A) Distribution of read length generated per sample, demonstrating that the density of shorter reads was generally higher from archival extractions
(B) Distribution of the number of UCE loci per sample, demonstrating that the greatest density of samples that generated large numbers of captured loci resulted from archival extractions
Shorter reads for dry museum specimens → Museum specimens, when processed correctly, yielded the highest density of samples with large numbers of recovered loci
Results
Zuzana Chumová, Eliška Záveská, Petra Hloušková, Jan Ponert, Philipp-André Schmidt, Martin Čertner, Terezie Mandáková, Pavel Trávníček
Přemysl Baláž
Motivation
Partial endopolyploidy
Custom probe design and data analysis
Results
Results
Dracula
Pleurothallis
Restrepia
Discussion topics
How deep the rabbit hole goes
Diploidizied polyploids?
70-chromosomes group vs rest of species with 36-48 chromosomes
Dimple
INTRODUCTION
Bovine leukemia virus (BLV) is an oncogenic retrovirus which induces malignant lymphoma termed enzootic bovine leukosis (EBL) after a long incubation period.
EBL is generally characterized by local and systemic tumor development following a long incubation period of several years after BLV infection
Only 5% of BLV infected cattle are thought to develop EBL
The BLV genome is reverse transcribed to double-stranded DNA and integrated into the host genome as a provirus.
genes coding regulatory protein
major genes coding structural proteins and enzymes
Oncogenic
OBJECTIVES
Aims of the Paper:
Materials and methods
1)DNA probes- designed 145 probes to capture BLV genome and Host TNF alpha promoter region
2)Sample collection, genotype analysis(genotype 1, genotype 3)(Categorization- AS, PL).
3)Library prep(NEBNext Ultra II DNA Library Prep Kit and NEBNext Multiplex Oligos for Illumina )and targeted enrichment (biotin label dna probe and DNA libraries enriched for proviral sequences).
4)High throughput sequencing data analysis.
5)IS and proviral structure analysis with DNA‑seq data.
6)PCR amplification of viral‑host junctions.
7)Estimating B cell clone abundance to understand progression of disease like leukemia.
8)Estimates of evolutionary divergence between BLV genotypes( Av. no. of base diff per site between sequences)
9)Detection of positive selection sites to see evolution favour virus genome focused on genotype 3.
Results
Figure 1. Application of DNA-capture-seq to analyze BLV proviral sequences and ISs. (A)
Schematic diagram of the application of the target enrichment. (B) Visualization of sequence reads mapped to the FLK-BLV sequence (EF600696). NGS reads mapped to BLV are shown below the reference sequence. (C) Maximumlikelihood phylogenetic tree analysis of BLV whole genome sequences were generated with five newly obtained sequences (a sequence from FLK-BLV cell line indicated by filled square and four sequences from EBL tumor samples indicated by filled circle) together with 53 sequences from the GenBank database. The phylogenetic tree was generated and visualized using MEGA 7 with 1000 bootstrap replicates. The bar at the bottom of the figure
denotes the estimated number of amino acid substitutions per site, indicating genetic variation for the length of the scale.
2) BLV ISs in FLK‑BLV cell line
low OCI values suggest abundant representation of multiple clones
polyclonal populations
Monoclonal populations
3) Application of DNA‑capture‑seq to the study of asymptomatic (AS) and persistent lymphocytosis (PL) cattle &
Integration Site (IS) Analysis
4) Application of DNA-Capture-Seq to BLV-Infected and EBL Cattle
Phylogenetic studies classify BLV into 11 genotypes globally.�Genotype 1 is predominant; genotype 3 is rare, with only one full-length genotype 3 sequence reported previously (LC164084).
5) SNP detection of BLV infected cattle-
Conclusion:
Targeted Enrichment of Large Gene Families for Phylogenetic Inference: Phylogeny and Molecular Evolution of Photosynthesis Genes in the Portullugo Clade (Caryophyllales)
ABIGAIL J. MOORE, JURRIAAN M. DE VOS, LILLIAN P. HANCOCK, ERIC GOOLSBY, AND ERIKA J. EDWARDS
Martina Omelková
Background and Study Goals
Background and Study Goals
Methods
Data Processing
➤ Part II then constructed longer sequences from contigs and assigned them to particular paralogs within a gene family.
➤ Part III identified gene duplications within gene families,�extracted phylogenetically useful sets of orthologs, �and used them for phylogenetic analysis.
Results – Phylogenetic Relationships in Portullugo
Robust backbone phylogeny
Gene tree discordance
Role of paralogs
Results – Gene Family Evolution and Photosynthetic Pathways
Summary- Photosynthetic Pathways
Broader implications