evolutionary genetics

Six reference-quality genomes reveal evolution of bat adaptations
Bats possess extraordinary adaptations, including flight, echolocation, extreme longevity and unique immunity. High-quality genomes are crucial for understanding the molecular basis and evolution of these traits. Here we incorporated long-read sequencing and state-of-the-art scaffolding protocols1 to generate, to our knowledge, the first reference-quality genomes of six bat species (Rhinolophus ferrumequinum, Rousettus aegyptiacus, Phyllostomus discolor, Myotis myotis, Pipistrellus kuhlii and Molossus molossus). We integrated gene projections from our ‘Tool to infer Orthologs from Genome Alignments’ (TOGA) software with de novo and homology gene predictions as well as short- and long-read transcriptomics to generate highly complete gene annotations. To resolve the phylogenetic position of bats within Laurasiatheria, we applied several phylogenetic methods to comprehensive sets of orthologous protein-coding and noncoding regions of the genome, and identified a basal origin for bats within Scrotifera. Our genome-wide screens revealed positive selection on hearing-related genes in the ancestral branch of bats, which is indicative of laryngeal echolocation being an ancestral trait in this clade. We found selection and loss of immunity-related genes (including pro-inflammatory NF-κB regulators) and expansions of anti-viral APOBEC3 genes, which highlights molecular mechanisms that may contribute to the exceptional immunity of bats. Genomic integrations of diverse viruses provide a genomic record of historical tolerance to viral infection in bats. Finally, we found and experimentally validated bat-specific variation in microRNAs, which may regulate bat-specific gene-expression programs. Our reference-quality bat genomes provide the resources required to uncover and validate the genomic basis of adaptations of bats, and stimulate new avenues of research that are directly relevant to human health and disease1.
Species tree disequilibrium positively misleads models of gene family evolution
Gene duplication is a key source of evolutionary innovation, and multigene families evolve in a birth-death process, continuously duplicating and pseudogenizing through time. To empirically test hypotheses about adaptive expansion and contraction of multigene families across species, models infer gene gain and loss in light of speciation events and these inferred gene family expansions may lead to interpretations of adaptations in particular lineages. While the relative abundance of a gene subfamily in the subgenome may reflect its functional importance, tests based on this expectation can be confounded by the complex relationship between the birth-death process of gene subfamily evolution and the species phylogeny. Using simulations, we confirmed tree heterogeneity as a confounding factor in inferring multi-gene adaptation, causing spurious associations between shifts in birth-death rate and lineages with higher branching rates. We then used the olfactory receptor (OR) repertoire, the largest gene family in the mammalian genome, of different bat species with divergent diets to test whether expansions in olfactory receptors are associated with shifts to frugivorous diets. After accounting for tree heterogeneity, we robustly inferred that certain OR subfamilies exhibited expansions associated with dietary shifts to frugivory. Taken together, these results suggest ecological correlates of individual OR gene subfamilies can be identified, setting the stage for detailed inquiry into within-subfamily functional differences.