Hepadnavirus-GLUE: Highlights


These highlights pages aim to provide a brief overview of selected data items contained within the Hepadnavirus-GLUE project, and to illustrate how it can be used to implement reproducible comparative analyses.

eHBVs: Avi.1-Neoaves


eHBV-Avi.1-Neoaves was the first hepadnavirus-derived EVE to be described (Katzourakis and Gifford, 2010; Suh et al, 2013). It is located in an intron of the FRY Microtubule Binding Protein gene and derives from a germline incorporation event involving an ancient avihepadnavirus. This event predates the divergence of the avian clade Neoaves, which includes all modern birds except the Galloanserae (ducks and chickens) and Palaeognathae. Consequently, an Avi.1-Neoaves insertions are present in multiple avian species.

Background images for Avi.28-Alauda

Left panel: the clade 'Neoaves' includes almost all modern birds, including (clockwise) flamingos (order Phoenicopteriformes), parrots (order Psittaciformes), songbirds (order Passeriformes) and all birds of prey. Right panel: Analysis using Hepadnavirus-GLUE has revealed that at least four distinct clades (I-IV) of avihepadnavirus have existed during hepadnavirus evolution.

Links to related data items:


eHBVs: Avi.14-Gavia and Avi.15-Gavia


These two eHBV loci in the genome of the common loon (Gavia immer) derive from germline incorporation events involving ancient avihepadnaviruses. The viruses involved belong to a distinct subgroup of avihepadnaviruses, which we labeled 'clade II'. So far this group of hepadnaviruses is only known from eHBV sequences, and it remains unknown whether any modern representatives still circulate in birds.

Background images for Avi.14-Gavia and Avi.15-Gavia

Left to right: (i) The common loon (Gavia immer) contains two eHBV insertions derived from clade II aviheapdnaviruses. (ii) Emperor penguins also contain clade II avihepadnaviruses, indicating their biogeographic host range could extend to the Antarctic; eHBV insertions derived from clade II were also identified in (iii) Great crested grebe (Podiceps cristatus) and; (iv) barn owl (Tyto alba)

The two insertions are similar in that they both span near-complete hepadnaviral genomes. Remarkably, however, they appear to have been acquired in entirely different germline integration events. In our recent paper we show that - consistent with this - both insertions show a greater degree of divergence in a region of the Surface protein associated with binding to host receptors than they do in other genomic regions.

Links to related data items:


eHBVs: Avi.27-Phalacrocoracidae


Avi.27-Phalacrocoracidae is a multicopy eHBV lineage identified in cormorants (family Phalacrocoracidae). We identified approximately 300 copies of this avihepadnavirus derived element in the cormorant germline. All avi.27 elements are quite short, spanning around 350 nucleotides of the polymerase gene. Analysis of flanking DNA regions suggested that the insertion has been mobilised within the cormorant germline by transposable elements (TEs).

Background images for Avi.28-Alauda

Cormorants have many anatomical and physiological adaptations for diving, including amphibious hearing. Left to right: (i) and (ii) Great cormorant (Phalacrocorax carbo); (iii) Brandt's cormorant (Phalacrocorax penicillatus); (iv) Cormorant fishing is a traditional fishing method in which fishermen use trained cormorants to fish in rivers. Historically, cormorant fishing has taken place in Japan and China, as well as some European countries.

Phylogenies indicate that the initial germline incorporation event that gave rise to this multicopy eHBV lineage predates the diversification of cormorants. How did so many distinct eHBV/TE insertions get fixed in the cormorant germline? Possibly, the capture of EVE sequences offered a selective advantage to TE lineages that allowed them to proliferate so widely that numerous copies were fixed only by chance. Alternatively, TE sequences containing hepadnavirus-derived DNA might, for some reason, be more likely to be fixed.

Numerous lines of evidence indicate that TEs play a role in rewiring gene regulatory networks. This process is thought to underlie many of the evolutionary adaptations seen in vertebrates. Possibly, fixation of eHBV-avi.28 copies in cormorants genomes reflects a role for these eHBVs - or the TEs they are embedded within - in facilitating cormorant evolution and adaptation.

Links to related data items:


eHBVs: Avi.28-Alauda


We identified a full-length eHBV insertion in the genome of the Eurasian skylark (Alauda arvensis). This sequence is notable for its close relationship to avihepadnavirus species that currently circulate in birds. All extant avihepadnaviruses fall into one subgroup (which we labeled 'IV' in our study). Avi.28.Alauda groups as a basal member of this clade.

Background images for Avi.28-Alauda

Eurasian skylark (Alauda arvensis). Skylarks sing beautiful, complicated songs over grasslands, rolling downs, coastal saltmarshes, and wide arable fields. Changes in farming have led to skylark populations crashing, and they are now a Red List species of conservation concern.

Links to related data items:

eHBVs: Meta.1-Sauria


Meta.1.Sauria is - we think - the oldest EVE ever described. We identified putatively orthologous copies in a lizard - the tuatara (Sphenodon) - as well as in multiple avian species. This implies that germline integration occurred prior to the divergence of these lineages in the late Paleozoic Era (~250-300 million years ago).

This insertion also provides important new insights into the evolution of hepadnaviruses. It belongs to the recently described Metahepadnavirus genus. So far, metahepadnaviruses have only been identifiied in fish. Unexpectedly, these fish viruses were found to group relatively closely with mammalian hepadnaviruses (genus Orthohepadnavirus). This seemed to imply a transfer of viruses between fish and mammals. However, eHBV sequences reveal that metahepadnaviruses have an ancient origin and - historically at least - a broad distribution.

Species containing Meta.1-Sauria insertions in their genomes

Metahepadnaviruses are a recently discovered group of hepadnaviruses left to right: (i) White sucker ( ) - one of the fish species in which metahepadnaviruses were first identfied; (i) eHBVs derived from metahapadnaviruses were identified in the tuatara (Sphenodon punctatus) - a highly divergent reptile; (iii) Tuataras are the only surviving representatives of the ancient reptile lineage that contained the marine ichthyosaurs and pleisiosaurs; (iv) Meta.1-Sauria insertions have also been preserved in some avian genomes. Emperor penguins contain a relatively well-preserved ortholog.

Links to related data items:


Related Publications


Lytras S, Arriagada G, and RJ Gifford (2020)
Ancient evolution of hepadnaviral paleoviruses and their impact on host genomes.
Virus Evolution [view]

Larsen ON, Wahlberg M, and J. Christensen-Dalsgaard (2020)
Amphibious hearing in a diving bird, the great cormorant (Phalacrocorax carbo sinensis).
J Exp Biol. [view]

Prum RO, Berv JS, Dornburg A, Field DJ, Townsend JP, Lemmon EM, Lemmon AR. (2015)
A comprehensive phylogeny of birds (Aves) using targeted next-generation DNA sequencing.
Nature [view]

Suh A, Brosius J, Schmitz J, and JO Kriegs. (2013)
The genome of a Mesozoic paleovirus reveals the evolution of hepatitis B viruses.
Nat Commun. [view]

Katzourakis A. and RJ. Gifford (2010)
Endogenous viral elements in animal genomes.
PLoS Genetics [view]