Tuesday, July 28 • 12:15 - 12:20
Comparative analysis of pangenome structure in generalist and specialist bacterial species

A pangenome is the total set of protein-coding genes present in a collection of genomes belonging to one bacterial taxon. The pangenome is commonly divided into three parts: the “core” (genes present in all given genomes), the “unique genome” (strain-specific genes), and the “periphery”, or “accessory genome” (genes present in at least two, but not all, given genomes). Bacterial species differ in their core-to-periphery ratio. However, the source of this difference is still unclear.
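The three-way split can be illustrated with a minimal sketch. It assumes each genome has already been reduced to a set of gene-family identifiers, and that the periphery covers families found in at least two but not all genomes. The function name, the strain names, and the gene IDs are hypothetical and are not taken from this work.

```python
from collections import Counter


def partition_pangenome(genomes):
    """Split gene families into core, periphery, and unique sets.

    genomes: dict mapping a genome name to the set of gene-family IDs
    it carries (hypothetical input format, assumed for illustration).
    """
    n = len(genomes)
    # Count in how many genomes each gene family occurs.
    counts = Counter(g for genes in genomes.values() for g in set(genes))
    core = {g for g, c in counts.items() if c == n}
    # With a single genome every family is core, so nothing is "unique".
    unique = {g for g, c in counts.items() if c == 1 and n > 1}
    periphery = {g for g, c in counts.items() if 1 < c < n}
    return core, periphery, unique


def core_to_periphery_ratio(core, periphery):
    """Ratio of core to periphery family counts (inf if periphery is empty)."""
    return len(core) / len(periphery) if periphery else float("inf")


# Toy example with three hypothetical strains.
example_genomes = {
    "strainA": {"g1", "g2", "g3"},
    "strainB": {"g1", "g2", "g4"},
    "strainC": {"g1", "g3", "g5"},
}
core, periphery, unique = partition_pangenome(example_genomes)
ratio = core_to_periphery_ratio(core, periphery)
```

In this toy input, g1 is carried by all three strains, g2 and g3 by two each, and g4 and g5 by one each, so the core holds one family and the periphery two.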

It has been suggested that the habitat ubiquity of a bacterial species could be a crucial factor defining its pangenome structure. Data on bacterial presence in various environments can be obtained from metagenomic experiments. We use metagenomic data from the Earth Microbiome Project (EMP) [1], which is to date the largest collection of metagenomic samples from a wide range of environments and geographic regions prepared according to a unified pipeline.

It is currently debated which part of the pangenome (the core or the accessory genome) is responsible for bacterial adaptation to a broad spectrum of ecological niches. We hypothesized that members of a bacterial species present in many habitats (a generalist species) need specific accessory genes in order to adapt to the varying conditions of different environments. In contrast, specialist species are restricted to a few habitats and are therefore probably more genetically homogeneous, with the core genome prevailing in their pangenomes.

This work aims to determine whether bacterial pangenome structure is associated with the number of habitats in which a bacterial species is present. Particular attention is devoted to developing an environment classification based on the similarity of EMP samples.

1. Thompson L. R. et al. (2017) A communal catalogue reveals Earth’s multiscale microbial diversity. Nature, vol. 551, no. 7681.

Daria Nikolaeva

Kharkevich Institute for Information Transmission Problems (IITP RAS)

