Integrating reference taxonomic databases for metabarcoding and metagenomics identification

Comparison of environmental sequences to reference sets from curated marker loci provides a mainstay for taxonomic analysis of microbial communities. Microbial eukaryotic sequencing requires many distinct reference sets to cover diversity adequately. Those producing reference sets follow different curation workflows, but share the need to provide their data onwards to a common set of tools and services, such as EMG, Megan, MetaPIPE and BioMaS. There are multiple inefficiencies: reference set providers must build services to sustain and feed their data to consumer tools and services; consumers must import reference sets from several sources with different formats. Led by the ITSoneDB team, who provide the leading fungi and other eukaryotes ITS1 reference set, we will develop a new data type within ENA that will capture systematically these reference sets and serve them to dependent resources, eliminating inefficiencies, leveraging this core ELIXIR resource and building sustainability into reference set generation workflows.


1 June 2018 to 31 May 2019

Nodes involved: 

Platform/Use Case: