ELIXIR Core Data Resources are a set of European data resources of fundamental importance to the wider life-science community and the long-term preservation of biological data.
Identification of the ELIXIR Core Data Resources involves a careful evaluation of the multiple facets of the data resources. Indicators used in the evaluation are grouped into five categories:
- Scientific focus and quality of science
- Community served by the resource
- Quality of service
- Legal and funding infrastructure, and governance
- Impact and translational stories
The details of the selection criteria are described in the F1000R ELIXIR track article 'Identifying ELIXIR Core Data Resources'.
The initial Core Data Resource list was defined in July of 2017. The list will be reviewed regularly - further rounds of selection are planned, going forward.
ELIXIR Core Data Resource list
|Core Data Resource||Data type|
|ArrayExpress||Functional Genomics Data from high-throughput functional genomics experiments.|
|ChEBI||Dictionary of molecular entities focused on ‘small’ chemical compounds.|
|ChEMBL||Database of bioactive drug-like small molecules, it contains 2-D structures, calculated properties and abstracted bioactivities.|
|EGA||Personally identifiable genetic and phenotypic data resulting from biomedical research projects.|
|ENA||Nucleotide sequencing information, covering raw sequencing data, sequence assembly information and functional annotation.|
|Ensembl||Genome browser for vertebrate genomes that supports research in comparative genomics, evolution, sequence variation and transcriptional regulation.|
|Ensembl Genomes||Comparative analysis, data mining and visualisation for the genomes of non-vertebrate species.|
|Europe PMC||Europe PMC is a repository, providing access to worldwide life sciences articles, books, patents and clinical guidelines.|
|Human Protein Atlas||The Human Protein Atlas contains information for a large majority of all human protein-coding genes regarding the expression and localization of the corresponding proteins based on both RNA and protein data.|
|The IMEx Consortium: represented by IntAct and MINT||IntAct provides a freely available, open source database system and analysis tools for molecular interaction data. MINT focuses on experimentally verified protein-protein interactions mined from the scientific literature by expert curators.|
|InterPro||Functional analysis of protein sequences by classifying them into families and predicting the presence of domains and important sites.
Note: This is an umbrella resource to which many collaborating databases contribute. In naming InterPro as a Core Data Resource, the critical role of the constituent databases is recognised.
|PDBe||Biological macromolecular structures.|
|PRIDE||Mass spectrometry-based proteomics data, including peptide and protein expression information (identifications and quantification values) and the supporting mass spectra evidence.|
|STRING-db||Known and predicted protein-protein interactions.|
|UniProt||Comprehensive resource for protein sequence and annotation data.|
In addition to the Core Data Resources, ELIXIR has compiled a list of recommended repositories for experimental data, the ELIXIR Deposition Databases.
Further information: Rachel Drysdale (rachel.drysdale[at]elixir-europe[dot]org)
1 Should a Core Data Resource lose its funding and the only option to remain viable requires a reversion to a non-open license, the license status and exception (e.g. requirement for industry co-funding) may be revisited by the ELIXIR Heads of Nodes Committee. Back to text.