Unknown

Dataset Information

0

Ensembl core software resources: storage and programmatic access for DNA sequence and genome annotation.


ABSTRACT: The Ensembl software resources are a stable infrastructure to store, access and manipulate genome assemblies and their functional annotations. The Ensembl 'Core' database and Application Programming Interface (API) was our first major piece of software infrastructure and remains at the centre of all of our genome resources. Since its initial design more than fifteen years ago, the number of publicly available genomic, transcriptomic and proteomic datasets has grown enormously, accelerated by continuous advances in DNA-sequencing technology. Initially intended to provide annotation for the reference human genome, we have extended our framework to support the genomes of all species as well as richer assembly models. Cross-referenced links to other informatics resources facilitate searching our database with a variety of popular identifiers such as UniProt and RefSeq. Our comprehensive and robust framework storing a large diversity of genome annotations in one location serves as a platform for other groups to generate and maintain their own tailored annotation. We welcome reuse and contributions: our databases and APIs are publicly available, all of our source code is released with a permissive Apache v2.0 licence at http://github.com/Ensembl and we have an active developer mailing list ( http://www.ensembl.org/info/about/contact/index.html ). Database URL:http://www.ensembl.org.

SUBMITTER: Ruffier M 

PROVIDER: S-EPMC5467575 | biostudies-literature | 2017 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Ensembl core software resources: storage and programmatic access for DNA sequence and genome annotation.

Ruffier Magali M   Kähäri Andreas A   Komorowska Monika M   Keenan Stephen S   Laird Matthew M   Longden Ian I   Proctor Glenn G   Searle Steve S   Staines Daniel D   Taylor Kieron K   Vullo Alessandro A   Yates Andrew A   Zerbino Daniel D   Flicek Paul P  

Database : the journal of biological databases and curation 20170101 1


The Ensembl software resources are a stable infrastructure to store, access and manipulate genome assemblies and their functional annotations. The Ensembl 'Core' database and Application Programming Interface (API) was our first major piece of software infrastructure and remains at the centre of all of our genome resources. Since its initial design more than fifteen years ago, the number of publicly available genomic, transcriptomic and proteomic datasets has grown enormously, accelerated by con  ...[more]

Similar Datasets

| S-EPMC6736197 | biostudies-literature
| S-EPMC6310513 | biostudies-literature
| S-EPMC4756621 | biostudies-literature
| S-EPMC2894800 | biostudies-literature
| S-EPMC4301745 | biostudies-literature
| S-EPMC8115729 | biostudies-literature
| S-EPMC4919035 | biostudies-literature
| S-EPMC3049762 | biostudies-literature
| S-EPMC6283364 | biostudies-literature