Dataset Information

An Improved Genome Assembly of Azadirachta indica A. Juss.


ABSTRACT: Neem (Azadirachta indica A. Juss.), an evergreen tree of the Meliaceae family, is known for its medicinal, cosmetic, pesticidal and insecticidal properties. We had previously sequenced and published the draft genome of a neem plant, using mainly short read sequencing data. In this report, we present an improved genome assembly generated using additional short reads from Illumina and long reads from Pacific Biosciences SMRT sequencer. We assembled short reads and error-corrected long reads using Platanus, an assembler designed to perform well for heterozygous genomes. The updated genome assembly (v2.0) yielded 3- and 3.5-fold increase in N50 and N75, respectively; 2.6-fold decrease in the total number of scaffolds; 1.25-fold increase in the number of valid transcriptome alignments; 13.4-fold less misassembly and 1.85-fold increase in the percentage repeat, over the earlier assembly (v1.0). The current assembly also maps better to the genes known to be involved in the terpenoid biosynthesis pathway. Together, the data represent an improved assembly of the A. indica genome.
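The N50 and N75 figures quoted in the abstract are contiguity statistics: the scaffold length at which the longest scaffolds, taken in descending order, together cover at least 50% (or 75%) of the total assembly length. The sketch below is only an illustration of that standard definition, not the authors' pipeline; the function name `nx` and the toy length lists are hypothetical and are not data from the neem assemblies.

```python
def nx(lengths, fraction):
    """Return the Nx statistic for a list of sequence lengths.

    Sequences are visited from longest to shortest; the result is the
    length of the sequence at which the running total first reaches
    `fraction` of the summed length (0.5 for N50, 0.75 for N75).
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    threshold = fraction * sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running >= threshold:
            return length
    return min(lengths)  # only reached through floating-point rounding


# Hypothetical toy scaffold lengths, NOT taken from the v1.0 or v2.0 assemblies.
toy_old = [100, 80, 60, 40, 20]
toy_new = [300, 200, 100]

n50_old, n50_new = nx(toy_old, 0.5), nx(toy_new, 0.5)
n75_old, n75_new = nx(toy_old, 0.75), nx(toy_new, 0.75)

print("N50 fold change:", n50_new / n50_old)
print("N75 fold change:", n75_new / n75_old)
```

A fold increase such as those reported in the abstract is the ratio of the statistic between two assemblies, computed here in the last two lines for the toy lists.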

SUBMITTER: Krishnan NM 

PROVIDER: S-EPMC4938638 | biostudies-literature | 2016 Jul

REPOSITORIES: biostudies-literature

Publications

An Improved Genome Assembly of Azadirachta indica A. Juss.

Neeraja M Krishnan, Prachi Jain, Saurabh Gupta, Arun K Hariharan, Binay Panda

G3 (Bethesda, Md.), 2016-07-07, issue 7


Similar Datasets

| S-EPMC4433627 | biostudies-literature
| S-EPMC5538173 | biostudies-other
| PRJNA312760 | ENA
| S-EPMC6090013 | biostudies-literature
| PRJNA288247 | ENA
| S-EPMC8844965 | biostudies-literature
| S-EPMC6448914 | biostudies-literature
| S-EPMC9069239 | biostudies-literature
| PRJNA344442 | ENA
| S-EPMC4250567 | biostudies-literature