Unknown

Dataset Information

0

Pairwise sequence comparison data of the DNA barcodes of aquatic insects.


ABSTRACT: This study compared the DNA sequences of cytochrome c oxidase subunit I (COI) and histone H3 of Ephemeroptera, Odonata, Plecoptera, and Trichoptera in a pairwise manner, and calculated the sequence similarities based on uncorrected P-distance (number of identical sites in both sequences per total number of the sites compared). Datasets of annotated sequences, the source organisms of which are identified at the species level in taxonomy, were retrieved from INSD (GenBank/ ENA/ DDBJ) as of the end of May 2020. Similarity scores of the pairwise comparison were sorted by the combinations of taxonomic groups; intraspecific variations, intrageneric-interspecific divergences, intrafamily-intergeneric divergences, and intraorder-interfamily divergences for Ephemeroptera, Odonata, Plecoptera, and Trichoptera. Similarity scores at the cumulative relative frequency points (1%, 5%, 10%, and median) may be used as the threshold to differentiate between the taxonomic groups based on sequence match. This is often done in the characterization of morphologically-unidentified specimens using barcode sequences, in the metabarcoding analysis of the local fauna, and environmental DNA analysis.

SUBMITTER: Inai K 

PROVIDER: S-EPMC7502337 | biostudies-literature | 2020 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

Pairwise sequence comparison data of the DNA barcodes of aquatic insects.

Inai Koji K   Wakimura Kei K   Kato Mikio M  

Data in brief 20200908


This study compared the DNA sequences of cytochrome c oxidase subunit I (COI) and histone H3 of Ephemeroptera, Odonata, Plecoptera, and Trichoptera in a pairwise manner, and calculated the sequence similarities based on uncorrected P-distance (number of identical sites in both sequences per total number of the sites compared). Datasets of annotated sequences, the source organisms of which are identified at the species level in taxonomy, were retrieved from INSD (GenBank/ ENA/ DDBJ) as of the end  ...[more]

Similar Datasets

| S-EPMC4971185 | biostudies-literature
| S-EPMC7173583 | biostudies-literature
| S-EPMC5306009 | biostudies-literature
| S-EPMC3446765 | biostudies-literature
| S-EPMC8700862 | biostudies-literature
| S-EPMC8031366 | biostudies-literature
| S-EPMC6978812 | biostudies-literature
| S-EPMC7563230 | biostudies-literature
| S-EPMC5136677 | biostudies-literature