Unknown

Dataset Information

0

4SpecID: Reference DNA Libraries Auditing and Annotation System for Forensic Applications.


ABSTRACT: Forensic genetics is a fast-growing field that frequently requires DNA-based taxonomy, namely, when evidence are parts of specimens, often highly processed in food, potions, or ointments. Reference DNA-sequences libraries, such as BOLD or GenBank, are imperative tools for taxonomic assignment, particularly when morphology is inadequate for classification. The auditing and curation of these datasets require reliable mechanisms, preferably with automated data preprocessing. Software tools were developed to grade these datasets considering as primary criterion the number of records, which is not compliant with forensic standards, where the priority is validation from independent sources. Moreover, 4SpecID is an efficient software tool developed to audit and annotate reference libraries, specifically designed for forensic applications. Its intuitive user-friendly interface virtually accesses any database and includes specific data mining functions tuned for the widespread BOLD repositories. The built tool was evaluated in laptop MacBook and a dual-Xeon server with a large BOLD dataset (Culicidae, 36,115 records), and the best execution time to grade the dataset on the laptop was 0.28 s. Datasets of Bovidae and Felidae families were used to evaluate the quality of the tool and the relevance of independent sources validation.

SUBMITTER: Neto L 

PROVIDER: S-EPMC7824288 | biostudies-literature | 2021 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

4SpecID: Reference DNA Libraries Auditing and Annotation System for Forensic Applications.

Neto Luís L   Pinto Nádia N   Proença Alberto A   Amorim António A   Conde-Sousa Eduardo E  

Genes 20210102 1


Forensic genetics is a fast-growing field that frequently requires DNA-based taxonomy, namely, when evidence are parts of specimens, often highly processed in food, potions, or ointments. Reference DNA-sequences libraries, such as BOLD or GenBank, are imperative tools for taxonomic assignment, particularly when morphology is inadequate for classification. The auditing and curation of these datasets require reliable mechanisms, preferably with automated data preprocessing. Software tools were dev  ...[more]

Similar Datasets

| S-EPMC3338485 | biostudies-literature
| S-EPMC5456030 | biostudies-literature
| S-EPMC10914783 | biostudies-literature
| S-EPMC10146244 | biostudies-literature
| S-EPMC8405038 | biostudies-literature
| S-EPMC9004796 | biostudies-literature
| S-EPMC4071205 | biostudies-literature
| S-EPMC5838060 | biostudies-literature
| S-EPMC4161972 | biostudies-literature
| S-EPMC7911526 | biostudies-literature