Dataset Information

Genomics dataset on unclassified published organism (patent US 7547531).

ABSTRACT: Nucleotide (DNA) sequence analysis provides important clues regarding the characteristics and taxonomic position of an organism. With the intention that, DNA sequence analysis is very crucial to learn about hierarchical classification of that particular organism. This dataset (patent US 7547531) is chosen to simplify all the complex raw data buried in undisclosed DNA sequences which help to open doors for new collaborations. In this data, a total of 48 unidentified DNA sequences from patent US 7547531 were selected and their complete sequences were retrieved from NCBI BioSample database. Quick response (QR) code of those DNA sequences was constructed by DNA BarID tool. QR code is useful for the identification and comparison of isolates with other organisms. AT/GC content of the DNA sequences was determined using ENDMEMO GC Content Calculator, which indicates their stability at different temperature. The highest GC content was observed in GP445188 (62.5%) which was followed by GP445198 (61.8%) and GP445189 (59.44%), while lowest was in GP445178 (24.39%). In addition, New England BioLabs (NEB) database was used to identify cleavage code indicating the 5, 3 and blunt end and enzyme code indicating the methylation site of the DNA sequences was also shown. These data will be helpful for the construction of the organisms' hierarchical classification, determination of their phylogenetic and taxonomic position and revelation of their molecular characteristics.

SUBMITTER: Khan Shawan MM

PROVIDER: S-EPMC5066183 | biostudies-literature | 2016 Dec

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Genomics dataset on unclassified published organism (patent US 7547531).

Khan Shawan Mohammad Mahfuz Ali MM Hasan Md Ashraful MA Hossain Md Mozammel MM Hasan Md Mahmudul MM Parvin Afroza A Akter Salina S Uddin Kazi Rasel KR Banik Subrata S Morshed Mahbubul M Rahman Md Nazibur MN Rahman S M Badier SM

Data in brief 20161005

Nucleotide (DNA) sequence analysis provides important clues regarding the characteristics and taxonomic position of an organism. With the intention that, DNA sequence analysis is very crucial to learn about hierarchical <i>classification of that particular organism. This dataset</i> (patent US 7547531) is chosen to simplify all the complex raw data buried in undisclosed DNA sequences which help to open doors for new collaborations. In this data, a total of 48 unidentified DNA sequences from pate ...[more]

PMID: 27766287

Similar Datasets

Project description:Schistosomiasis endangers the lives of greater than 200 million people every year and is predominantly controlled by a single class chemotherapy, praziquantel (PZQ). Development of PZQ replacement (to combat the threat of PZQ insensitivity/resistance arising) or combinatorial (to facilitate the killing of PZQ-insensitive juvenile schistosomes) chemotherapies would help sustain this control strategy into the future. Here, we re-categorise two families of druggable epigenetic targets in Schistosoma mansoni, the histone methyltransferases (HMTs) and the histone demethylases (HDMs). Amongst these, a S. mansoni Lysine Specific Demethylase 1 (SmLSD1, Smp_150560) homolog was selected for further analyses. Homology modelling of SmLSD1 and in silico docking of greater than four thousand putative inhibitors identified seven (L1 - L7) showing more favourable binding to the target pocket of SmLSD1 vs Homo sapiens HsLSD1; six of these seven (L1 - L6) plus three structural analogues of L7 (L8 - L10) were subsequently screened against schistosomula using the Roboworm anthelmintic discovery platform. The most active compounds (L10 - pirarubicin > L8 - danunorubicin hydrochloride) were subsequently tested against juvenile (3 wk old) and mature (7 wk old) schistosome stages and found to impede motility, suppress egg production and affect tegumental surfaces. When compared to a surrogate human cell line (HepG2), a moderate window of selectivity was observed for the most active compound L10 (selectivity indices - 11 for schistosomula, 9 for juveniles, 1.5 for adults). Finally, RNA interference of SmLSD1 recapitulated the egg-laying defect of schistosomes co-cultivated in the presence of L10 and L8. These preliminary results suggest that SmLSD1 represents an attractive new target for schistosomiasis; identification of more potent and selective SmLSD1 compounds, however, is essential. Nevertheless, the approaches described herein highlight an interdisciplinary strategy for selecting and screening novel/repositioned anti-schistosomals, which can be applied to any druggable (epigenetic) target encoded by the parasite's genome.

Dataset Information

Genomics dataset on unclassified published organism (patent US 7547531).

Publications

Genomics dataset on unclassified published organism (patent US 7547531).

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets