Ontology highlight
ABSTRACT:
SUBMITTER: De Pierri CR
PROVIDER: S-EPMC6952362 | biostudies-literature | 2020 Jan
REPOSITORIES: biostudies-literature
De Pierri Camilla Reginatto CR Voyceik Ricardo R Santos de Mattos Letícia Graziela Costa LGC Kulik Mariane Gonçalves MG Camargo Josué Oliveira JO Repula de Oliveira Aryel Marlus AM de Lima Nichio Bruno Thiago BT Marchaukoski Jeroniza Nunes JN da Silva Filho Antonio Camilo AC Guizelini Dieval D Ortega J Miguel JM Pedrosa Fabio O FO Raittz Roberto Tadeu RT
Scientific reports 20200109 1
Vectoral and alignment-free approaches to biological sequence representation have been explored in bioinformatics to efficiently handle big data. Even so, most current methods involve sequence comparisons via alignment-based heuristics and fail when applied to the analysis of large data sets. Here, we present "Spaced Words Projection (SWeeP)", a method for representing biological sequences using relatively small vectors while preserving intersequence comparability. SWeeP uses spaced-words by sca ...[more]