Ontology highlight
ABSTRACT:
SUBMITTER: Thompson JA
PROVIDER: S-EPMC4736986 | biostudies-literature | 2016
REPOSITORIES: biostudies-literature
Thompson Jeffrey A JA Tan Jie J Greene Casey S CS
PeerJ 20160121
Large, publicly available gene expression datasets are often analyzed with the aid of machine learning algorithms. Although RNA-seq is increasingly the technology of choice, a wealth of expression data already exist in the form of microarray data. If machine learning models built from legacy data can be applied to RNA-seq data, larger, more diverse training datasets can be created and validation can be performed on newly generated data. We developed Training Distribution Matching (TDM), which tr ...[more]