Unknown

Dataset Information

0

Machine learning: A powerful tool for gene function prediction in plants.


ABSTRACT: Recent advances in sequencing and informatic technologies have led to a deluge of publicly available genomic data. While it is now relatively easy to sequence, assemble, and identify genic regions in diploid plant genomes, functional annotation of these genes is still a challenge. Over the past decade, there has been a steady increase in studies utilizing machine learning algorithms for various aspects of functional prediction, because these algorithms are able to integrate large amounts of heterogeneous data and detect patterns inconspicuous through rule-based approaches. The goal of this review is to introduce experimental plant biologists to machine learning, by describing how it is currently being used in gene function prediction to gain novel biological insights. In this review, we discuss specific applications of machine learning in identifying structural features in sequenced genomes, predicting interactions between different cellular components, and predicting gene function and organismal phenotypes. Finally, we also propose strategies for stimulating functional discovery using machine learning-based approaches in plants.

SUBMITTER: Mahood EH 

PROVIDER: S-EPMC7394712 | biostudies-literature | 2020 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Machine learning: A powerful tool for gene function prediction in plants.

Mahood Elizabeth H EH   Kruse Lars H LH   Moghe Gaurav D GD  

Applications in plant sciences 20200728 7


Recent advances in sequencing and informatic technologies have led to a deluge of publicly available genomic data. While it is now relatively easy to sequence, assemble, and identify genic regions in diploid plant genomes, functional annotation of these genes is still a challenge. Over the past decade, there has been a steady increase in studies utilizing machine learning algorithms for various aspects of functional prediction, because these algorithms are able to integrate large amounts of hete  ...[more]

Similar Datasets

2013-01-01 | E-GEOD-29210 | biostudies-arrayexpress
| S-EPMC5374972 | biostudies-other
2021-07-26 | GSE175955 | GEO
| S-EPMC10601900 | biostudies-literature
| S-EPMC5701237 | biostudies-literature
| S-EPMC10248027 | biostudies-literature
| S-EPMC10080209 | biostudies-literature
| S-EPMC9241370 | biostudies-literature
| S-EPMC9270439 | biostudies-literature
| S-EPMC9421197 | biostudies-literature