Dataset Information

DeepLRR: An Online Webserver for Leucine-Rich-Repeat Containing Protein Characterization Based on Deep Learning.

ABSTRACT: Members of the leucine-rich repeat (LRR) superfamily play critical roles in multiple biological processes. As the LRR unit sequence is highly variable, accurately predicting the number and location of LRR units in proteins is a highly challenging task in the field of bioinformatics. Existing methods still need to be improved, especially when it comes to similarity-based methods. We introduce our DeepLRR method based on a convolutional neural network (CNN) model and LRR features to predict the number and location of LRR units in proteins. We compared DeepLRR with six existing methods using a dataset containing 572 LRR proteins and it outperformed all of them when it comes to overall F1 score. In addition, DeepLRR has integrated identifying plant disease-resistance proteins (NLR, LRR-RLK, LRR-RLP) and non-canonical domains. With DeepLRR, 223, 191 and 183 LRR-RLK genes in Arabidopsis (Arabidopsis thaliana), rice (Oryza sativa ssp. Japonica) and tomato (Solanum lycopersicum) genomes were re-annotated, respectively. Chromosome mapping and gene cluster analysis revealed that 24.2% (54/223), 29.8% (57/191) and 16.9% (31/183) of LRR-RLK genes formed gene cluster structures in Arabidopsis, rice and tomato, respectively. Finally, we explored the evolutionary relationship and domain composition of LRR-RLK genes in each plant and distributions of known receptor and co-receptor pairs. This provides a new perspective for the identification of potential receptors and co-receptors.

SUBMITTER: Liu Z

PROVIDER: S-EPMC8796025 | biostudies-literature | 2022 Jan

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

DeepLRR: An Online Webserver for Leucine-Rich-Repeat Containing Protein Characterization Based on Deep Learning.

Liu Zhenya Z Ren Zirui Z Yan Lunyi L Li Feng F

Plants (Basel, Switzerland) 20220104 1

Members of the leucine-rich repeat (LRR) superfamily play critical roles in multiple biological processes. As the LRR unit sequence is highly variable, accurately predicting the number and location of LRR units in proteins is a highly challenging task in the field of bioinformatics. Existing methods still need to be improved, especially when it comes to similarity-based methods. We introduce our DeepLRR method based on a convolutional neural network (CNN) model and LRR features to predict the nu ...[more]

PMID: 35009139

Dataset Information

DeepLRR: An Online Webserver for Leucine-Rich-Repeat Containing Protein Characterization Based on Deep Learning.

Publications

DeepLRR: An Online Webserver for Leucine-Rich-Repeat Containing Protein Characterization Based on Deep Learning.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Leucine-rich pentatricopeptide-repeat containing protein regulates mitochondrial transcription.
| S-EPMC2932791 | biostudies-literature

Leucine-rich repeat and immunoglobulin domain-containing protein-1 (Lrig1) negative regulatory action toward ErbB receptor tyrosine kinases is opposed by leucine-rich repeat and immunoglobulin domain-containing protein 3 (Lrig3).
| S-EPMC3724619 | biostudies-literature

Dynamic expression patterns of leucine-rich repeat containing protein 10 in the heart.
| S-EPMC2002521 | biostudies-literature

BK potassium channel modulation by leucine-rich repeat-containing proteins.
| S-EPMC3356614 | biostudies-literature

Leucine-Rich Repeat (LRR) Domains Containing Intervening Motifs in Plants.
| S-EPMC4030839 | biostudies-literature

+mRNA expression of LRRC55 protein (leucine-rich repeat-containing protein 55) in the adult mouse brain.
| S-EPMC5784982 | biostudies-literature

Nucleotide binding domain and leucine-rich repeat pyrin domain-containing protein 12: characterization of its binding to hematopoietic cell kinase.
| S-EPMC7097926 | biostudies-literature

The leucine-rich pentatricopeptide repeat-containing protein (LRPPRC) does not activate transcription in mammalian mitochondria.
| S-EPMC3668712 | biostudies-literature

Stat3 upregulates leucine-rich repeat-containing g protein-coupled receptor 4 expression in osteosarcoma cells.
| S-EPMC3886594 | biostudies-literature

Crystal structures of FNIP/FGxxFN motif-containing leucine-rich repeat proteins.
| S-EPMC9525666 | biostudies-literature