Other

Dataset Information

0

Peptide gene sequences of substrate phage for different proteases


ABSTRACT: We present a method that employs two genetically encoded substrate phage display libraries coupled with next generation sequencing (SPD-NGS) that allows up to 10,000-fold deeper sequence coverage of the typical 6 to 8 residue protease cleavage sites compared to state-of-the-art synthetic peptide libraries or proteomics. We applied SPD-NGS to two classes of proteases, the intracellular caspases 2, 3, 6, 7 and 8, and the ectodomains of the membrane sheddases, ADAMs 10 and 17. The first library (Lib 10AA) was used to determine substrate cleavage motifs. Lib 10AA contains a highly diverse randomized 10-mer substrate peptide sequences (10^9 unique members) that was displayed mono-valently on filamentous phage and bound to magnetic beads via an N-terminal biotin. The protease was allowed to cleave the SPD beads, and the released phage subjected to up to three total rounds of positive selection followed by next generation sequencing (NGS). This allowed us to identify from 10^4 to 10^5 unique cleavage sites over a broad dynamic range of NGS counts (ranging from 3-5000), and produced consensus and optimal cleavage motifs based positional sequencing scoring matrices and state-of the-art machine learning algorithm that closely matched synthetic peptide data. A second SPD-NGS library (Lib hP) was constructed that allowed us to identify candidate human proteome sequences. Lib hP displayed virtually the entire human proteome tiled in contiguous 49AA sequences with 25AA overlaps (nearly 1 million members). After three rounds of positive selection we identified up to 10^4 natural linear cut sites depending on the protease and captured most of the examples previously identified by proteomics (ranging from 30 to 1000) and predicted 10 to 100-fold more.

ORGANISM(S): synthetic construct

PROVIDER: GSE154923 | GEO | 2020/07/23

REPOSITORIES: GEO

Dataset's files

Source:
Action DRS
Other
Items per page:
1 - 1 of 1

Similar Datasets

2024-05-08 | MODEL2405080006 | BioModels
| PRJNA647886 | ENA
2013-11-27 | PXD000152 | Pride
2016-06-08 | GSE73203 | GEO
2013-11-27 | PXD000252 | Pride
2024-10-09 | GSE253516 | GEO
2022-07-20 | GSE196622 | GEO
2013-02-25 | PXD000124 | Pride
2023-10-05 | PXD038399 | Pride
2022-07-25 | GSE209538 | GEO