Project description:This is a Random Forest algorithm-based machine learning model to predict lncRNAs from coding mRNAs in plant transcriptomic data. The model assigns 1 for coding sequences and 2 for long non-coding sequences. The prediction is performed using a combination of Open Reading Frame (ORF) based, Sequence-based and Codon-bias features. Users need to download the curated ONNX model and also need to convert the sequences into feature matrix as mentioned in PLIT paper (Deshpande et al. 2019) to make predictions on sequences from Zea Mays sequence data.
Project description:This is a randomized, open-label, active-control, multicenter trial comparing two oxaliplatin/Avastin-based treatment sequences as first-line therapy for metastatic colorectal cancer. The study is designed to compare the efficacy of these two treatment sequences with respect to progression free survival (PFS) and overall survival.
Project description:We report the sequences bound to CENP-A in the dog genome (Canis familiaris) for high-throughput characterization of centromeric sequences. We compare these ChIPSeq reads (72 bp, single read) against a reference centromeric satellite DNA domain database for the dog genome, resulting in the annotation of sequence variation and estimated abundance of seven satellite families together with adjacent, non-satellite sequences. To study global patterns of sequence diversity and characterizing the subset of sequences correlated with centromere function, these sequences were evaluated relative to a comprehensive centromere sequence domain k-mer library. From this analysis, we identify functional sequence features from two satellite families (CarSat1 and CarSat2) that are defined by distinct arrays subtypes. Sequences bound to CENP-A in MDCK (dog) cell line
Project description:Specification and propagation of the centromeres of eukaryotic chromosomes is determined by epigenetic mechanisms. Unfortunately, the epigenetic characteristics of centromeric DNA and chromatin are difficult to define because the centromeres are composed of highly repetitive DNA sequences in most eukaryotic species. Several rice centromeres have been fully sequenced, making rice an excellent model for centromere research. We conducted genome-wide mapping of cytosine methylation using methylcytosine immunoprecipitation combined with Illumina sequencing. The DNA sequences in the core domains of rice Cen4, Cen5, and Cen8 showed elevated methylation levels compared to the sequences in the pericentromeric regions. In addition, elevated methylation levels were associated with the DNA sequences in the CENH3-binding subdomains compared to the sequences in the flanking H3 subdomains. In contrast, the centromeric domain of Cen11, which is composed exclusively of centromeric satellite DNA, is hypomethylated compared to the pericentromeric domains. Thus, the DNA sequences associated with functional centromeres can be either hypomethylated or hypermethylated. The methylation patterns of centromeric DNA appear to be correlated with the composition of the associated DNA sequences. We propose that both hypomethylation and hypermethylation of CENH3-associated DNA sequences can serve as epigenetic marks to distinguish where CENH3 deposition will occur within the surrounding H3 chromatin. mCIP-seq of one sample of rice seedling
Project description:Centromeres typically contain repeat sequences, but centromere function does not necessarily depend on these sequences. In aneuploid wheat (Triticum aestivum) and wheat distant hybridization offspring, we found functional centromeres with dramatic changes to centromeric retrotransposon of wheat (CRW) sequences. CRW sequences were greatly reduced in the ditelosomic lines 1BS, 5DS, 5DL, and a wheat-Thinopyrum elongatum addition line. CRWs were completely lost in the ditelosomic line 4DS, but a 994 kb ectopic genomic DNA sequence was involved in de novo centromere formation on the 4DS chromosome. In addition, two ectopic sequences were incorporated in a de novo centromere in a wheat-Th. intermedium addition line. Centromeric sequences were also expanded to the chromosome arm in wide hybridizations. Stable alien chromosomes with two and three regions containing centromeric sequences were found in wheat-Th. elongatum hybrid derivatives, but only one is functional. In wheat-rye (Secale cereale) hybrids, rye centromere specific sequences spread to the chromosome arm and may cause centromere expansion. Thus, distant wheat hybridizations cause frequent and significant changes to the centromere via centromere misdivision, which may affect retention or loss of alien chromosomes in hybrids. ChIP-seq was carried out with anti-CENH3 antibody using material 4DS and control (Chinese Spring, CS as short).
Project description:Centromeres typically contain repeat sequences, but centromere function does not necessarily depend on these sequences. In aneuploid wheat (Triticum aestivum) and wheat distant hybridization offspring, we found functional centromeres with dramatic changes to centromeric retrotransposon of wheat (CRW) sequences. CRW sequences were greatly reduced in the ditelosomic lines 1BS, 5DS, 5DL, and a wheat-Thinopyrum elongatum addition line. CRWs were completely lost in the ditelosomic line 4DS, but a 994 kb ectopic genomic DNA sequence was involved in de novo centromere formation on the 4DS chromosome. In addition, two ectopic sequences were incorporated in a de novo centromere in a wheat-Th. intermedium addition line. Centromeric sequences were also expanded to the chromosome arm in wide hybridizations. Stable alien chromosomes with two and three regions containing centromeric sequences were found in wheat-Th. elongatum hybrid derivatives, but only one is functional. In wheat-rye (Secale cereale) hybrids, rye centromere specific sequences spread to the chromosome arm and may cause centromere expansion. Thus, distant wheat hybridizations cause frequent and significant changes to the centromere via centromere misdivision, which may affect retention or loss of alien chromosomes in hybrids.
Project description:Multi-channel Equivariant Attention Network (MEAN) to co-design 1D sequences and 3D structures of CDRs. To be specific, MEAN formulates antibody design as a conditional graph translation problem by importing extra components including the target antigen and the light chain of the antibody. Then, MEAN resorts to E(3)-equivariant message passing along with a proposed attention mechanism to better capture the geometrical correlation between different components. Finally, it outputs both the 1D sequences and 3D structure via a multi-round progressive full-shot scheme, which enjoys more efficiency and precision against previous autoregressive approaches.
Project description:We report the sequences bound to CENP-A in the dog genome (Canis familiaris) for high-throughput characterization of centromeric sequences. We compare these ChIPSeq reads (72 bp, single read) against a reference centromeric satellite DNA domain database for the dog genome, resulting in the annotation of sequence variation and estimated abundance of seven satellite families together with adjacent, non-satellite sequences. To study global patterns of sequence diversity and characterizing the subset of sequences correlated with centromere function, these sequences were evaluated relative to a comprehensive centromere sequence domain k-mer library. From this analysis, we identify functional sequence features from two satellite families (CarSat1 and CarSat2) that are defined by distinct arrays subtypes.
Project description:In the ciliated protozoan Tetrahymena, an RNAi-mediated feedback loop is important for assembling heterochromatin on the sequences that are removed from the somatic genome by programmed DNA elimination. Because heterochromatin is formed exclusively on the eliminated sequences, some mechanism must inhibit this feedback loop at the boundaries of the eliminated sequences. In this study, we show that the HP1-like protein Coi6p, its interaction partners Coi7p and Lia5p, and the histone demethylase Jmj1p are crucial for confining the production of small RNAs to the eliminated sequences.
Project description:In the ciliated protozoan Tetrahymena, an RNAi-mediated feedback loop is important for assembling heterochromatin on the sequences that are removed from the somatic genome by programmed DNA elimination. Because heterochromatin is formed exclusively on the eliminated sequences, some mechanism must inhibit this feedback loop at the boundaries of the eliminated sequences. In this study, we show that the HP1-like protein Coi6p, its interaction partners Coi7p and Lia5p, and the histone demethylase Jmj1p are crucial for confining the formation of heterochromatin to the eliminated sequences.