Project description:News analysis is a popular task in Natural Language Processing (NLP). In particular, the problem of clickbait in news analysis has gained attention in recent years [1, 2]. However, the majority of the tasks has been focused on English news, in which there is already a rich representative resource. For other languages, such as Indonesian, there is still a lack of resource for clickbait tasks. Therefore, we introduce the CLICK-ID dataset of Indonesian news headlines extracted from 12 Indonesian online news publishers. It is comprised of 15,000 annotated headlines with clickbait and non-clickbait labels. Using the CLICK-ID dataset, we then developed an Indonesian clickbait classification model achieving favourable performance. We believe that this corpus will be useful for replicable experiments in clickbait detection or other experiments in NLP areas.
Project description:BACKGROUND:Annotation of cell identity is an essential process in neuroscience that allows comparison of cells, including that of neural activities across different animals. In Caenorhabditis elegans, although unique identities have been assigned to all neurons, the number of annotatable neurons in an intact animal has been limited due to the lack of quantitative information on the location and identity of neurons. RESULTS:Here, we present a dataset that facilitates the annotation of neuronal identities, and demonstrate its application in a comprehensive analysis of whole-brain imaging. We systematically identified neurons in the head region of 311 adult worms using 35 cell-specific promoters and created a dataset of the expression patterns and the positions of the neurons. We found large positional variations that illustrated the difficulty of the annotation task. We investigated multiple combinations of cell-specific promoters driving distinct fluorescence and generated optimal strains for the annotation of most head neurons in an animal. We also developed an automatic annotation method with human interaction functionality that facilitates annotations needed for whole-brain imaging. CONCLUSION:Our neuron ID dataset and optimal fluorescent strains enable the annotation of most neurons in the head region of adult C. elegans, both in full-automated fashion and a semi-automated version that includes human interaction functionalities. Our method can potentially be applied to model species used in research other than C. elegans, where the number of available cell-type-specific promoters and their variety will be an important consideration.
Project description:The dataset contains 1339 cone penetration tests (CPT, CPTu, SCPT, SCPTu) executed within Austria and Germany by the company Premstaller Geotechnik ZT GmbH. As a first processing step, core drillings, located within a maximum distance of approximately 50 m to the insitu tests, were assigned to these cone penetration tests, which allow an interpretation of the insitu measurements based on its grain size distribution. In a second step, the software Geologismiki was used to calculate various normalized measures, which can e.g. be used as input parameters for soil behaviour type charts. The present data can be utilized by researches for example to develop new approaches related to soil classification based on cone penetration test. Furthermore, it provides a framework for combining insitu measurements (qc, fs, Rf, u2, Vs), normalized measures (i.e. Qt, Bq, U2) and soil classifications.
Project description:Protein abnormalities are the major cause of neurodegenerative diseases such as spinocerebellar ataxia (SCA). Protein misfolding and impaired degradation leads to the build-up of protein aggregates inside the cell, which may further cause cellular degeneration. Reducing levels of either the soluble misfolded form of the protein or its precipitated aggregate, even marginally, could significantly improve cellular health. Despite numerous pre-existing strategies to target these protein aggregates, there is considerable room to improve their specificity and efficiency. In this study, we demonstrated the enhanced intracellular degradation of both monomers and aggregates of mutant ataxin1 (Atxn1 82Q) by engineering an E3 ubiquitin ligase enzyme, promyelocytic leukemia protein (PML). Specifically, we showed enhanced degradation of both soluble and aggregated Atxn1 82Q in mammalian cells by targeting this protein using PML fused to single chain variable fragments (scFvs) specific for monomers and aggregates of the target protein. The ability to solubilize Atxn1 82Q aggregates was due to the PML-mediated enhanced SUMOylation of the target protein. This ability to reduce the intracellular levels of both misfolded forms of Atxn1 82Q may not only be useful for treating SCA, but also applicable for the treatment of other PolyQ disorders.
Project description:A simple HEK293 lysate, with two files containing a putative mycoplasma contamination, and two negative control samples, taken from Geiger etc al. (Mol Cell Proteomics. 2012 Mar;11(3):M111.014050. doi: 10.1074/mcp.M111.014050. Epub 2012 Jan 25.) [PXD002395].
Project description:Masonry structures are widely used nowadays for their advantages like low-cost workmanship, efficiency and fast construction techniques. The compressive strength of the materials that compose masonry (block and mortar) is very important to the behavior of the system, but the tensile strength is even more significant for the standards and building codes. In this work, a dataset for indirect tensile tests of hollow concrete blocks is obtained. Splitting tests as described in ASTM C-1006-13 are applied. Two sets of blocks were tested, one with medium compressive strength and the other with high compressive strength. The first set was tested in three directions named A, B, and C; the second one was tested in two directions, A and B. The data was collected with a servo-hydraulic machine. The data is presented in tables and can be used by material researchers, as well as in numerical modeling.