Dataset Information

SPARSE: quadratic time simultaneous alignment and folding of RNAs without sequence-based heuristics.

ABSTRACT:

Motivation

RNA-Seq experiments have revealed a multitude of novel ncRNAs. The gold standard for their analysis based on simultaneous alignment and folding suffers from extreme time complexity of [Formula: see text]. Subsequently, numerous faster 'Sankoff-style' approaches have been suggested. Commonly, the performance of such methods relies on sequence-based heuristics that restrict the search space to optimal or near-optimal sequence alignments; however, the accuracy of sequence-based methods breaks down for RNAs with sequence identities below 60%. Alignment approaches like LocARNA that do not require sequence-based heuristics, have been limited to high complexity ([Formula: see text] quartic time).

Results

Breaking this barrier, we introduce the novel Sankoff-style algorithm 'sparsified prediction and alignment of RNAs based on their structure ensembles (SPARSE)', which runs in quadratic time without sequence-based heuristics. To achieve this low complexity, on par with sequence alignment algorithms, SPARSE features strong sparsification based on structural properties of the RNA ensembles. Following PMcomp, SPARSE gains further speed-up from lightweight energy computation. Although all existing lightweight Sankoff-style methods restrict Sankoff's original model by disallowing loop deletions and insertions, SPARSE transfers the Sankoff algorithm to the lightweight energy model completely for the first time. Compared with LocARNA, SPARSE achieves similar alignment and better folding quality in significantly less time (speedup: 3.7). At similar run-time, it aligns low sequence identity instances substantially more accurate than RAF, which uses sequence-based heuristics.

SUBMITTER: Will S

PROVIDER: S-EPMC4514930 | biostudies-literature | 2015 Aug

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

SPARSE: quadratic time simultaneous alignment and folding of RNAs without sequence-based heuristics.

Will Sebastian S Otto Christina C Miladi Milad M Möhl Mathias M Backofen Rolf R

Bioinformatics (Oxford, England) 20150402 15

<h4>Motivation</h4>RNA-Seq experiments have revealed a multitude of novel ncRNAs. The gold standard for their analysis based on simultaneous alignment and folding suffers from extreme time complexity of [Formula: see text]. Subsequently, numerous faster 'Sankoff-style' approaches have been suggested. Commonly, the performance of such methods relies on sequence-based heuristics that restrict the search space to optimal or near-optimal sequence alignments; however, the accuracy of sequence-based m ...[more]

PMID: 25838465

Similar Datasets

Project description:BackgroundComputed tomography (CT) and spirometry are the current standard methods for assessing lung anatomy and pulmonary ventilation, respectively. However, CT provides limited ventilation information and spirometry only provides global measures of lung ventilation. Thus, a method that can enable simultaneous examination of lung anatomy and ventilation is of clinical interest.PurposeTo develop and test a 4D respiratory-resolved sparse lung MRI (XD-UTE: eXtra-Dimensional Ultrashort TE imaging) approach for simultaneous evaluation of lung anatomy and pulmonary ventilation.Study typeProspective.PopulationIn all, 23 subjects (11 volunteers and 12 patients, mean age = 63.6 ± 8.4).Field strength/sequence3T MR; a prototype 3D golden-angle radial UTE sequence, a Cartesian breath-hold volumetric-interpolated examination (BH-VIBE) sequence.AssessmentAll subjects were scanned using the 3D golden-angle radial UTE sequence during normal breathing. Ten subjects underwent an additional scan during alternating normal and deep breathing. Respiratory-motion-resolved sparse reconstruction was performed for all the acquired data to generate dynamic normal-breathing or deep-breathing image series. For comparison, BH-VIBE was performed in 12 subjects. Lung images were visually scored by three experienced chest radiologists and were analyzed by two observers who segmented the left and right lung to derive ventilation parameters in comparison with spirometry.Statistical testsNonparametric paired two-tailed Wilcoxon signed-rank test; intraclass correlation coefficient, Pearson correlation coefficient.ResultsXD-UTE achieved significantly improved image quality compared both with Cartesian BH-VIBE and radial reconstruction without motion compensation (P < 0.05). The global ventilation parameters (a sum of the left and right lung measures) were in good correlation with spirometry in the same subjects (correlation coefficient = 0.724). There were excellent correlations between the results obtained by two observers (intraclass correlation coefficient ranged from 0.8855-0.9995).Data conclusionSimultaneous evaluation of lung anatomy and ventilation using XD-UTE is demonstrated, which have shown good potential for improved diagnosis and management of patients with heterogeneous lung diseases.Level of evidence2 Technical Efficacy: Stage 2 J. Magn. Reson. Imaging 2019;49:411-422.

Project description:Accurate recognition and understating of human emotions is an essential skill that can improve the collaboration between humans and machines. In this vein, electroencephalogram (EEG)-based emotion recognition is considered an active research field with challenging issues regarding the analyses of the nonstationary EEG signals and the extraction of salient features that can be used to achieve accurate emotion recognition. In this paper, an EEG-based emotion recognition approach with a novel time-frequency feature extraction technique is presented. In particular, a quadratic time-frequency distribution (QTFD) is employed to construct a high resolution time-frequency representation of the EEG signals and capture the spectral variations of the EEG signals over time. To reduce the dimensionality of the constructed QTFD-based representation, a set of 13 time- and frequency-domain features is extended to the joint time-frequency-domain and employed to quantify the QTFD-based time-frequency representation of the EEG signals. Moreover, to describe different emotion classes, we have utilized the 2D arousal-valence plane to develop four emotion labeling schemes of the EEG signals, such that each emotion labeling scheme defines a set of emotion classes. The extracted time-frequency features are used to construct a set of subject-specific support vector machine classifiers to classify the EEG signals of each subject into the different emotion classes that are defined using each of the four emotion labeling schemes. The performance of the proposed approach is evaluated using a publicly available EEG dataset, namely the DEAPdataset. Moreover, we design three performance evaluation analyses, namely the channel-based analysis, feature-based analysis and neutral class exclusion analysis, to quantify the effects of utilizing different groups of EEG channels that cover various regions in the brain, reducing the dimensionality of the extracted time-frequency features and excluding the EEG signals that correspond to the neutral class, on the capability of the proposed approach to discriminate between different emotion classes. The results reported in the current study demonstrate the efficacy of the proposed QTFD-based approach in recognizing different emotion classes. In particular, the average classification accuracies obtained in differentiating between the various emotion classes defined using each of the four emotion labeling schemes are within the range of 73.8 % ? 86.2 % . Moreover, the emotion classification accuracies achieved by our proposed approach are higher than the results reported in several existing state-of-the-art EEG-based emotion recognition studies.

Dataset Information

SPARSE: quadratic time simultaneous alignment and folding of RNAs without sequence-based heuristics.

Motivation

Results

Publications

SPARSE: quadratic time simultaneous alignment and folding of RNAs without sequence-based heuristics.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets