Dataset Information

Artificial Neural Networks Combined with the Principal Component Analysis for Non-Fluent Speech Recognition.

ABSTRACT: This paper applies principal component analysis to reduce the dimensionality of the variables describing a speech signal and examines how the results can be used to recognise disturbed and fluent speech. A set of fluent speech signals and three types of speech disturbance (blocks before words starting with plosives, syllable repetitions, and sound-initial prolongations) was transformed using principal component analysis. The result was a model containing four principal components describing the analysed utterances. Distances between the standardised original variables and the elements of the observation matrix in the new coordinate system were calculated and then applied in the recognition process. A multilayer perceptron network was used as the classifying algorithm. The results were compared with outcomes from previous experiments in which speech samples were parameterised using a Kohonen network. The classifying network achieved an overall accuracy of 76% (from 50% to 91%, depending on the dysfluency type).
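The dimensionality-reduction step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the data are synthetic, the feature count (20) and sample count (200) are arbitrary assumptions, and the helper name `pca_reduce` is hypothetical. Only the choice of four retained components comes from the abstract. The sketch standardises each variable, computes principal axes by singular value decomposition, and projects the observations onto the leading four axes. The distance computation and the multilayer perceptron classifier mentioned in the abstract are not reproduced here.

```python
import numpy as np


def pca_reduce(X, n_components=4):
    """Standardise the columns of X and project onto the leading principal axes.

    Hypothetical helper for illustration; assumes no column of X is constant.
    """
    mean = X.mean(axis=0)
    std = X.std(axis=0, ddof=1)
    Z = (X - mean) / std  # standardised variables (zero mean, unit variance)

    # SVD of the standardised data: rows of Vt are the principal axes,
    # squared singular values are proportional to the explained variance.
    _, S, Vt = np.linalg.svd(Z, full_matrices=False)
    components = Vt[:n_components]
    scores = Z @ components.T  # observations in the new coordinate system
    explained = (S ** 2) / np.sum(S ** 2)
    return scores, components, explained[:n_components]


# Synthetic stand-in for speech features (assumption, not the paper's data):
# 200 observations of 20 variables driven by 4 latent factors plus noise.
rng = np.random.default_rng(0)
latent = rng.normal(size=(200, 4))
mixing = rng.normal(size=(4, 20))
X = latent @ mixing + 0.1 * rng.normal(size=(200, 20))

scores, components, explained = pca_reduce(X, n_components=4)
print("scores shape:", scores.shape)
print("fraction of variance retained:", explained.sum())
```

In a recognition pipeline of the kind the abstract outlines, a reduced representation such as `scores` would then serve as input to a classifier.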

SUBMITTER: Swietlicka I

PROVIDER: S-EPMC8749906 | biostudies-literature | 2022 Jan

REPOSITORIES: biostudies-literature

Publications

Artificial Neural Networks Combined with the Principal Component Analysis for Non-Fluent Speech Recognition.

Izabela Świetlicka, Wiesława Kuniszyk-Jóźkowiak, Michał Świetlicki

Sensors (Basel, Switzerland), 2022-01-01, issue 1

PMID: 35009863

Similar Datasets

Project description: The non-fluent/agrammatic variant of primary progressive aphasia (nfvPPA) is a neurodegenerative syndrome primarily defined by the presence of apraxia of speech (AoS) and/or expressive agrammatism. In addition, many patients exhibit dysarthria and/or receptive agrammatism. This leads to substantial phenotypic variation within the speech-language domain across individuals and time, in terms of both the specific combination of symptoms as well as their severity. How to resolve such phenotypic heterogeneity in nfvPPA is a matter of debate. 'Splitting' views propose separate clinical entities: 'primary progressive apraxia of speech' when AoS occurs in the absence of expressive agrammatism, 'progressive agrammatic aphasia' (PAA) in the opposite case, and 'AOS + PAA' when mixed motor speech and language symptoms are clearly present. While therapeutic interventions typically vary depending on the predominant symptom (e.g. AoS versus expressive agrammatism), the existence of behavioural, anatomical and pathological overlap across these phenotypes argues against drawing such clear-cut boundaries. In the current study, we contribute to this debate by mapping behaviour to brain in a large, prospective cohort of well characterized patients with nfvPPA (n = 104). We sought to advance scientific understanding of nfvPPA and the neural basis of speech-language by uncovering where in the brain the degree of MRI-based atrophy is associated with inter-patient variability in the presence and severity of AoS, dysarthria, expressive agrammatism or receptive agrammatism. Our cross-sectional examination of brain-behaviour relationships revealed three main observations. First, we found that the neural correlates of AoS and expressive agrammatism in nfvPPA lie side by side in the left posterior inferior frontal lobe, explaining their behavioural dissociation/association in previous reports.
Second, we identified a 'left-right' and 'ventral-dorsal' neuroanatomical distinction between AoS versus dysarthria, highlighting (i) that dysarthria, but not AoS, is significantly influenced by tissue loss in right-hemisphere motor-speech regions; and (ii) that, within the left hemisphere, dysarthria and AoS map onto dorsally versus ventrally located motor-speech regions, respectively. Third, we confirmed that, within the large-scale grammar network, left frontal tissue loss is preferentially involved in expressive agrammatism and left temporal tissue loss in receptive agrammatism. Our findings thus help to define the function and location of the epicentres within the large-scale neural networks vulnerable to neurodegenerative changes in nfvPPA. We propose that nfvPPA be redefined as an umbrella term subsuming a spectrum of speech and/or language phenotypes that are closely linked by the underlying neuroanatomy and neuropathology.

Project description: Deficits in fluent speech production following left hemisphere stroke are a central concern because of their impact on patients' lives and the insight they provide about the neural organization of language processing. Fluent speech production requires the rapid coordination of phonological, semantic, and syntactic processing, so this study examined how deficits in connected speech relate to these language sub-systems. Behavioural data (N = 69 participants with aphasia following left hemisphere stroke) consisted of a diverse and comprehensive set of narrative speech production measures and measures of overall severity, semantic deficits, and phonological deficits. These measures were entered into a principal component analysis with bifactor rotation, a latent structure model in which each item loads on a general factor that reflects what is common among the items, and on orthogonal factors that explain variance not accounted for by the general factor. Lesion data were available for 58 of the participants, and each factor score was analysed with multivariate lesion-symptom mapping. Effects of connectivity disruption were evaluated using robust regression with tract disconnection or graph theoretic measures of connectivity as predictors. The principal component analysis produced a four-factor solution that accounted for 70.6% of the variance in the data, with a general factor corresponding to the overall severity and length and complexity of speech output (complexity factor), a lexical syntax factor, and independent factors for Semantics and Phonology. Deficits in the complexity of speech output were associated with a large temporo-parietal region, similar to overall aphasia severity.
The lexical syntax factor was associated with damage in a relatively small set of fronto-parietal regions which may reflect the recruitment of control systems to support retrieval and correct usage of lexical items that primarily serve a syntactic rather than semantic function. Tract-based measures of connectivity disruption were not statistically associated with the deficit scores after controlling for overall lesion volume. Language network efficiency and average clustering coefficient within the language network were significantly associated with deficit scores after controlling for overall lesion volume. These results highlight overall severity as the critical contributor to fluent speech in post-stroke aphasia, with a dissociable factor corresponding to lexical syntax.

Project description: Patients with non-fluent aphasias display impairments of expressive and receptive grammar. This has been attributed to deficits in processing configurational and hierarchical sequencing relationships. This hypothesis had not been formally tested. It was also controversial whether impairments are specific to language, or reflect domain general deficits in processing structured auditory sequences. Here we used an artificial grammar learning paradigm to compare the abilities of controls to participants with agrammatic aphasia of two different aetiologies: stroke and frontotemporal dementia. Ten patients with non-fluent variant primary progressive aphasia (nfvPPA), 12 with non-fluent aphasia due to stroke, and 11 controls implicitly learned a novel mixed-complexity artificial grammar designed to assess processing of increasingly complex sequencing relationships. We compared response profiles for otherwise identical sequences of speech tokens (nonsense words) and tone sweeps. In all three groups the ability to detect grammatical violations varied with sequence complexity, with performance improving over time and being better for adjacent than non-adjacent relationships. Patients performed less well than controls overall, and this was related more strongly to aphasia severity than to aetiology. All groups improved with practice and performed well at a control task of detecting oddball nonwords. Crucially, group differences did not interact with sequence complexity, demonstrating that aphasic patients were not disproportionately impaired on complex structures. Hierarchical cluster analysis revealed that response patterns were very similar across all three groups, but very different between the nonsense word and tone tasks, despite identical artificial grammar structures.
Overall, we demonstrate that agrammatic aphasics of two different aetiologies are not disproportionately impaired on complex sequencing relationships, and that the learning of phonological and non-linguistic sequences occurs independently. The similarity of profiles of discriminatory abilities and rule learning across groups suggests that insights from previous studies of implicit sequence learning in vascular aphasia are likely to prove applicable in nfvPPA.

Project description: The non-fluent/agrammatic variant of primary progressive aphasia (nfvPPA) presents with a gradual decline in grammar and motor speech resulting from selective degeneration of speech-language regions in the brain. There has been considerable progress in identifying treatment approaches to remediate language deficits in other primary progressive aphasia variants; however, interventions for the core deficits in nfvPPA have yet to be systematically investigated. Further, the neural mechanisms that support behavioural restitution in the context of neurodegeneration are not well understood. We examined the immediate and long-term benefits of video implemented script training for aphasia (VISTA) in 10 individuals with nfvPPA. The treatment approach involved repeated rehearsal of individualized scripts via structured treatment with a clinician as well as intensive home practice with an audiovisual model using 'speech entrainment'. We evaluated accuracy of script production as well as overall intelligibility and grammaticality for trained and untrained scripts. These measures and standardized test scores were collected at post-treatment and 3-, 6-, and 12-month follow-up visits. Treatment resulted in significant improvement in production of correct, intelligible scripted words for trained topics, a reduction in grammatical errors for trained topics, and an overall increase in intelligibility for trained as well as untrained topics at post-treatment. Follow-up testing revealed maintenance of gains for trained scripts up to 1 year post-treatment on the primary outcome measure. Performance on untrained scripts and standardized tests remained relatively stable during the follow-up period, indicating that treatment helped to stabilize speech and language despite disease progression.
To identify neural predictors of responsiveness to intervention, we examined treatment effect sizes relative to grey matter volumes in regions of interest derived from a previously identified speech production network. Regions of significant atrophy within this network included bilateral inferior frontal cortices and supplementary motor area as well as left striatum. Volumes in a left middle/inferior temporal region of interest were significantly correlated with the magnitude of treatment effects. This region, which was relatively spared anatomically in nfvPPA patients, has been implicated in syntactic production as well as visuo-motor facilitation of speech. This is the first group study to document the benefits of behavioural intervention that targets both linguistic and motoric deficits in nfvPPA. Findings indicate that behavioural intervention may result in lasting and generalized improvement of communicative function in individuals with neurodegenerative disease and that the integrity of spared regions within the speech-language network may be an important predictor of treatment response.
