Dataset Information

Identifying communicative functions in discourse with content types.

ABSTRACT: Texts are not monolithic entities but rather coherent collections of micro illocutionary acts which help to convey a unitary message of content and purpose. Identifying such text segments is challenging because they require a fine-grained level of analysis even within a single sentence. At the same time, accessing them facilitates the analysis of the communicative functions of a text as well as the identification of relevant information. We propose an empirical framework for modelling micro illocutionary acts at the clause level, which we call content types, grounded in linguistic theories of text types, in particular the framework proposed by Werlich in 1976. We make available a newly annotated corpus of 279 documents (for a total of more than 180,000 tokens) belonging to different genres and temporal periods, based on a dedicated annotation scheme. We obtain an average Cohen's kappa of 0.89 at token level. We achieve an average F1 score of 74.99% on the automatic classification of content types using a bi-LSTM model. Similar results are obtained on contemporary and historical documents, while performance across genres is more varied. This work promotes a discourse-oriented approach to information extraction and cross-fertilisation across disciplines through a computationally-aided linguistic analysis.
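
The token-level agreement figure reported in the abstract is a Cohen's kappa. As a rough illustration of how such a score is computed, the sketch below compares two annotators' labels token by token. The label names (TYPE_A, TYPE_B, NONE) and the six-token sample are hypothetical placeholders, not the corpus annotation scheme or its data, and the function is a generic textbook formulation rather than the authors' evaluation code.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two equal-length sequences of categorical labels."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two non-empty sequences of equal length")
    n = len(labels_a)
    # Observed agreement: share of tokens given the same label by both annotators.
    p_o = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement, estimated from each annotator's own label distribution.
    freq_a, freq_b = Counter(labels_a), Counter(labels_b)
    p_e = sum(freq_a[label] * freq_b[label] for label in freq_a) / (n * n)
    if p_e == 1.0:
        # Both annotators used a single identical label everywhere.
        return 1.0
    return (p_o - p_e) / (1 - p_e)


# Hypothetical token-level labels (one label per token), for illustration only.
annotator_1 = ["TYPE_A", "TYPE_A", "TYPE_B", "TYPE_B", "NONE", "NONE"]
annotator_2 = ["TYPE_A", "TYPE_A", "TYPE_B", "TYPE_A", "NONE", "NONE"]

kappa = cohen_kappa(annotator_1, annotator_2)
print(f"kappa = {kappa:.2f}")
```

In this toy sample the annotators agree on five of six tokens, and kappa discounts the part of that agreement expected by chance given how often each annotator uses each label.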

SUBMITTER: Caselli T

PROVIDER: S-EPMC8335719 | biostudies-literature | 2022

REPOSITORIES: biostudies-literature

Publications

Identifying communicative functions in discourse with content types.

Caselli Tommaso, Sprugnoli Rachele, Moretti Giovanni

Language resources and evaluation, 2021-08-04, 2

PMID: 34366751

Similar Datasets

Project description: Background: The ability to communicate is a prerequisite for participation in today's society. To measure participation in adults with communication disorders, the Communicative Participation Item Bank (CPIB) was developed in 2006. Since then, several new PROMs have been developed to measure communication and the impact of communication disorders on participation. Moreover, the CPIB items do not all appear to be relevant to certain populations with communication problems, and the context of communicative participation is changing rapidly, given the increased use of digital communication forms. The purpose of this study was to identify new PROMs developed since 2006 that aim to measure (aspects of) communication, in order to select items that are suitable for expanding the Communicative Participation Item Bank to make the item bank more widely applicable (e.g., to the hearing-impaired population) and tailored to the current societal context. Methods: Medline and Embase were used to search for PROMs that aim to measure (aspects of) communication. Each new PROM, as well as the CPIB, was evaluated to determine to what extent it contains items that measure communicative participation and to what extent these items capture all communicative participation domains by linking each item to the ICF Activities and Participation domains. Results: This study identified 31 new PROMs, containing 391 items that were labelled as measuring communicative participation. The majority of the 391 items measure aspects of ICF Activities and Participation domain 'communication', followed by the domain 'interpersonal interactions and relationships'. The other ICF Activity and Participation domains were less often addressed. Analysis of the CPIB showed that items do not cover all domains of participation as defined in the ICF, such as the 'major life areas' domain. Conclusions: We found a potential pool of 391 items measuring communicative participation that could be considered for extending the CPIB. We found items in domains that are already present in the CPIB, but also items that relate to new domains, such as an item on talking with customers or clients for the 'major life areas' domain. Inclusion of new items in other domains would benefit the comprehensiveness of the item bank.

Project description: Background: It is possible that tailoring dietary approaches to an individual's genomic profile could provide optimal dietary inputs for biological functioning and support adherence to dietary management protocols. The science required for such nutrigenetic and nutrigenomic profiling is not yet considered ready for broad application by the scientific and medical communities; however, many personalized nutrition products are available in the marketplace, creating the potential for hype and misleading information on social media. Twitter provides a unique big data source that provides real-time information. Therefore, it has the potential to disseminate evidence-based health information, as well as misinformation. Objective: We sought to characterize the landscape of precision nutrition content on Twitter, with a specific focus on nutrigenetics and nutrigenomics. We focused on tweet authors, types of content, and presence of misinformation. Methods: Twitter Archiver was used to capture tweets from September 1, 2020, to December 1, 2020, using keywords related to nutrition and genetics. A random sample of tweets was coded using quantitative content analysis by 4 trained coders. Codebook-driven, quantified information about tweet authors, content details, information quality, and engagement metrics were compiled and analyzed. Results: The most common categories of tweets were precision nutrition products and nutrigenomic concepts. About a quarter (132/504, 26.2%) of tweet authors presented themselves as science experts, medicine experts, or both. Nutrigenetics concepts most frequently came from authors with science and medicine expertise, and tweets about the influence of genes on weight were more likely to come from authors with neither type of expertise. A total of 14.9% (75/504) of the tweets were noted to contain untrue information; these were most likely to occur in the nutrigenomics concepts topic category. Conclusions: By evaluating social media discourse on precision nutrition on Twitter, we made several observations about the content available in the information environment through which individuals can learn about related concepts and products. Tweet content was consistent with the indicators of medical hype, and the inclusion of potentially misleading and untrue information was common. We identified a contingent of users with scientific and medical expertise who were active in discussing nutrigenomics concepts and products and who may be encouraged to share credible expert advice on precision nutrition and tackle false information as this technology develops.

Project description: Importance: Rap artists are among the most recognizable celebrities in the US, serving as role models to an increasingly diverse audience of listeners. Through their lyrics, these artists have the potential to shape mental health discourse and reduce stigma. Objective: To investigate the prevalence and nature of mental health themes in popular rap music amid a period of documented increases in mental health distress and suicide risk among young people in the US and young Black/African American male individuals in particular. Design and setting: Lyric sheets from the 25 most popular rap songs in the US in 1998, 2003, 2008, 2013, and 2018, totaling 125 songs, were analyzed by 2 trained coders from March 1 to April 15, 2019, for references to anxiety, depression, suicide, metaphors suggesting mental health struggles, and stressors associated with mental health risk. Main outcomes and measures: Mental health references were identified and categorized based on Diagnostic and Statistical Manual of Mental Disorders (Fifth Edition) and Mayo Clinic definitions. Stressors included issues with authorities, environmental conditions, work, and love life. Descriptive language and trend analyses were used to examine changes over time in the proportion of songs with mental health references. Stressors were analyzed for their co-occurrence with mental health references. Results: Most of the 125 analyzed songs featured lead artists from North America (123 [98%]). Most lead artists were Black/African American male individuals (97 [78%]), and artists' mean (SD) age was 28.2 (4.5) years. Across the sample, 35 songs (28%) referenced anxiety; 28 (22%) referenced depression; 8 (6%) referenced suicide; and 26 (21%) used a mental health metaphor. Significant increases were found from 1998 to 2018 in the proportion of songs referencing suicide (0% to 12%), depression (16% to 32%), and mental health metaphors (8% to 44%). Stressors related to environmental conditions (adjusted odds ratio, 8.1; 95% CI, 2.1-32.0) and love life (adjusted odds ratio, 4.8; 95% CI, 1.3-18.1) were most likely to co-occur with lyrics referencing mental health. Conclusions and relevance: References to mental health struggles have increased significantly in popular rap music from 1998 to 2018. Future research is needed to examine the potential positive and negative effects these increasingly prevalent messages may have in shaping mental health discourse and behavioral intentions for US youth.

Project description: Drug addiction can cause severe damage to the human brain, leading to significant problems in cognitive processing, such as irritability, speech distortions, and exaggeration of negative stimuli. Speech plays a fundamental role in social interaction, including both production and perception. The ability to perceive communicative functions conveyed through speech is crucial for successful interpersonal communication and for maintaining good social relationships. However, due to the limited number of previous studies, it remains unclear whether the cognitive disorder caused by drug addiction affects the perception of communicative functions conveyed in Mandarin speech. To address this question, we conducted a perception experiment involving sixty male participants, including 25 heroin addicts and 35 healthy controls. The experiment aimed to examine the perception of three communicative functions (i.e., statement, interrogative, and imperative) under three background noise conditions (i.e., no noise, SNR [Signal to Noise Ratio] = 10, and SNR = 0). Eight target sentences were first recorded by two native Mandarin speakers for each of the three communicative functions. Each half was then combined with Gaussian White Noise under two background noise conditions (i.e., SNR = 10 and SNR = 0). Finally, 48 speech stimuli were included in the experiment with four options provided for perceptual judgment. The results showed that, under the three noise conditions, the average perceptual accuracies of the three communicative functions were 80.66% and 38% for the control group and the heroin addicts, respectively. Significant differences were found in the perception of the three communicative functions between the control group and the heroin addicts under the three noise conditions, except for the recognition of the imperative under the strong noise condition (i.e., SNR = 0). Moreover, heroin addicts showed good accuracy (around 50%) in recognizing the imperative and poor accuracy (i.e., lower than the chance level) in recognizing the interrogative. This paper not only fills the research gap in the perception of communicative functions in Mandarin speech among drug addicts but also enhances the understanding of the effects of drugs on speech perception and provides a foundation for the speech rehabilitation of drug addicts.
