Dataset Information

Can we predict firms' innovativeness? The identification of innovation performers in an Italian region through a supervised learning approach.

ABSTRACT: The study shows the feasibility of predicting firms' expenditures in innovation, as reported in the Community Innovation Survey, by applying a supervised machine-learning approach to a sample of Italian firms. Using an integrated dataset of administrative records and balance sheet data, designed to include all informative variables related to innovation while remaining easily accessible for most of the cohort, a random forest algorithm is implemented to obtain a classification model that identifies firms that are potential innovation performers. The performance of the classifier, estimated in terms of AUC, is 0.794. Although innovation investments do not always result in patenting, the model is able to identify 71.92% of firms with patents. More encouraging results emerge from the analysis of the inner workings of the model: the predictors identified as most important (such as firm size, sector, and investment in intangible assets) confirm previous findings in the literature, but in a completely different framework. The outcomes of this study are considered relevant both for economic analysts, because they demonstrate the potential of data-driven models for understanding the nature of innovation behaviour, and for practitioners, such as policymakers or venture capitalists, who can benefit from evidence-based tools in the decision-making process.
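The abstract reports classifier performance as an AUC. As a minimal sketch of what that metric measures, and not the authors' code, the snippet below computes AUC as the share of (positive, negative) pairs in which the positive example receives the higher score, with ties counted as one half. The function name `roc_auc` and the toy labels and scores are hypothetical and are not taken from the study's data.

```python
def roc_auc(labels, scores):
    """Return the AUC: the probability that a randomly chosen positive
    example is scored higher than a randomly chosen negative one
    (ties count as 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = innovation performer, 0 = non-performer.
labels = [1, 1, 1, 0, 0, 0, 0, 1]
scores = [0.9, 0.8, 0.35, 0.4, 0.3, 0.2, 0.1, 0.6]
auc = roc_auc(labels, scores)
print(f"AUC = {auc:.4f}")
```

An AUC of 0.5 corresponds to random ranking and 1.0 to perfect separation, which is the scale on which the reported 0.794 should be read.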

SUBMITTER: Gandin I

PROVIDER: S-EPMC6559647 | biostudies-literature | 2019

REPOSITORIES: biostudies-literature

Publications

Can we predict firms' innovativeness? The identification of innovation performers in an Italian region through a supervised learning approach.

Gandin Ilaria, Cozza Claudio

PLoS ONE, 2019-06-11, issue 6

PMID: 31185045

Similar Datasets

Project description: The objective of the paper is to diagnose the organisational culture of selected universities and analyse its impact on the innovation processes within them. The subject matter of the study was organisational culture and innovation at universities. The subjects were four selected universities in Poland, Austria, Germany, and Ukraine. The paper provided a definition of organisational culture and its typology. It further discussed the organisational culture of universities and the relationships between organisational culture and innovativeness. The literature review provided foundations for building a model for the formation of a type of organisational culture at universities that is innovation-friendly, which is the added value of the paper. It offers actions worth taking to shape an innovation-friendly culture at universities. This is particularly important during the difficult time of a changing labour market, when universities greatly impact the attitudes of young people. The knowledge of how to shape innovation-friendly organisational culture at universities is necessary for academia to profile future employees in times of continuous change. To investigate the relationship between organisational culture and the innovativeness of universities, we designed an original survey questionnaire [S1 File]. Organisational culture was diagnosed with the Organizational Culture Assessment Instrument by K.S. Cameron and R.E. Quinn. The analyses were conducted in Dell Statistica v. 13.1 (StatSoft Polska). We normalised data from the Likert rating scale using Kaufman's and Rousseeuw's formula. We used Spearman's correlation coefficient and Kendall's W to calculate correlations. The research shows that the investigated Polish and Austrian universities are dominated by hierarchy and market cultures. On the other hand, the German and Ukrainian universities host all cultures, but clan and adhocracy dominate there. Moreover, the analyses demonstrated that although the adhocracy culture was the least visible in the investigated organisations, it contributes to university innovativeness the most. The conclusions were used to build a model for promoting innovation-friendly organisational culture at universities. The model contains answers to the research questions. In addition, it offers guidelines for shaping organisational culture to bolster innovation at universities. The research identified relationships between organisational culture and university innovativeness and components that create innovation opportunities at universities as its contribution to management theory. When applied in practice, the guidelines can help form the university's organisational culture bottom-up.

Project description: Background: Genetic interaction profiles are highly informative and helpful for understanding the functional linkages between genes, and therefore have been extensively exploited for annotating gene functions and dissecting specific pathway structures. However, our understanding is rather limited to the relationship between double concurrent perturbation and various higher level phenotypic changes, e.g. those in cells, tissues or organs. Modifier screens, such as synthetic genetic arrays (SGA), can help us to understand the phenotype caused by combined gene mutations. Unfortunately, exhaustive tests on all possible combined mutations in any genome are vulnerable to combinatorial explosion and are infeasible either technically or financially. Therefore, an accurate computational approach to predict genetic interaction is highly desirable, and such methods have the potential of alleviating the bottleneck on experiment design. Results: In this work, we introduce a computational systems biology approach for the accurate prediction of pairwise synthetic genetic interactions (SGI). First, a high-coverage and high-precision functional gene network (FGN) is constructed by integrating protein-protein interaction (PPI), protein complex and gene expression data; then, a graph-based semi-supervised learning (SSL) classifier is utilized to identify SGI, where the topological properties of protein pairs in the weighted FGN are used as input features of the classifier. We compare the proposed SSL method with the state-of-the-art supervised classifier, the support vector machine (SVM), on a benchmark dataset in S. cerevisiae to validate our method's ability to distinguish synthetic genetic interactions from non-interaction gene pairs. Experimental results show that the proposed method can accurately predict genetic interactions in S. cerevisiae (with a sensitivity of 92% and specificity of 91%). Notably, the SSL method is more efficient than SVM, especially for very small training sets and large test sets. Conclusions: We developed a graph-based SSL classifier for predicting the SGI. The classifier employs topological properties of the weighted FGN as input features and simultaneously employs information induced from labelled and unlabelled data. Our analysis indicates that the topological properties of the weighted FGN can be employed to accurately predict SGI. Also, the graph-based SSL method outperforms the traditional standard supervised approach, especially when used with small training sets. The proposed method can alleviate the experimental burden of exhaustive testing and provide a useful guide for the biologist in narrowing down the candidate gene pairs with SGI. The data and source code implementing the method are available from the website: http://home.ustc.edu.cn/~yzh33108/GeneticInterPred.htm.

Project description: Spontaneous activity is a common feature of immature neuronal networks throughout the central nervous system and plays an important role in network development and consolidation. In postnatal rodents, spontaneous activity in the spinal cord exhibits complex, stochastic patterns that have historically proven challenging to characterize. We developed a software tool for quickly and automatically characterizing and classifying episodes of spontaneous activity generated from developing spinal networks. We recorded spontaneous activity from in vitro lumbar ventral roots of 16 neonatal [postnatal day (P)0-P3] mice. Recordings were DC coupled and detrended, and episodes were separated for analysis. Amplitude-, duration-, and frequency-related features were extracted from each episode and organized into five classes. Paired classes and features were used to train and test supervised machine learning algorithms. Multilayer perceptrons were used to classify episodes as rhythmic or multiburst. We increased network excitability with potassium chloride and tested the utility of the tool to detect changes in features and episode class. We also demonstrate usability by having a novel experimenter use the program to classify episodes collected at a later time point (P5). Supervised machine learning-based classification of episodes accounted for changes that traditional approaches cannot detect. Our tool, named SpontaneousClassification, advances the detail in which we can study not only developing spinal networks, but also spontaneous networks in other areas of the nervous system. NEW & NOTEWORTHY: Spontaneous activity is important for nervous system network development and consolidation. Our software uses machine learning to automatically and quickly characterize and classify episodes of spontaneous activity in the spinal cord of newborn mice. It detected changes in network activity following KCl-enhanced excitation. Using our software to classify spontaneous activity throughout development, in pathological models, or with neuromodulation, may offer insight into the development and organization of spinal circuits.
