Project description:Online citizen science projects such as GalaxyZoo1, Eyewire2 and Phylo3 have proven very successful for data collection, annotation and processing, but for the most part have harnessed human pattern-recognition skills rather than human creativity. An exception is the game EteRNA4, in which game players learn to build new RNA structures by exploring the discrete two-dimensional space of Watson-Crick base pairing possibilities. Building new proteins, however, is a more challenging task to present in a game, as both the representation and evaluation of a protein structure are intrinsically three-dimensional. We posed the challenge of de novo protein design in the online protein-folding game Foldit5. Players were presented with a fully extended peptide chain and challenged to craft a folded protein structure and an amino acid sequence encoding that structure. After many iterations of player design, analysis of the top-scoring solutions and subsequent game improvement, Foldit players can now-starting from an extended polypeptide chain-generate a diversity of protein structures and sequences that encode them in silico. One hundred forty-six Foldit player designs with sequences unrelated to naturally occurring proteins were encoded in synthetic genes; 56 were found to be expressed and soluble in Escherichia coli, and to adopt stable monomeric folded structures in solution. The diversity of these structures is unprecedented in de novo protein design, representing 20 different folds-including a new fold not observed in natural proteins. High-resolution structures were determined for four of the designs, and are nearly identical to the player models. This work makes explicit the considerable implicit knowledge that contributes to success in de novo protein design, and shows that citizen scientists can discover creative new solutions to outstanding scientific challenges such as the protein design problem.
Project description:We are currently in the midst of Earth's sixth extinction event, and measuring biodiversity trends in space and time is essential for prioritizing limited resources for conservation. At the same time, the scope of the necessary biodiversity monitoring is overwhelming funding for professional scientific monitoring. In response, scientists are increasingly using citizen science data to monitor biodiversity. But citizen science data are 'noisy', with redundancies and gaps arising from unstructured human behaviours in space and time. We ask whether the information content of these data can be maximized for the express purpose of trend estimation. We develop and execute a novel framework which assigns every citizen science sampling event a marginal value, derived from the importance of an observation to our understanding of overall population trends. We then make this framework predictive, estimating the expected marginal value of future biodiversity observations. We find that past observations are useful in forecasting where high-value observations will occur in the future. Interestingly, we find high value in both 'hotspots', which are frequently sampled locations, and 'coldspots', which are areas far from recent sampling, suggesting that an optimal sampling regime balances 'hotspot' sampling with a spread across the landscape.
Project description:Technological advances and data availability have enabled artificial intelligence-driven tools that can increasingly successfully assist in identifying species from images. Especially within citizen science, an emerging source of information filling the knowledge gaps needed to solve the biodiversity crisis, such tools can allow participants to recognize and report more poorly known species. This can be an important tool in addressing the substantial taxonomic bias in biodiversity data, where broadly recognized, charismatic species are highly over-represented. Meanwhile, the recognition models are trained using the same biased data, so it is important to consider what additional images are needed to improve recognition models. In this study, we investigated how the amount of training data influenced the performance of species recognition models for various taxa. We utilized a large citizen science dataset collected in Norway, where images are added independently from identification. We demonstrate that while adding images of currently under-represented taxa will generally improve recognition models more, there are important deviations from this general pattern. Thus, a more focused prioritization of data collection beyond the basic paradigm that “more is better” is likely to significantly improve species recognition models and advance the representativeness of biodiversity data.
Project description:Citizen scientists play an increasingly important role in biodiversity monitoring. Most of the data, however, are unstructured-collected by diverse methods that are not documented with the data. Insufficient understanding of the data collection processes presents a major barrier to the use of citizen science data in biodiversity research. We developed a questionnaire to ask citizen scientists about their decision-making before, during and after collecting and reporting species observations, using Germany as a case study. We quantified the greatest sources of variability among respondents and assessed whether motivations and experience related to any aspect of data collection. Our questionnaire was answered by almost 900 people, with varying taxonomic foci and expertise. Respondents were most often motivated by improving species knowledge and supporting conservation, but there were no linkages between motivations and data collection methods. By contrast, variables related to experience and knowledge, such as membership of a natural history society, were linked with a greater propensity to conduct planned searches, during which typically all species were reported. Our findings have implications for how citizen science data are analysed in statistical models; highlight the importance of natural history societies and provide pointers to where citizen science projects might be further developed.
Project description:Research benefits increasingly from valuable contributions by citizen scientists. Mostly, participating adults investigate specific species, ecosystems or phenology to address conservation issues, but ecosystem functions supporting ecosystem health are rarely addressed and other demographic groups rarely involved. As part of a project investigating seed predation and dispersal as ecosystem functions along an urban-rural gradient, we tested whether elementary school children can contribute to the project as citizen scientists. Specifically, we compared data estimating vegetation cover, measuring vegetation height and counting seeds from a seed removal experiment, that were collected by children and scientists in schoolyards. Children counted seeds similarly to scientists but under- or overestimated vegetation cover and measured different heights. We conclude that children can be involved as citizen scientists in research projects according to their skill level. However, more sophisticated tasks require specific training to become familiarized with scientific experiments and the development of needed skills and methods.
Project description:Residents in rural Kentucky (KY) and suburban Ohio (OH) expressed concerns about radon exposure and lung cancer. Although 85% of lung cancer cases are caused by tobacco smoke, radon exposure accounts for 10-15% of lung cancer cases. Academic and community members from the University of KY and the University of Cincinnati developed and pilot-tested a family-centered, youth-engaged home radon testing toolkit. The radon toolkit included radon information, and how to test, interpret, and report back findings. We educated youth as citizen scientists and their teachers in human subjects protection and home radon testing using the toolkit in the classroom. Youth citizen scientists explained the study to their parents and obtained informed consent. One hundred students were trained in human subjects protection, 27 had parental permission to be citizen scientists, and 18 homeowners completed surveys. Radon values ranged from < 14.8 Bq/m3 to 277.5 Bq/m3. Youth were interested and engaged in citizen science and this family-centered, school-based project provided a unique opportunity to further the healthy housing and quality education components of the Sustainable Development Goals for 2030. Further research is needed to test the impact of student-led, family-centered citizen science projects in environmental health as part of school curricula.
Project description:To facilitate longer duration space travel, flight crew sickness and disease transmission amongst the crew must be eliminated. High contact surfaces within space vehicles provide an opportunity for bacterial adhesion, which can lead to biofilm formation or disease transmission. This study evaluates the performance of several nonfouling polymers using citizen science, to identify the best performing chemistry for future applications as bacteria resistant coatings. The specific polymer chemistries tested were zwitterionic sulfobetaine methacrylate (SBMA), and polyampholytes composed of [2-(acryloyloxy)ethyl] trimethylammonium chloride and 2-carboxyethyl acrylate (TMA/CAA), or TMA and 3-sulfopropyl methacrylate (TMA/SA). Each polymer chemistry is known to exhibit bacteria resistance, and this study provides a direct side-by-side comparison between the chemistries using a citizen science approach. Nearly 100 citizen scientists returned results comparing the performance of these polymers over repeat exposure to bacteria and 30 total days of growth. The results demonstrate that TMA/CAA polyampholyte hydrogels show the best long-term resistance to bacteria adhesion.
Project description:Citizen science is an increasingly popular way of engaging volunteers in the collection of scientific data. Despite this, data quality remains a concern and there is little published evidence about the accuracy of records generated by citizen scientists. Here we compare data generated by two British citizen science projects, Blooms for Bees and BeeWatch, to determine the ability of volunteer recorders to identify bumblebee (Bombus) species. We assessed recorders' identification ability in two ways-as recorder accuracy (the proportion of expert-verified records correctly identified by recorders) and recorder success (the proportion of recorder-submitted identifications confirmed correct by verifiers). Recorder identification ability was low (<50% accuracy; <60% success), despite access to project specific bumblebee identification materials. Identification ability varied significantly depending on bumblebee species, with recorders most able to correctly identify species with distinct appearances. Blooms for Bees recorders (largely recruited from the gardening community) were markedly less able to identify bumblebees than BeeWatch recorders (largely individuals with a more specific interest in bumblebees). Within both projects, recorders demonstrated an improvement in identification ability over time. Here we demonstrate and quantify the essential role of expert verification within citizen science projects, and highlight where resources could be strengthened to improve recorder ability.
Project description:With the rapid improvement of cryo-electron microscopy (cryo-EM) resolution, new computational tools are needed to assist and improve upon atomic model building and refinement options. This communication demonstrates that microscopists can now collaborate with the players of the computer game Foldit to generate high-quality de novo structural models. This development could greatly speed the generation of excellent cryo-EM structures when used in addition to current methods.
Project description:This article presents a geolocated dataset of rural home annotations on very high resolution satellite imagery from Uganda, Kenya, and Sierra Leone. This dataset was produced through a citizen science project called "Power to the People", which mapped rural homes for electrical infrastructure planning and computer-vision-based mapping. Additional details on this work are presented in "Power to the People: Applying citizen science to home-level mapping for rural energy access" [1]. 578,010 home annotations were made on approximately 1,267 km2 of imagery over 179 days by over 6,000 volunteers. The bounding-box annotations produced in this work have been anonymized and georeferenced. These raw annotations were found to have a precision of 49% and recall of 93% compared to a researcher-generated set of gold standard annotations. Data on roof colour and shape were also collected and are provided. Metadata about the sensors used to capture the original images and the annotation process are also attached to data records. While this dataset was collected for electrical infrastructure planning research, it can be useful in diverse sectors, including humanitarian assistance and public health.