scholarly journals Perspective: The Power (Dynamics) of Open Data in Citizen Science

2021 ◽  
Vol 3 ◽  
Author(s):  
Caren B. Cooper ◽  
Lisa M. Rasmussen ◽  
Elizabeth D. Jones

In citizen science, data stewards and data producers are often not the same people. When those who have labored on data collection are not in control of the data, ethical problems could arise from this basic structural feature. In this Perspective, we advance the proposition that stewarding data sets generated by volunteers involves the typical technical decisions in conventional research plus a suite of ethical decisions stemming from the relationship between professionals and volunteers. Differences in power, priorities, values, and vulnerabilities are features of the relationship between professionals and volunteers. Thus, ethical decisions about open data practices in citizen science include, but are not limited to, questions grounded in respect for volunteers: who decides data governance structures, who receives attribution for a data set, which data are accessible and to whom, and whose interests are served by the data use/re-use. We highlight ethical issues that citizen science practitioners should consider when making data governance decisions, particularly with respect to open data.

2021 ◽  
Author(s):  
Viviane Zulian ◽  
David A. W. Miller ◽  
Goncalo Ferraz

Mapping species distributions is a crucial but challenging requirement of wildlife management. The frequent need to sample vast expanses of potential habitat increases the cost of planned surveys and rewards accumulation of opportunistic observations. In this paper, we integrate planned survey data from roost counts with opportunistic samples from eBird, WikiAves and Xeno-canto citizen-science platforms to map the geographic range of the endangered Vinaceous-breasted Parrot. We demonstrate the estimation and mapping of species occurrence based on data integration while accounting for specifics of each data set, including observation technique and uncertainty about the observations. Our analysis illustrates 1) the incorporation of sampling effort, spatial autocorrelation, and site covariates in a joint-likelihood, hierarchical, data-integration model; 2) the evaluation of the contribution of each data set, as well as the contribution of effort covariates, spatial autocorrelation, and site covariates to the predictive ability of fitted models using a cross-validation approach; and 3) how spatial representation of the latent occupancy state (i.e. realized occupancy) helps identify areas with high uncertainty that should be prioritized in future field work. Our results reveal a Vinaceous-breasted Parrot geographic range of 434,670 square kilometers, which is three times larger than the Extant area previously reported in the IUCN Red List. The exclusion of one data set at a time from the analyses always resulted in worse predictions by the models of truncated data than by the full model, which included all data sets. Likewise, exclusion of spatial autocorrelation, site covariates, or sampling effort resulted in worse predictions. The integration of different data sets into one joint-likelihood model produced a more reliable representation of the species range than any individual data set taken on its own improving the use of citizen science data in combination with planned survey results.


2021 ◽  
Vol 99 (Supplement_1) ◽  
pp. 218-219
Author(s):  
Andres Fernando T Russi ◽  
Mike D Tokach ◽  
Jason C Woodworth ◽  
Joel M DeRouchey ◽  
Robert D Goodband ◽  
...  

Abstract The swine industry has been constantly evolving to select animals with improved performance traits and to minimize variation in body weight (BW) in order to meet packer specifications. Therefore, understanding variation presents an opportunity for producers to find strategies that could help reduce, manage, or deal with variation of pigs in a barn. A systematic review and meta-analysis was conducted by collecting data from multiple studies and available data sets in order to develop prediction equations for coefficient of variation (CV) and standard deviation (SD) as a function of BW. Information regarding BW variation from 16 papers was recorded to provide approximately 204 data points. Together, these data included 117,268 individually weighed pigs with a sample size that ranged from 104 to 4,108 pigs. A random-effects model with study used as a random effect was developed. Observations were weighted using sample size as an estimate for precision on the analysis, where larger data sets accounted for increased accuracy in the model. Regression equations were developed using the nlme package of R to determine the relationship between BW and its variation. Polynomial regression analysis was conducted separately for each variation measurement. When CV was reported in the data set, SD was calculated and vice versa. The resulting prediction equations were: CV (%) = 20.04 – 0.135 × (BW) + 0.00043 × (BW)2, R2=0.79; SD = 0.41 + 0.150 × (BW) - 0.00041 × (BW)2, R2 = 0.95. These equations suggest that there is evidence for a decreasing quadratic relationship between mean CV of a population and BW of pigs whereby the rate of decrease is smaller as mean pig BW increases from birth to market. Conversely, the rate of increase of SD of a population of pigs is smaller as mean pig BW increases from birth to market.


2017 ◽  
Vol 3 (5) ◽  
pp. e192 ◽  
Author(s):  
Corina Anastasaki ◽  
Stephanie M. Morris ◽  
Feng Gao ◽  
David H. Gutmann

Objective:To ascertain the relationship between the germline NF1 gene mutation and glioma development in patients with neurofibromatosis type 1 (NF1).Methods:The relationship between the type and location of the germline NF1 mutation and the presence of a glioma was analyzed in 37 participants with NF1 from one institution (Washington University School of Medicine [WUSM]) with a clinical diagnosis of NF1. Odds ratios (ORs) were calculated using both unadjusted and weighted analyses of this data set in combination with 4 previously published data sets.Results:While no statistical significance was observed between the location and type of the NF1 mutation and glioma in the WUSM cohort, power calculations revealed that a sample size of 307 participants would be required to determine the predictive value of the position or type of the NF1 gene mutation. Combining our data set with 4 previously published data sets (n = 310), children with glioma were found to be more likely to harbor 5′-end gene mutations (OR = 2; p = 0.006). Moreover, while not clinically predictive due to insufficient sensitivity and specificity, this association with glioma was stronger for participants with 5′-end truncating (OR = 2.32; p = 0.005) or 5′-end nonsense (OR = 3.93; p = 0.005) mutations relative to those without glioma.Conclusions:Individuals with NF1 and glioma are more likely to harbor nonsense mutations in the 5′ end of the NF1 gene, suggesting that the NF1 mutation may be one predictive factor for glioma in this at-risk population.


2019 ◽  
Vol 75 (4) ◽  
pp. 53-58
Author(s):  
K. S. Rohozinnikova

The author of the article has provided the results of the analysis of the methods of administrative and legal protection taking into account the changes within the relations between public administration and taxpayers and the course chosen by the state for liberalization of tax relations. The place of the concept of the methods of administrative and legal protection in the term system of the science of administrative law and their dialectical relations with the methods of public administration and administrative activity has been established. The author has indicated generic and specific features of the methods of administrative and legal protection of tax relations, where the latter will depend on the peculiarities of the means and methods of influence used by the public administration for the purpose of exercising security functions. The system of methods of administrative and legal protection of tax relations has been offered to form from three elements: general methods of administrative activity (persuasion and coercion), service tools of influence (provision of administrative services, creation of electronic services and publication of open data sets) and organizational methods. The expediency of distinguishing service means of influence into a separate group of methods of administrative and legal protection has been proved. It is conditioned by their special functional purpose – creation of conditions for independent prevention of possible breach of protected relations by the taxpayer. The role and correlation of persuasion and coercion in the system of methods of administrative and legal protection of tax relations have been clarified. Despite the presented importance of the persuasion within the relationship between the controlling agencies and the taxpayers, it has been stated that state coercion remains the main mean of administrative and legal protection of tax relations. Particular attention has been paid on the need to reconsider the correlation of tax and administrative coercion within tax relations. It has been proved that the basis of their delimitation should be not the branch of legislation, where the authority to apply the appropriate measure is assigned, but the essential criterion and the subject matter of regulation (influence) – relations arising from incomplete calculation and late and incomplete payment of taxes and fees, or relationships related to the organization and enforcement of tax responsibilities and the proper exercise by the supervisory authorities of their powers. It has been emphasized that tax coercion, unlike administrative, performs both punitive and compensatory functions.


2017 ◽  
Vol 44 (2) ◽  
pp. 203-229 ◽  
Author(s):  
Javier D Fernández ◽  
Miguel A Martínez-Prieto ◽  
Pablo de la Fuente Redondo ◽  
Claudio Gutiérrez

The publication of semantic web data, commonly represented in Resource Description Framework (RDF), has experienced outstanding growth over the last few years. Data from all fields of knowledge are shared publicly and interconnected in active initiatives such as Linked Open Data. However, despite the increasing availability of applications managing large-scale RDF information such as RDF stores and reasoning tools, little attention has been given to the structural features emerging in real-world RDF data. Our work addresses this issue by proposing specific metrics to characterise RDF data. We specifically focus on revealing the redundancy of each data set, as well as common structural patterns. We evaluate the proposed metrics on several data sets, which cover a wide range of designs and models. Our findings provide a basis for more efficient RDF data structures, indexes and compressors.


Author(s):  
Liah Shonhe

The main focus of the study was to explore the practices of open data sharing in the agricultural sector, including establishing the research outputs concerning open data in agriculture. The study adopted a desktop research methodology based on literature review and bibliographic data from WoS database. Bibliometric indicators discussed include yearly productivity, most prolific authors, and enhanced countries. Study findings revealed that research activity in the field of agriculture and open access is very low. There were 36 OA articles and only 6 publications had an open data badge. Most researchers do not yet embrace the need to openly publish their data set despite the availability of numerous open data repositories. Unfortunately, most African countries are still lagging behind in management of agricultural open data. The study therefore recommends that researchers should publish their research data sets as OA. African countries need to put more efforts in establishing open data repositories and implementing the necessary policies to facilitate OA.


Sensors ◽  
2020 ◽  
Vol 20 (3) ◽  
pp. 879 ◽  
Author(s):  
Uwe Köckemann ◽  
Marjan Alirezaie ◽  
Jennifer Renoux ◽  
Nicolas Tsiftes ◽  
Mobyen Uddin Ahmed ◽  
...  

As research in smart homes and activity recognition is increasing, it is of ever increasing importance to have benchmarks systems and data upon which researchers can compare methods. While synthetic data can be useful for certain method developments, real data sets that are open and shared are equally as important. This paper presents the E-care@home system, its installation in a real home setting, and a series of data sets that were collected using the E-care@home system. Our first contribution, the E-care@home system, is a collection of software modules for data collection, labeling, and various reasoning tasks such as activity recognition, person counting, and configuration planning. It supports a heterogeneous set of sensors that can be extended easily and connects collected sensor data to higher-level Artificial Intelligence (AI) reasoning modules. Our second contribution is a series of open data sets which can be used to recognize activities of daily living. In addition to these data sets, we describe the technical infrastructure that we have developed to collect the data and the physical environment. Each data set is annotated with ground-truth information, making it relevant for researchers interested in benchmarking different algorithms for activity recognition.


BMJ Open ◽  
2016 ◽  
Vol 6 (10) ◽  
pp. e011784 ◽  
Author(s):  
Anisa Rowhani-Farid ◽  
Adrian G Barnett

ObjectiveTo quantify data sharing trends and data sharing policy compliance at the British Medical Journal (BMJ) by analysing the rate of data sharing practices, and investigate attitudes and examine barriers towards data sharing.DesignObservational study.SettingThe BMJ research archive.Participants160 randomly sampled BMJ research articles from 2009 to 2015, excluding meta-analysis and systematic reviews.Main outcome measuresPercentages of research articles that indicated the availability of their raw data sets in their data sharing statements, and those that easily made their data sets available on request.Results3 articles contained the data in the article. 50 out of 157 (32%) remaining articles indicated the availability of their data sets. 12 used publicly available data and the remaining 38 were sent email requests to access their data sets. Only 1 publicly available data set could be accessed and only 6 out of 38 shared their data via email. So only 7/157 research articles shared their data sets, 4.5% (95% CI 1.8% to 9%). For 21 clinical trials bound by the BMJ data sharing policy, the per cent shared was 24% (8% to 47%).ConclusionsDespite the BMJ's strong data sharing policy, sharing rates are low. Possible explanations for low data sharing rates could be: the wording of the BMJ data sharing policy, which leaves room for individual interpretation and possible loopholes; that our email requests ended up in researchers spam folders; and that researchers are not rewarded for sharing their data. It might be time for a more effective data sharing policy and better incentives for health and medical researchers to share their data.


2020 ◽  
pp. 20200375
Author(s):  
Min-Suk Heo ◽  
Jo-Eun Kim ◽  
Jae-Joon Hwang ◽  
Sang-Sun Han ◽  
Jin-Soo Kim ◽  
...  

Artificial intelligence, which has been actively applied in a broad range of industries in recent years, is an active area of interest for many researchers. Dentistry is no exception to this trend, and the applications of artificial intelligence are particularly promising in the field of oral and maxillofacial (OMF) radiology. Recent researches on artificial intelligence in OMF radiology have mainly used convolutional neural networks, which can perform image classification, detection, segmentation, registration, generation, and refinement. Artificial intelligence systems in this field have been developed for the purposes of radiographic diagnosis, image analysis, forensic dentistry, and image quality improvement. Tremendous amounts of data are needed to achieve good results, and involvement of OMF radiologist is essential for making accurate and consistent data sets, which is a time-consuming task. In order to widely use artificial intelligence in actual clinical practice in the future, there are lots of problems to be solved, such as building up a huge amount of fine-labeled open data set, understanding of the judgment criteria of artificial intelligence, and DICOM hacking threats using artificial intelligence. If solutions to these problems are presented with the development of artificial intelligence, artificial intelligence will develop further in the future and is expected to play an important role in the development of automatic diagnosis systems, the establishment of treatment plans, and the fabrication of treatment tools. OMF radiologists, as professionals who thoroughly understand the characteristics of radiographic images, will play a very important role in the development of artificial intelligence applications in this field.


BioScience ◽  
2020 ◽  
Author(s):  
Corey T Callaghan ◽  
Alistair G B Poore ◽  
Thomas Mesaglio ◽  
Angela T Moles ◽  
Shinichi Nakagawa ◽  
...  

Abstract Citizen science is fundamentally shifting the future of biodiversity research. But although citizen science observations are contributing an increasingly large proportion of biodiversity data, they only feature in a relatively small percentage of research papers on biodiversity. We provide our perspective on three frontiers of citizen science research, areas that we feel to date have had minimal scientific exploration but that we believe deserve greater attention as they present substantial opportunities for the future of biodiversity research: sampling the undersampled, capitalizing on citizen science's unique ability to sample poorly sampled taxa and regions of the world, reducing taxonomic and spatial biases in global biodiversity data sets; estimating abundance and density in space and time, develop techniques to derive taxon-specific densities from presence or absence and presence-only data; and capitalizing on secondary data collection, moving beyond data on the occurrence of single species and gain further understanding of ecological interactions among species or habitats. The contribution of citizen science to understanding the important biodiversity questions of our time should be more fully realized.


Sign in / Sign up

Export Citation Format

Share Document