Using CMS Open Data in research – challenges and directions

2021 ◽  
Vol 251 ◽  
pp. 01004
Author(s):  
Kati Lassila-Perini ◽  
Clemens Lange ◽  
Edgar Carrera Jarrin ◽  
Matthew Bellis

The CMS experiment at CERN has released research-quality data from particle collisions at the LHC since 2014. Almost all data from the first LHC run in 2010–2012 with the corresponding simulated samples are now in the public domain, and several scientific studies have been performed using these data. This paper summarizes the available data and tools, reviews the challenges in using them in research, and discusses measures to improve their usability.

Author(s):  
Muhammad Habib Algifari ◽  
Winda Yulita ◽  
Eko Dwi Nugroho

In the current information age, data is a valuable asset for companies [1]. Technological advances encourage the digitization of information in almost all fields of science, including astronomy, and make it easy to access open data in the public domain. The availability of open data will encourage the acceleration of research. ITERA Astronomical Observatory is an observatory located in Lampung. This observatory is claimed to be the largest in Southeast Asia [3]. To face the challenges in the digitization of information, ITERA Astronomical Observatory plans to build an information system specifically for storing and handling astronomical data.


2020 ◽  
Vol 101 (12) ◽  
pp. E2113-E2132
Author(s):  
David M. L. Sills ◽  
Gregory A. Kopp ◽  
Lesley Elliott ◽  
Aaron L. Jaffe ◽  
Liz Sutherland ◽  
...  

Canada is a vast country with most of its population located along its southern border. Large areas are sparsely populated and/or heavily forested, and severe weather reports are rare when thunderstorms occur there. Thus, it has been difficult to accurately assess the true tornado climatology and risk. It is also important to establish a reliable baseline for tornado-related climate change studies. The Northern Tornadoes Project (NTP), led by Western University, is an ambitious multidisciplinary initiative aimed at detecting and documenting every tornado that occurs across Canada. A team of meteorologists and wind engineers collects research-quality data during each damage investigation via thorough ground surveys and high-resolution satellite, aircraft, and drone imaging. Crowdsourcing through social media is also key to tracking down events. In addition, NTP conducts research to improve our ability to detect and accurately assess tornadoes that affect forests, cropland, and grassland. An open data website allows sharing of resulting datasets and analyses. Pilot investigations were carried out during the warm seasons of 2017 and 2018, with the scope expanding from the detection of any tornadoes in heavily forested regions of central Canada in 2017 to the detection of all EF1+ tornadoes in Ontario plus all significant events outside of Ontario in 2018. The 2019 season was the first full campaign, systematically collecting research-quality tornado data across the entire country. To date, the project has found 89 tornadoes that otherwise would not have been identified, and increased the national tornado count in 2019 by 78%.


2016 ◽  
Vol 12 (2) ◽  
Author(s):  
Aurelie Larquemin ◽  
Jyoti Prasad Mukhopadhyay ◽  
Sharon Buteau

Public entities are among the main producers of socio-economic data around the world. The Open Government Data (OGD) movement encourages these entities to make their data publicly available in order to improve transparency and accountability, which may lead to good governance. Thus, OGD can promote evidence-based public policy by supporting empirical research through making quality data available. In this paper we discuss the current status of the OGD initiative in India, how its principles are considered and applied by the public authorities, and the feedback of the research community about OGD in India.

French abstract (translated): Public institutions are among the main producers of socio-economic data. The Open Government Data movement encourages them, and sometimes assists them, in making their data available to the public in order to improve transparency, which can lead to better governance. Open government data can thus lead to better, results-based public policies by supporting research through the publication of quality data. This paper discusses the situation of open data in India, and its publication and use by public institutions and by the research community.

Spanish abstract (translated): Public institutions are the main producers of socio-economic data. The open government data movement encourages these entities to make their data available to the public in order to improve transparency and governance. Open government data can therefore promote evidence-based public policy by supporting empirical research through making quality data available. This paper discusses the reality of open government data in India, and how its principles are considered and applied by the public authorities and the research community.


2022 ◽  
Vol 14 (1) ◽  
pp. 1-9
Author(s):  
Saravanan Thirumuruganathan ◽  
Mayuresh Kunjir ◽  
Mourad Ouzzani ◽  
Sanjay Chawla

The data and Artificial Intelligence revolution has had a massive impact on enterprises, governments, and society alike. It is fueled by two key factors. First, data have become increasingly abundant and are often available openly. Enterprises have more data than they can process. Governments are spearheading open data initiatives by setting up data portals such as data.gov and releasing large amounts of data to the public. Second, AI engineering development is becoming increasingly democratized. Open source frameworks have enabled even an individual developer to engineer sophisticated AI systems. But with such ease of use comes the potential for irresponsible use of data. Ensuring that AI systems adhere to a set of ethical principles is one of the major problems of our age. We believe that data and model transparency has a key role to play in mitigating the deleterious effects of AI systems. In this article, we describe a framework to synthesize ideas from various domains such as data transparency, data quality, data governance among others to tackle this problem. Specifically, we advocate an approach based on automated annotations (of both data and the AI model), which has a number of appealing properties. The annotations could be used by enterprises to get visibility of potential issues, prepare data transparency reports, create and ensure policy compliance, and evaluate the readiness of data for diverse downstream AI applications. We propose a model architecture and enumerate its key components that could achieve these requirements. Finally, we describe a number of interesting challenges and opportunities.


2017 ◽  
Vol 33 (1-2) ◽  
pp. 203-231
Author(s):  
Antonio Terrone

The study of Buddhist texts can inform us of the way scriptures were composed, as well as illuminate the reasons behind their production. This study examines the phenomenon of borrowing and reusing portions of texts without attributing them to their ‘legitimate authors’ within the Buddhist world of contemporary Tibet. It shows that not only is such a practice not at all infrequent and is often socially accepted, but that it is used in this case as a platform to advance specific claims and promote an explicit agenda. Therefore, rather than considering these as instances of plagiarism, this essay looks at the practice of copying and borrowing as an exercise in intertextuality, intended as the faithful retransmission of ancient truths, and as an indication of the public domain of texts in Tibet.


2019 ◽  
Vol 47 (3) ◽  
pp. 80-91
Author(s):  
V. G. Neiman

This work systematizes and supplements long-existing, but over time distorted and partly lost, qualitative ideas about the role of the thermal and wind factors that determine the physical mechanism of the World Ocean’s General Circulation System (OGCS). The conceptual foundations of OGCS theory appear, in one form or another, in the works of many well-known hydrophysicists of the last century, but a consolidated, logically coherent description of the key factors determining the physical model of the OGCS is not easy to find in the public literature. An attempt is made to clarify and make concrete some general ideas about the two key blocks that form the basis of an adequate physical model of oceanic water-mass motion on a climatic scale. When analyzing the OGCS, it is necessary to take into account not only the direct but also the indirect effects of thermal and wind factors on the ocean surface. In conclusion, the uneven flow of heat to the ocean surface can ultimately explain the nature of both the external and almost all of the internal factors that contribute, in one way or another, to the excitation of the general, or climatic, ocean circulation.

