scholarly journals What is Data Science? An Operational Definition based on Text Mining of Data Science Curricula

2021 ◽  
Vol 1 (1) ◽  
Author(s):  
Zhiyong Zhang ◽  

Data science has maintained its popularity for about 20 years. This study adopts a bottom-up approach to understand what data science is by analyzing the descriptions of courses offered by the data science programs in the United States. Through topic modeling, 14 topics are identified from the current curricula of 56 data science programs. These topics reiterate that data science is at the intersection of statistics, computer science, and substantive fields.

Author(s):  
Trefor Williams ◽  
John Betak ◽  
Bridgette Findley

The National Transportation Safety Board in the United States and the Transportation Safety Board of Canada publish reports about major railroad accidents. The text from these accident reports were analyzed using the text mining techniques of probabilistic topic modeling and k-means clustering to identify the recurring themes in major railroad accidents. The output from these analyses indicates that the railroad accidents can be successfully grouped into different topics. The output also suggests that recurring accident types are track defects, wheel defects, grade crossing accidents, and switching accidents. A major difference between the Canadian and U.S. reports is the finding that accidents related to bridges are found to be more prominent in the Canadian reports.


Author(s):  
Senqi Zhang ◽  
Li Sun ◽  
Daiwei Zhang ◽  
Pin Li ◽  
Yue Liu ◽  
...  

AbstractBackgroundMental health illness is a growing problem in recent years. During the COVID-19 pandemic, mental health concerns (such as fear and loneliness) have been actively discussed on social media.ObjectiveIn this study, we aim to examine mental health discussions on Twitter during the COVID-19 pandemic in the United States and infer the demographic composition of Twitter users who had mental health concerns.MethodsCOVID-19 related tweets from March 5th, 2020 to January 31st, 2021 were collected through Twitter streaming API using COVID-19 related keywords (e.g., “corona”, “covid19”, “covid”). By further filtering using mental health keywords (e.g., “depress”, “failure”, “hopeless”), we extracted mental health-related tweets from the US. Topic modeling using the Latent Dirichlet Allocation model was conducted to monitor users’ discussions surrounding mental health concerns. Demographic inference using deep learning algorithms (including Face++ and Ethnicolr) was performed to infer the demographic composition of Twitter users who had mental health concerns during the COVID-19 pandemic.ResultsWe observed a positive correlation between mental health concerns on Twitter and the COVID-19 pandemic in the US. Topic modeling showed that “stay-at-home”, “death poll” and “politics and policy” were the most popular topics in COVID-19 mental health tweets. Among Twitter users who had mental health concerns during the pandemic, Males, White, and 30-49 age group people were more likely to express mental health concerns. In addition, Twitter users from the east and west coast had more mental health concerns.ConclusionsThe COVID-19 pandemic has a significant impact on mental health concerns on Twitter in the US. Certain groups of people (such as Males, White) were more likely to have mental health concerns during the COVID-19 pandemic.


2022 ◽  
pp. 193-203

The purpose of this chapter is to analyze and review trends in digital terminology. The chapter begins by examining the origins of computerization in the United States during the 19th and 20th centuries. Next, the chapter examines the key concepts punch cards and computer science. The chapter then discusses how the term computer science is misleading. This is followed by a discussion of how information technology became the most popular term in the US. The chapter then switches focus to Europe and discusses France's promotion of informatique as well as Europe's switch from informatics to ICT. Next, the chapter considers how the internet has given rise to terms like e-commerce. The chapter concludes by considering the transition from ICT to digital informing and informing technology.


2015 ◽  
pp. 918-933
Author(s):  
Eric P. Jiang

With the rapid growth of the Internet and telecommunication networks, computer technology has been a driving force in global economic development and in advancing many areas in science, engineering, health care, business, and finance that carry significant impacts on people and society. As a primary source for producing the workforce of software engineers, computer scientists and information technology specialists, computer science education plays a particularly important role in modern economic growth and it has been invested heavily in many countries around the world. This chapter provides a comparative study of undergraduate computer science programs between China and the United States. The study focuses on the current curricula of computer science programs. It in part is based on the author's direct observation from his recent visits to several universities in China and the conversations he had with administrators and faculty of computer science programs at the universities. It is also based on the author's over two decades experience as a computer science educator at several public and private American institutions of higher educations. The education systems in China and the United States have different features and each of the systems has its strengths and weaknesses. This is likely also true for education systems in other countries. It would be an interesting and important task for us to explore an innovative computer science education program, which perhaps blends the best features of different systems and helps better prepare graduates for the challenges working in an increasingly globalized world. We hope the study presented in this chapter provides some useful insights in this direction.


Author(s):  
Patrick Letouze ◽  
David N. Prata

In 2012, the internet advertising revenue in the United States of America reached a total of 36.6 billion dollars, a growth of 15.2% when compared to 2011. The efficiency of a marketing strategy relies in the ability to understand and to direct the consumers' desires. In this work, the authors propose an approach that combines the Internet-Based Information Consumer Theory (IBICT) with semiotics to bring consumers' desires to e-Market. Hence, we present IBICT's framework as a collective network set based on a semiotic human-machine approach. For implementation purposes, we propose a text mining architecture towards IBICT's framework, which leads to an IBICT's architecture, and an Interdisciplinary Research Project Management (IRPM) approach to determine IBICT's dimensions.


Author(s):  
Lech J. Janczewski ◽  
Andrew M. Colarik

The current state of the information security domain in the United States and much of the rest of the industrialized world can best be characterized as overly optimistic. The protection of computing systems and telecommunication infrastructures from unauthorized usage, manipulation, and sabotage faces serious challenges to ensure ongoing serviceability. This is especially true when we consider our growing dependence on these infrastructures. The state of affairs regarding the security aspects of these systems is even worse. Peter G. Neumann of the Computer Science Laboratory at SRI International in Menlo Park, California states:


2003 ◽  
Vol 5 (1) ◽  
pp. 74-101 ◽  
Author(s):  
Howard Elcock

Regionalism: the development of governments and governance structures intermediate between state and local levels has become increasingly significant in the government of both Britain and the United States. Functional issues concerned with the regeneration of rustbelt areas or controlling growth in prosperous areas have resulted in searches for regionalist solutions on both sides of the Atlantic. However, in Britain there are additional pressures from regions with distinctive cultures, as well as from the increasingly influential ‘Europe of the Regions’. Demands for regional government and governance may be generated from the bottom up by a region's politicians, business leaders and others but they are unlikely to be successful unless they are encouraged by higher levels of government, at state, national or supranational levels.


Sign in / Sign up

Export Citation Format

Share Document