Abusers indoors and coronavirus outside: an examination of public discourse about COVID-19 and family violence on Twitter using machine learning

Author(s):  
Jia Xue ◽  
Junxiang Chen ◽  
Chen Chen ◽  
Ran Hu ◽  
Tingshao Zhu

Purpose: This brief report aims to provide the first large-scale analysis of public discourse regarding family violence and the COVID-19 pandemic on Twitter. Method: We analyzed 301,606 tweets related to family violence and COVID-19 posted from April 12 to July 16, 2020. We used a machine learning approach, Latent Dirichlet Allocation, to identify salient themes, topics, and representative tweets. Results: We extracted nine themes on family violence and the COVID-19 pandemic: (1) the impact of COVID-19 on family violence (e.g., rising rates, increased hotline calls, murder and homicide); (2) types of family violence (e.g., child abuse, domestic violence, sexual violence); (3) forms of family violence (e.g., physical aggression, coercive control); (4) risk factors of family violence (e.g., alcohol abuse, financial constraints, guns, quarantine); (5) victims of family violence (e.g., LGBTQ, women, women of color, children); (6) social services for family violence (e.g., hotlines, social workers, confidential services, shelters, funding); (7) law enforcement response (e.g., 911 calls, police arrest, protective orders, abuse reports); (8) social movements and awareness (e.g., support victims, raise awareness); and (9) domestic violence-related news (e.g., Tara Reade, Melissa DeRosa). Conclusions: COVID-19 has an impact on family violence. This report addresses a limitation of existing scholarship, which lacks data on the consequences of COVID-19 for family violence. We contribute to the understanding of family violence during the pandemic by providing surveillance via tweets, which is essential for identifying potentially effective policy programs that offer targeted support for victims and survivors and for preparing for the next wave.
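The topic-extraction step named in the abstract, Latent Dirichlet Allocation, can be illustrated with a minimal sketch. The abstract does not say which implementation or settings the authors used, so everything below is an assumption for illustration: the collapsed Gibbs sampler, the function name `lda_gibbs`, the hyperparameters `alpha` and `beta`, and the toy token lists (which are not taken from the Twitter dataset).

```python
import random


def lda_gibbs(docs, num_topics, iters=200, alpha=0.1, beta=0.01, n_top=3, seed=0):
    """Fit LDA to tokenized documents with collapsed Gibbs sampling.

    Returns (top_words, theta): the n_top most frequent words per topic and
    the smoothed per-document topic proportions.
    """
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    vocab_size = len(vocab)

    doc_topic = [[0] * num_topics for _ in docs]              # tokens in doc d assigned to topic k
    topic_word = [[0] * vocab_size for _ in range(num_topics)]  # times word v is assigned to topic k
    topic_total = [0] * num_topics                            # total tokens assigned to topic k
    assignments = []

    # Random initial topic assignment for every token.
    for d, doc in enumerate(docs):
        z_doc = []
        for w in doc:
            k = rng.randrange(num_topics)
            z_doc.append(k)
            doc_topic[d][k] += 1
            topic_word[k][word_id[w]] += 1
            topic_total[k] += 1
        assignments.append(z_doc)

    for _ in range(iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                v = word_id[w]
                k = assignments[d][i]
                # Remove the current token from the counts.
                doc_topic[d][k] -= 1
                topic_word[k][v] -= 1
                topic_total[k] -= 1
                # Conditional probability of each topic for this token (unnormalized).
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][v] + beta)
                    / (topic_total[t] + vocab_size * beta)
                    for t in range(num_topics)
                ]
                k = rng.choices(range(num_topics), weights=weights)[0]
                # Add the token back under its newly sampled topic.
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][v] += 1
                topic_total[k] += 1

    top_words = [
        [vocab[v] for v in sorted(range(vocab_size), key=lambda v: -topic_word[k][v])[:n_top]]
        for k in range(num_topics)
    ]
    theta = [
        [(c + alpha) / (len(doc) + num_topics * alpha) for c in counts]
        for counts, doc in zip(doc_topic, docs)
    ]
    return top_words, theta


# Hypothetical toy corpus, for illustration only.
toy_docs = [
    ["hotline", "calls", "shelter", "funding"],
    ["police", "arrest", "protective", "order"],
    ["shelter", "hotline", "funding", "support"],
    ["arrest", "police", "report", "order"],
]
top_words, theta = lda_gibbs(toy_docs, num_topics=2)
for k, words in enumerate(top_words):
    print(f"topic {k}: {words}")
```

In a study like this one, the per-topic word lists would then be read and grouped by hand into themes; that interpretive step is not automated by the sketch.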

2020 ◽  
Author(s):  
Jia Xue ◽  
Junxiang Chen ◽  
Chen Chen ◽  
Ran Hu ◽  
Tingshao Zhu

BACKGROUND Family violence (including intimate partner violence/domestic violence, child abuse, and elder abuse) is a hidden pandemic happening alongside COVID-19. The rates of family violence are rising fast, and women and children are disproportionately affected and vulnerable during this time. OBJECTIVE This study aims to provide a large-scale analysis of public discourse on family violence and the COVID-19 pandemic on Twitter. METHODS We analyzed over 1 million tweets related to family violence and COVID-19 from April 12 to July 16, 2020. We used the machine learning approach Latent Dirichlet Allocation and identified salient themes, topics, and representative tweets. RESULTS We extracted 9 themes from 1,015,874 tweets on family violence and the COVID-19 pandemic: (1) increased vulnerability: COVID-19 and family violence (eg, rising rates, increases in hotline calls, homicide); (2) types of family violence (eg, child abuse, domestic violence, sexual abuse); (3) forms of family violence (eg, physical aggression, coercive control); (4) risk factors linked to family violence (eg, alcohol abuse, financial constraints, guns, quarantine); (5) victims of family violence (eg, the LGBTQ [lesbian, gay, bisexual, transgender, and queer or questioning] community, women, women of color, children); (6) social services for family violence (eg, hotlines, social workers, confidential services, shelters, funding); (7) law enforcement response (eg, 911 calls, police arrest, protective orders, abuse reports); (8) social movements and awareness (eg, support victims, raise awareness); and (9) domestic violence–related news (eg, Tara Reade, Melissa DeRosa). CONCLUSIONS This study overcomes limitations in the existing scholarship where data on the consequences of COVID-19 on family violence are lacking. We contribute to understanding family violence during the pandemic by providing surveillance via tweets. This is essential for identifying potentially useful policy programs that can offer targeted support for victims and survivors as we prepare for future outbreaks.


10.2196/24361 ◽  
2020 ◽  
Vol 22 (11) ◽  
pp. e24361
Author(s):  
Jia Xue ◽  
Junxiang Chen ◽  
Chen Chen ◽  
Ran Hu ◽  
Tingshao Zhu

Background Family violence (including intimate partner violence/domestic violence, child abuse, and elder abuse) is a hidden pandemic happening alongside COVID-19. The rates of family violence are rising fast, and women and children are disproportionately affected and vulnerable during this time. Objective This study aims to provide a large-scale analysis of public discourse on family violence and the COVID-19 pandemic on Twitter. Methods We analyzed over 1 million tweets related to family violence and COVID-19 from April 12 to July 16, 2020. We used the machine learning approach Latent Dirichlet Allocation and identified salient themes, topics, and representative tweets. Results We extracted 9 themes from 1,015,874 tweets on family violence and the COVID-19 pandemic: (1) increased vulnerability: COVID-19 and family violence (eg, rising rates, increases in hotline calls, homicide); (2) types of family violence (eg, child abuse, domestic violence, sexual abuse); (3) forms of family violence (eg, physical aggression, coercive control); (4) risk factors linked to family violence (eg, alcohol abuse, financial constraints, guns, quarantine); (5) victims of family violence (eg, the LGBTQ [lesbian, gay, bisexual, transgender, and queer or questioning] community, women, women of color, children); (6) social services for family violence (eg, hotlines, social workers, confidential services, shelters, funding); (7) law enforcement response (eg, 911 calls, police arrest, protective orders, abuse reports); (8) social movements and awareness (eg, support victims, raise awareness); and (9) domestic violence–related news (eg, Tara Reade, Melissa DeRosa). Conclusions This study overcomes limitations in the existing scholarship where data on the consequences of COVID-19 on family violence are lacking. We contribute to understanding family violence during the pandemic by providing surveillance via tweets. This is essential for identifying potentially useful policy programs that can offer targeted support for victims and survivors as we prepare for future outbreaks.


Author(s):  
Ioannis T. Georgiou

Abstract This work presents a data-driven exploratory study of the physics of the dynamics of a physical structure with complicated geometry. The geometric complexity of the physical system makes the typical single-sensor acceleration signal too complicated to interpret physically. We need the spatial dimension to resolve the single sensory signal over its entire time horizon. We therefore introduce the spatial dimension through the canonical eight-dimensional data cloud (Canonical 8D-Data Cloud) concept and build methods to explore the impact-induced free dynamics of physically complex mechanical structures. The complex structure in this study is a large-scale aluminum alloy plate stiffened by a frame made of T-section beams. The Canonical 8D-Data Cloud is identified with the simultaneous acceleration measurements from eight piezoelectric sensors equally spaced and attached on the periphery of a circular material curve drawn on the uniform surface of the stiffened plate. The Data Cloud approach leads to a systematic exploration, discovery, and quantification of uncertainty in this physically complex structure. Considerable uncertainty is found to stem from the sensitivity of the transient dynamics to the parameters of space-time localized force pulses, which are used to diagnose the presence of structural anomalies. The Data Cloud approach leads to aspects of machine learning such as reduced dynamics analytics of big sensory data by means of heavily machine-assisted computations, which carry out the unparalleled data reduction analysis enabled by the Advanced Proper Orthogonal Decomposition Transform. Emphasized is the connection between the characteristic geometric features of high-dimensional datasets as a whole, the Data Cloud, and the modal physics of the dynamics.
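For orientation, the standard (textbook) proper orthogonal decomposition of an eight-channel record can be written as follows. This is a sketch of ordinary POD via the singular value decomposition, not of the "Advanced" transform named in the abstract, whose details the abstract does not give. The symbols $X$, $N$, $u_i$, $\sigma_i$, and $E_r$ are notation assumed here.

```latex
% Data matrix: row j holds the N acceleration samples of sensor j (j = 1, ..., 8).
X \in \mathbb{R}^{8 \times N}, \qquad
X = U \Sigma V^{\top} = \sum_{i=1}^{8} \sigma_i \, u_i v_i^{\top}, \qquad
\sigma_1 \ge \sigma_2 \ge \dots \ge \sigma_8 \ge 0 .

% u_i: i-th POD mode (spatial shape over the eight sensors);
% \sigma_i v_i^{\top}: its time-dependent amplitude.
% Fraction of signal energy captured by the first r modes:
E_r = \frac{\sum_{i=1}^{r} \sigma_i^{2}}{\sum_{i=1}^{8} \sigma_i^{2}} .
```

Keeping only the first few modes, chosen so that $E_r$ is close to 1, gives a reduced description of the measured dynamics.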


Author(s):  
Sangeeta Lal ◽  
Neetu Sardana ◽  
Ashish Sureka

Log statements present in source code provide important information to the software developers because they are useful in various software development activities such as debugging, anomaly detection, and remote issue resolution. Most of the previous studies on logging analysis and prediction provide insights and results after analyzing only a few code constructs. In this chapter, the authors perform an in-depth, focused, and large-scale analysis of logging code constructs at two levels: the file level and catch-blocks level. They answer several research questions related to statistical and content analysis. Statistical and content analysis reveals the presence of differentiating properties among logged and nonlogged code constructs. Based on these findings, the authors propose a machine-learning-based model for catch-blocks logging prediction. The machine-learning-based model is found to be effective in catch-blocks logging prediction.


Metabolites ◽  
2020 ◽  
Vol 10 (10) ◽  
pp. 381
Author(s):  
Lisa Eisenbeiss ◽  
Tina M. Binz ◽  
Markus R. Baumgartner ◽  
Thomas Kraemer ◽  
Andrea E. Steuer

Untargeted metabolomic studies are used for large-scale analysis of endogenous compounds. Due to the exceptionally long detection windows of incorporated substances in hair, analysis of hair samples for retrospective monitoring of metabolome changes has recently been introduced. However, information on the general behavior of metabolites in hair samples is scarce, which has so far hampered correct data interpretation. The presented study aimed to investigate endogenous metabolites depending on hair color and along the hair strand and to propose recommendations for best practice in hair metabolomic studies. A selection of metabolites was analyzed using untargeted data acquisition in genuine hair samples of different hair colors and after segmentation into 3 cm segments. Significant differences in metabolites among hair colors and segments were found. In conclusion, hair color and hair segments must be considered in hair metabolomic studies, and recommendations for best practice in such studies are proposed.


2020 ◽  
Author(s):  
Alex Sun ◽  
Bridget Scanlon ◽  
Himanshu Save ◽  
Ashraf Rateb

The GRACE satellite mission and its follow-on, GRACE-FO, have provided unprecedented opportunities to quantify the impact of climate extremes and human activities on total water storage at large scales. The approximately one-year data gap between the two GRACE missions needs to be filled to maintain data continuity and maximize mission benefits. There is strong interest in using machine learning (ML) algorithms to reconstruct GRACE-like data to fill this gap. So far, most studies attempted to train and select a single ML algorithm to work for global basins. However, hydrometeorological predictors may exhibit strong spatial variability which, in turn, may affect the performance of ML models. Existing studies have already shown that no single algorithm consistently outperformed others over all global basins. In this study, we applied an automated machine learning (AutoML) workflow to perform GRACE data reconstruction. AutoML represents a new paradigm for optimal model structure selection, hyperparameter tuning, and model ensemble stacking, addressing some of the most challenging issues related to ML applications. We demonstrated the AutoML workflow over the conterminous U.S. (CONUS) using six types of ML algorithms and multiple groups of meteorological and climatic variables as predictors. Results indicate that the AutoML-assisted gap filling achieved satisfactory performance over the CONUS. For the testing period (2014/06–2017/06), the mean gridwise Nash-Sutcliffe efficiency is around 0.85, the mean correlation coefficient is around 0.95, and the mean normalized root-mean square error is about 0.09. Trained models maintain good performance when extrapolating to the mission gap and to GRACE-FO periods (after 2017/06). Results further suggest that no single algorithm provides the best predictive performance over the entire CONUS, stressing the importance of using an end-to-end workflow to train, optimize, and combine multiple machine learning models to deliver robust performance, especially when building large-scale hydrological prediction systems and when predictor importance exhibits strong spatial variability.
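The three skill scores reported in the abstract can be computed for a single grid cell as in the sketch below. The function names and the sample series are hypothetical. The abstract does not state how the root-mean-square error was normalized, so normalizing by the observed range is an assumption made here.

```python
import math


def nash_sutcliffe(obs, sim):
    """Nash-Sutcliffe efficiency: 1 is a perfect fit, 0 matches the observed mean."""
    mean_obs = sum(obs) / len(obs)
    residual = sum((o - s) ** 2 for o, s in zip(obs, sim))
    variance = sum((o - mean_obs) ** 2 for o in obs)
    return 1.0 - residual / variance


def pearson_r(obs, sim):
    """Pearson correlation coefficient between observed and simulated series."""
    mean_obs = sum(obs) / len(obs)
    mean_sim = sum(sim) / len(sim)
    cov = sum((o - mean_obs) * (s - mean_sim) for o, s in zip(obs, sim))
    var_obs = sum((o - mean_obs) ** 2 for o in obs)
    var_sim = sum((s - mean_sim) ** 2 for s in sim)
    return cov / math.sqrt(var_obs * var_sim)


def normalized_rmse(obs, sim):
    """RMSE divided by the observed range (assumed normalization)."""
    rmse = math.sqrt(sum((o - s) ** 2 for o, s in zip(obs, sim)) / len(obs))
    return rmse / (max(obs) - min(obs))


# Hypothetical monthly storage anomalies for one grid cell (arbitrary units).
observed = [1.0, 2.0, 3.0, 4.0, 5.0]
simulated = [1.2, 1.9, 3.1, 3.8, 5.3]
print(nash_sutcliffe(observed, simulated))
print(pearson_r(observed, simulated))
print(normalized_rmse(observed, simulated))
```

A "mean gridwise" score, as quoted in the abstract, would be the average of such per-cell values over all grid cells.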


10.2196/23957 ◽  
2021 ◽  
Vol 23 (2) ◽  
pp. e23957
Author(s):  
Chengda Zheng ◽  
Jia Xue ◽  
Yumin Sun ◽  
Tingshao Zhu

Background During the COVID-19 pandemic in Canada, Prime Minister Justin Trudeau provided updates on the novel coronavirus and the government’s responses to the pandemic in his daily briefings from March 13 to May 22, 2020, delivered on the official Canadian Broadcasting Corporation (CBC) YouTube channel. Objective The aim of this study was to examine comments on Canadian Prime Minister Trudeau’s COVID-19 daily briefings by YouTube users and track these comments to extract the changing dynamics of the opinions and concerns of the public over time. Methods We used machine learning techniques to longitudinally analyze a total of 46,732 English YouTube comments that were retrieved from 57 videos of Prime Minister Trudeau’s COVID-19 daily briefings from March 13 to May 22, 2020. A natural language processing model, latent Dirichlet allocation, was used to choose salient topics among the sampled comments for each of the 57 videos. Thematic analysis was used to classify and summarize these salient topics into different prominent themes. Results We found 11 prominent themes, including strict border measures, public responses to Prime Minister Trudeau’s policies, essential work and frontline workers, individuals’ financial challenges, rental and mortgage subsidies, quarantine, government financial aid for enterprises and individuals, personal protective equipment, Canada and China’s relationship, vaccines, and reopening. Conclusions This study is the first to longitudinally investigate public discourse and concerns related to Prime Minister Trudeau’s daily COVID-19 briefings in Canada. This study contributes to establishing a real-time feedback loop between the public and public health officials on social media. Hearing and reacting to real concerns from the public can enhance trust between the government and the public to prepare for future health emergencies.


2021 ◽  
Author(s):  
Sven Hilbert ◽  
Stefan Coors ◽  
Elisabeth Barbara Kraus ◽  
Bernd Bischl ◽  
Mario Frei ◽  
...  

Classical statistical methods are limited in the analysis of high-dimensional datasets. Machine learning (ML) provides a powerful framework for prediction by using complex relationships, often encountered in modern data with a large number of variables, cases and potentially non-linear effects. ML has turned into one of the most influential analytical approaches of this millennium and has recently become popular in the behavioral and social sciences. The impact of ML methods on research and practical applications in the educational sciences is still limited, but continuously grows as larger and more complex datasets become available through massive open online courses (MOOCs) and large-scale investigations. The educational sciences are at a crucial pivot point because of the anticipated impact ML methods hold for the field. Here, we review the opportunities and challenges of ML for the educational sciences, show how a look at related disciplines can help in learning from their experiences, and argue for a philosophical shift in model evaluation. We demonstrate how the overall quality of data analysis in educational research can benefit from these methods and show how ML can play a decisive role in the validation of empirical models. In this review, we (1) provide an overview of the types of data suitable for ML, (2) give practical advice for the application of ML methods, and (3) show how ML-based tools and applications can be used to enhance the quality of education. Additionally, we provide practical R code with exemplary analyses, available at https://osf.io/ntre9/?view_only=d29ae7cf59d34e8293f4c6bbde3e4ab2.


2019 ◽  
Vol 2019 ◽  
pp. 1-30 ◽  
Author(s):  
Dongdong Lv ◽  
Shuhan Yuan ◽  
Meizi Li ◽  
Yang Xiang

Investors trade stocks according to forecasts of stock price trends. In recent years, many researchers have focused on adopting machine learning (ML) algorithms to predict stock price trends. However, their studies were carried out on small stock datasets with limited features, short backtesting periods, and no consideration of transaction cost, and their experimental results lack statistical significance tests. In this paper, we comprehensively evaluate various ML algorithms on large-scale stock datasets and observe the daily trading performance of stocks with and without transaction cost. In particular, we use two large datasets of 424 S&P 500 index component stocks (SPICS) and 185 CSI 300 index component stocks (CSICS) from 2010 to 2017 and compare six traditional ML algorithms and six advanced deep neural network (DNN) models on these two datasets, respectively. The experimental results demonstrate that traditional ML algorithms perform better on most of the directional evaluation indicators. Unexpectedly, the performance of some traditional ML algorithms is not much worse than that of the best DNN models when transaction cost is not considered. Moreover, the trading performance of all ML algorithms is sensitive to changes in transaction cost. Compared with the traditional ML algorithms, DNN models perform better when transaction cost is considered. Meanwhile, transparent and implicit transaction costs affect trading performance differently. Our conclusions are significant for choosing the best algorithm for stock trading in different markets.
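The sensitivity of trading performance to transaction cost can be made concrete with a small sketch. The trading rule here is an assumption, not the paper's backtesting protocol: the strategy holds the stock on days with a predicted "up" signal (1), stays in cash otherwise (0), and pays a proportional cost `cost_rate` each time the position changes. All names and numbers are hypothetical.

```python
def strategy_return(daily_returns, signals, cost_rate=0.0):
    """Cumulative return of a long/flat strategy with proportional trading costs.

    daily_returns[i] is the stock's simple return on day i; signals[i] is 1 to
    hold the stock that day and 0 to hold cash. A fraction cost_rate of wealth
    is paid whenever the position changes (assumed cost model).
    """
    wealth = 1.0
    position = 0  # start in cash
    for r, s in zip(daily_returns, signals):
        if s != position:
            wealth *= 1.0 - cost_rate  # pay for entering or leaving the position
            position = s
        if position == 1:
            wealth *= 1.0 + r
    return wealth - 1.0


# Hypothetical three-day example.
returns = [0.01, -0.02, 0.03]
signals = [1, 0, 1]
print(strategy_return(returns, signals))                  # no transaction cost
print(strategy_return(returns, signals, cost_rate=0.002))  # 0.2% per position change
```

A model that flips its signal often pays this cost more often, which is one reason a model with higher directional accuracy does not necessarily have better net trading performance.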

