scholarly journals Offensive keyword extraction based on the attention mechanism of BERT and the eigenvector centrality using a graph representation

Author(s):  
Gretel Liz De la Peña Sarracén ◽  
Paolo Rosso

AbstractThe proliferation of harmful content on social media affects a large part of the user community. Therefore, several approaches have emerged to control this phenomenon automatically. However, this is still a quite challenging task. In this paper, we explore the offensive language as a particular case of harmful content and focus our study in the analysis of keywords in available datasets composed of offensive tweets. Thus, we aim to identify relevant words in those datasets and analyze how they can affect model learning. For keyword extraction, we propose an unsupervised hybrid approach which combines the multi-head self-attention of BERT and a reasoning on a word graph. The attention mechanism allows to capture relationships among words in a context, while a language model is learned. Then, the relationships are used to generate a graph from what we identify the most relevant words by using the eigenvector centrality. Experiments were performed by means of two mechanisms. On the one hand, we used an information retrieval system to evaluate the impact of the keywords in recovering offensive tweets from a dataset. On the other hand, we evaluated a keyword-based model for offensive language detection. Results highlight some points to consider when training models with available datasets.

2020 ◽  
pp. 1-10
Author(s):  
Colin J. McMahon ◽  
Justin T. Tretter ◽  
Theresa Faulkner ◽  
R. Krishna Kumar ◽  
Andrew N. Redington ◽  
...  

Abstract Objective: This study investigated the impact of the Webinar on deep human learning of CHD. Materials and methods: This cross-sectional survey design study used an open and closed-ended questionnaire to assess the impact of the Webinar on deep learning of topical areas within the management of the post-operative tetralogy of Fallot patients. This was a quantitative research methodology using descriptive statistical analyses with a sequential explanatory design. Results: One thousand-three-hundred and seventy-four participants from 100 countries on 6 continents joined the Webinar, 557 (40%) of whom completed the questionnaire. Over 70% of participants reported that they “agreed” or “strongly agreed” that the Webinar format promoted deep learning for each of the topics compared to other standard learning methods (textbook and journal learning). Two-thirds expressed a preference for attending a Webinar rather than an international conference. Over 80% of participants highlighted significant barriers to attending conferences including cost (79%), distance to travel (49%), time commitment (51%), and family commitments (35%). Strengths of the Webinar included expertise, concise high-quality presentations often discussing contentious issues, and the platform quality. The main weakness was a limited time for questions. Just over 53% expressed a concern for the carbon footprint involved in attending conferences and preferred to attend a Webinar. Conclusion: E-learning Webinars represent a disruptive innovation, which promotes deep learning, greater multidisciplinary participation, and greater attendee satisfaction with fewer barriers to participation. Although Webinars will never fully replace conferences, a hybrid approach may reduce the need for conferencing, reduce carbon footprint. and promote a “sustainable academia”.


2021 ◽  
Vol 128 (1) ◽  
Author(s):  
Michael J. Negus ◽  
Matthew R. Moore ◽  
James M. Oliver ◽  
Radu Cimpeanu

AbstractThe high-speed impact of a droplet onto a flexible substrate is a highly non-linear process of practical importance, which poses formidable modelling challenges in the context of fluid–structure interaction. We present two approaches aimed at investigating the canonical system of a droplet impacting onto a rigid plate supported by a spring and a dashpot: matched asymptotic expansions and direct numerical simulation (DNS). In the former, we derive a generalisation of inviscid Wagner theory to approximate the flow behaviour during the early stages of the impact. In the latter, we perform detailed DNS designed to validate the analytical framework, as well as provide insight into later times beyond the reach of the proposed analytical model. Drawing from both methods, we observe the strong influence that the mass of the plate, resistance of the dashpot, and stiffness of the spring have on the motion of the solid, which undergo forced damped oscillations. Furthermore, we examine how the plate motion affects the dynamics of the droplet, predominantly through altering its internal hydrodynamic pressure distribution. We build on the interplay between these techniques, demonstrating that a hybrid approach leads to improved model and computational development, as well as result interpretation, across multiple length and time scales.


2021 ◽  
Vol 13 (5) ◽  
pp. 905
Author(s):  
Chuyi Wu ◽  
Feng Zhang ◽  
Junshi Xia ◽  
Yichen Xu ◽  
Guoqing Li ◽  
...  

The building damage status is vital to plan rescue and reconstruction after a disaster and is also hard to detect and judge its level. Most existing studies focus on binary classification, and the attention of the model is distracted. In this study, we proposed a Siamese neural network that can localize and classify damaged buildings at one time. The main parts of this network are a variety of attention U-Nets using different backbones. The attention mechanism enables the network to pay more attention to the effective features and channels, so as to reduce the impact of useless features. We train them using the xBD dataset, which is a large-scale dataset for the advancement of building damage assessment, and compare their result balanced F (F1) scores. The score demonstrates that the performance of SEresNeXt with an attention mechanism gives the best performance, with the F1 score reaching 0.787. To improve the accuracy, we fused the results and got the best overall F1 score of 0.792. To verify the transferability and robustness of the model, we selected the dataset on the Maxar Open Data Program of two recent disasters to investigate the performance. By visual comparison, the results show that our model is robust and transferable.


Author(s):  
Meysam Goodarzi ◽  
Darko Cvetkovski ◽  
Nebojsa Maletic ◽  
Jesús Gutiérrez ◽  
Eckhard Grass

AbstractClock synchronization has always been a major challenge when designing wireless networks. This work focuses on tackling the time synchronization problem in 5G networks by adopting a hybrid Bayesian approach for clock offset and skew estimation. Furthermore, we provide an in-depth analysis of the impact of the proposed approach on a synchronization-sensitive service, i.e., localization. Specifically, we expose the substantial benefit of belief propagation (BP) running on factor graphs (FGs) in achieving precise network-wide synchronization. Moreover, we take advantage of Bayesian recursive filtering (BRF) to mitigate the time-stamping error in pairwise synchronization. Finally, we reveal the merit of hybrid synchronization by dividing a large-scale network into local synchronization domains and applying the most suitable synchronization algorithm (BP- or BRF-based) on each domain. The performance of the hybrid approach is then evaluated in terms of the root mean square errors (RMSEs) of the clock offset, clock skew, and the position estimation. According to the simulations, in spite of the simplifications in the hybrid approach, RMSEs of clock offset, clock skew, and position estimation remain below 10 ns, 1 ppm, and 1.5 m, respectively.


2021 ◽  
Vol 13 (11) ◽  
pp. 2172
Author(s):  
Sarah Carter ◽  
Martin Herold ◽  
Inge Jonckheere ◽  
Andres Espejo ◽  
Carly Green ◽  
...  

Four workshops and a webinar series were organized, with the aim of building capacity in countries to use Earth Observation Remote Sensing data to monitor forest cover changes and measure emissions reductions for REDD+ results-based payments. Webinars and workshops covered a variety of relevant tools and methods. The initiative was collaboratively organised by a number of Global Forest Observations Initiative (GFOI) partner institutions with funding from the World Bank’s Forest Carbon Partnership Facility (FCPF). The collaborative approach with multiple partners proved to be efficient and was able to reach a large audience, particularly in the case of the webinars. However, the impact in terms of use of tools and training of others after the events was higher for the workshops. In addition, engagement with experts was higher from workshop participants. In terms of efficiency, webinars are significantly cheaper to organize. A hybrid approach might be considered for future initiatives; and, this study of the effectiveness of both in-person and online capacity building can guide the development of future initiatives, something that is particularly pertinent in a COVID-19 era.


2015 ◽  
Vol 8 (7) ◽  
pp. 2153-2165 ◽  
Author(s):  
C. E. Ivey ◽  
H. A. Holmes ◽  
Y. T. Hu ◽  
J. A. Mulholland ◽  
A. G. Russell

Abstract. An integral part of air quality management is knowledge of the impact of pollutant sources on ambient concentrations of particulate matter (PM). There is also a growing desire to directly use source impact estimates in health studies; however, source impacts cannot be directly measured. Several limitations are inherent in most source apportionment methods motivating the development of a novel hybrid approach that is used to estimate source impacts by combining the capabilities of receptor models (RMs) and chemical transport models (CTMs). The hybrid CTM–RM method calculates adjustment factors to refine the CTM-estimated impact of sources at monitoring sites using pollutant species observations and the results of CTM sensitivity analyses, though it does not directly generate spatial source impact fields. The CTM used here is the Community Multiscale Air Quality (CMAQ) model, and the RM approach is based on the chemical mass balance (CMB) model. This work presents a method that utilizes kriging to spatially interpolate source-specific impact adjustment factors to generate revised CTM source impact fields from the CTM–RM method results, and is applied for January 2004 over the continental United States. The kriging step is evaluated using data withholding and by comparing results to data from alternative networks. Data withholding also provides an estimate of method uncertainty. Directly applied (hybrid, HYB) and spatially interpolated (spatial hybrid, SH) hybrid adjustment factors at withheld observation sites had a correlation coefficient of 0.89, a linear regression slope of 0.83 ± 0.02, and an intercept of 0.14 ± 0.02. Refined source contributions reflect current knowledge of PM emissions (e.g., significant differences in biomass burning impact fields). Concentrations of 19 species and total PM2.5 mass were reconstructed for withheld observation sites using HYB and SH adjustment factors. The mean concentrations of total PM2.5 at withheld observation sites were 11.7 (± 8.3), 16.3 (± 11), 8.59 (± 4.7), and 9.2 (± 5.7) μg m−3 for the observations, CTM, HYB, and SH predictions, respectively. Correlations improved for concentrations of major ions, including nitrate (CMAQ–DDM (decoupled direct method): 0.404, SH: 0.449), ammonium (CMAQ–DDM: 0.454, SH: 0.492), and sulfate (CMAQ–DDM: 0.706, SH: 0.730). Errors in simulated concentrations of metals were reduced considerably: 295 % (CMAQ–DDM) to 139 % (SH) for vanadium; and 1340 % (CMAQ–DDM) to 326 % (SH) for manganese. Errors in simulated concentrations of some metals are expected to remain given the uncertainties in source profiles. Species concentrations were reconstructed using SH results, and the error relative to observed concentrations was greatly reduced as compared to CTM-simulated concentrations. Results demonstrate that the hybrid method along with a spatial extension can be used for large-scale, spatially resolved source apportionment studies where observational data are spatially and temporally limited.


2019 ◽  
Vol 14 (1) ◽  
pp. 175-198 ◽  
Author(s):  
S.M.T. Fatemi Ghomi ◽  
B. Asgarian

PurposeFinding a rational approach to maintain a freshness of foods and perishable goods and saving their intrinsic attributes during a distribution of these products is one of the main issues for distribution and logistics companies. This paper aims to provide a framework for distribution of perishable goods which can be applied for real life situations.Design/methodology/approachThis paper proposes a novel mathematical model for transportation inventory location routing problem. In addition, the paper addresses the impact of perishable goods age on the demand of final customers. The model is optimally solved for small- and medium-scale problems. Moreover, regarding to NP-hard nature of the proposed model, two simple and one hybrid metaheuristic algorithms are developed to cope with the complexity of problem in large scale problems.FindingsNumerical examples with different scenarios and sensitivity analysis are conducted to investigate the performance of proposed algorithms and impacts of important parameters on optimal solutions. The results show the acceptable performance of proposed algorithms.Originality/valueThe authors formulate a novel mathematical model which can be applicable in perishable goods distribution systems In this regard, the authors consider lost sale which is proportional to age of products. A new hybrid approach is applied to tackle the problem and the results show the rational performance of the algorithm.


Author(s):  
Robespierre Pita ◽  
Clicia Pinto ◽  
Marcos Barreto ◽  
Samila Sena ◽  
Rosemeire Fiaccone ◽  
...  

ABSTRACTBackground and aimsA cooperation Brazil-UK was set in mid-2013 aiming at to build a huge cohort comprised by individuals registered in CadastroÚnico (CADU), a socioeconomic database used in social programmes of the Brazilian government. Epidemiologists and statisticians wish to assess the impact of Bolsa Família (PBF), a conditional cash transfer programme, on the incidence of several diseases (tuberculosis, leprosy, HIV etc). The cohort must contain all individuals who received at least one payment from PBF between 2007 and 2012, which results in a 100-million records according to our preliminary analysis. These individuals must be probabilistically linked with databases from the Unified Health System (SUS), such as hospitalization (SIH), notifiable diseases (SINAN), mortality (SIM), live births (SINASC), to produce data marts (domain-specific data) to the proposed studies. Within this cooperation, our first goal was to design and evaluate probabilistic methods to routine link the cohort, PBF, and SUS outcomes. ApproachWe implemented two probabilistic linkage methods: a full probabilistic, based on the Dice similarity (Sorensen index) of Bloom filters; and an hybrid approach, based on rules to deterministic and probabilistic matching. We performed linkages involving CADU (2011 extraction) and SUS outcomes (SIH, SINAN, and SIM) with samples from 3 states (Sergipe, Santa Catarina and Bahia) with an increasing size (from 1,447,512 to 12,036,010). ResultsUsing a Dice between 0.90 and 0.92, our methods retrieved more than 95% of true positive pairs amongst the linked pairs. For Sergipe, we obtained as <linked pairs,true positives>: <23,22>, <315,300>, <32,32>, respectively for SIH, SINAN, and SIM. For Bahia: <771,593>, <2677,2626>, <208,207>. Another linkage between CADU (1,447,512 records) and SINAN (624 records), for tuberculosis in Sergipe, returned 397 (full probabilistic) and 311 (hybrid) linked pairs, being 306 and 300 true positives. Another execution considering CADU (1,988,599 records) and SINAN (2,094 records), for tuberculosis in Santa Catarina, returned 791 (full probabilistic) and 500 (hybrid) linked pairs, with 667 and 472 true positives. Linking CADU (1.685,697 records) and SIM, for mortality of children under-4, returned 18 linked pairs, all of them true positives, for a Dice between 0.90 and 0.92 and with 100% of sensitivity, specificity, and positive predictive value. ConclusionDue to the absence of gold standards, we use samples with increasing sizes and manual review when adequate. Our results are quite accurate, although obtained with an unique extraction of CADU. We are starting to run linkages with the entire cohort.


2021 ◽  
Vol 1 (1) ◽  
pp. 20-29
Author(s):  
Iskandarsyah Siregar ◽  
Firlii Rahmadiyah ◽  
Alisha Firiska Qatrunnada Siregar

Every human being tries to communicate what he wants to say to whatever or whomever he wants. Dysarthria is a condition in which the muscles in humans that are active when speaking become weak or difficult to control. Problems or speech disorders experienced by a child with dysarthria are obstacles to children's social and personal adjustment. Schoolchildren who mispronounce the words will feel ashamed and alien from others. This problem motivates the presence of Multisensory Stimulation therapy to help improve and even restore speech problems or disorders experienced by children with dysarthria. This study tries to explain the impact of Multisensory Stimulation therapy and then evaluates the results of the application of Multisensory Stimulation therapy to children with dysarthria. The study that took five sufferers as the object of this study used a hybrid approach that mutually used a qualitative and quantitative perspective. The type of research used is classroom action research. This study concluded that the participants' enthusiasm greatly influenced the process and outcome of therapy.


2021 ◽  
Author(s):  
Maryam DehghanChenary ◽  
Arman Ferdowsi ◽  
Fariborz Jolai ◽  
Reza Tavakkoli-Moghaddam

<pre>The focus of this paper is to propose a bi-objective mathematical model for a new extension of a multi-period p-mobile hub location problem and then to devise an algorithm for solving it. The developed model considers the impact of the time spent traveling at the hubs' network, the time spent at hubs for processing the flows, and the delay caused by congestion at hubs with specific capacities. Following the unveiled model, a hybrid meta-heuristic algorithm will be devised that simultaneously takes advantage of a novel evaluation function, a clustering technique, and a genetic approach for solving the proposed model.</pre>


Sign in / Sign up

Export Citation Format

Share Document