RPM-Oriented Query Rewriting Framework for E-commerce Keyword-Based Sponsored Search (Student Abstract)

Xiuying Chen; Daorui Xiao; Shen Gao; Guojun Liu; Wei Lin; Bo Zheng; Dongyan Zhao; Rui Yan

doi:10.1609/aaai.v34i10.7156

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i10.7156 ◽

2020 ◽

Vol 34 (10) ◽

pp. 13769-13770

Author(s):

Xiuying Chen ◽

Daorui Xiao ◽

Shen Gao ◽

Guojun Liu ◽

Wei Lin ◽

...

Keyword(s):

Statistical Models ◽

Large Scale ◽

Query Rewriting ◽

A New Right-Skewed Upside Down Bathtub Shaped Heavy-tailed Distribution and its Applications

Journal of Modern Applied Statistical Methods ◽

10.22237/jmasm/1608552600 ◽

2021 ◽

Vol 19 (1) ◽

Author(s):

Sandeep Kumar Maurya ◽

Sanjay K Singh ◽

Umesh Singh

Keyword(s):

Maximum Likelihood ◽

Real Data ◽

Statistical Properties ◽

New Right ◽

Data Sets ◽

Proposed Model ◽

Heavy Tailed Distribution ◽

Heavy Tailed

A one-parameter, right-skewed, upside-down bathtub-shaped, heavy-tailed distribution is derived. Its statistical properties and maximum likelihood estimation are studied. Five real data sets are analyzed with four different models to illustrate the suitability of the proposed model.

Modified Lomax model: a heavy-tailed distribution for fitting large-scale real-world complex networks

Social Network Analysis and Mining ◽

10.1007/s13278-021-00751-1 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Swarup Chattopadhyay ◽

Tanujit Chakraborty ◽

Kuntal Ghosh ◽

Asit K. Das

Keyword(s):

Complex Networks ◽

Real World ◽

Large Scale ◽

Heavy Tailed Distribution ◽

Heavy Tailed

Model and Method for Contributor’s Quality Assessment in Community Image Tagging Systems

Information and Control Systems ◽

10.31799/1684-8853-2018-4-45-51 ◽

2018 ◽

pp. 45-51

Author(s):

A. V. Ponomarev

Keyword(s):

Large Scale ◽

Wide Spectrum ◽

Preference Relation ◽

Pairwise Comparison ◽

Ground Truth ◽

Comparison Method ◽

Characteristic Matrix ◽

Image Tagging ◽

Proposed Model

Introduction: Large-scale human-computer systems that involve people of varying skill and motivation in information processing are currently used in a wide spectrum of applications. An acute problem in such systems is assessing the expected quality of each contributor, for example, in order to penalize incompetent or inaccurate contributors and to promote diligent ones. Purpose: To develop a method of assessing the expected quality of contributors in community tagging systems. The method should use only the generally unreliable and incomplete information provided by contributors (with ground truth tags unknown). Results: A mathematical model is proposed for community image tagging (including a model of a contributor), along with a method of assessing the expected quality of contributors. The method is based on comparing tag sets provided by different contributors for the same images; it is a modification of the pairwise comparison method in which the preference relation is replaced by a special domination characteristic. The expected quality of contributors is evaluated as a positive eigenvector of a pairwise domination characteristic matrix. Community tagging simulation has confirmed that the proposed method adequately estimates the expected quality of community tagging system contributors (provided that the contributors' behavior fits the proposed model). Practical relevance: The obtained results can be used in the development of systems based on the coordinated efforts of a community (primarily, community tagging systems).
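
The eigenvector step mentioned in the abstract above can be illustrated with a minimal sketch. It assumes the pairwise domination characteristic matrix has strictly positive entries, so that power iteration converges to its positive (Perron) eigenvector. The abstract does not define the domination characteristic, so the matrix `D`, its values, and the function name `principal_eigenvector` are hypothetical, and power iteration is a generic technique chosen for illustration rather than the author's stated procedure.

```python
def principal_eigenvector(matrix, iterations=200, tol=1e-12):
    """Approximate the positive eigenvector of a positive square matrix
    by power iteration; the result is scaled so its entries sum to 1."""
    n = len(matrix)
    v = [1.0 / n] * n
    for _ in range(iterations):
        # Multiply the matrix by the current vector.
        w = [sum(matrix[i][j] * v[j] for j in range(n)) for i in range(n)]
        # Rescale so that the entries sum to 1.
        total = sum(w)
        w = [x / total for x in w]
        # Stop once the vector no longer changes noticeably.
        if max(abs(a - b) for a, b in zip(w, v)) < tol:
            return w
        v = w
    return v


# Hypothetical matrix for three contributors (values invented for illustration):
# entry D[i][j] > 1 is read here as "contributor i dominates contributor j".
D = [
    [1.0, 2.0, 4.0],
    [0.5, 1.0, 2.0],
    [0.25, 0.5, 1.0],
]

quality = principal_eigenvector(D)
ranking = sorted(range(len(quality)), key=lambda i: quality[i], reverse=True)
```

Under these assumptions, `quality` holds one score per contributor and `ranking` lists contributor indices from highest to lowest estimated quality.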

Multi Disease-Prediction Framework Using Hybrid Deep Learning: An Optimal Prediction Model (Preprint)

10.2196/preprints.22865 ◽

2020 ◽

Author(s):

Anusha Ampavathi ◽

Vijaya Saradhi T

Keyword(s):

Feature Extraction ◽

Big Data ◽

Deep Learning ◽

Weight Function ◽

Optimization Algorithm ◽

Large Scale ◽

Heuristic Algorithms ◽

Disease Prediction ◽

Health Care Decisions ◽

Proposed Model

Big data and its approaches are generally helpful to the healthcare and biomedical sectors for predicting disease. For trivial symptoms, it is difficult to meet a doctor at the hospital at any given time. Big data can therefore provide essential information about diseases on the basis of the patient’s symptoms. For several medical organizations, disease prediction is important for making the best feasible health care decisions. Conversely, the conventional medical care model takes structured input, which requires more accurate and consistent prediction. This paper aims to develop multi-disease prediction using the improvised deep learning concept. Datasets pertaining to “Diabetes, Hepatitis, lung cancer, liver tumor, heart disease, Parkinson’s disease, and Alzheimer’s disease” are gathered from the benchmark UCI repository for the experiment. The proposed model involves three phases: (a) data normalization, (b) weighted normalized feature extraction, and (c) prediction. Initially, the dataset is normalized to bring each attribute's range to a certain level. Weighted feature extraction is then performed, in which each attribute value is multiplied by a weight function to produce large-scale deviation. The weight function is optimized using a combination of two meta-heuristic algorithms, termed the Jaya Algorithm-based Multi-Verse Optimization algorithm (JA-MVO). The optimally extracted features are passed to hybrid deep learning algorithms, namely “Deep Belief Network (DBN) and Recurrent Neural Network (RNN)”. As a modification to the hybrid deep learning architecture, the weights of both the DBN and the RNN are optimized using the same hybrid optimization algorithm. A comparative evaluation of the proposed prediction against existing models confirms its effectiveness across various performance measures.
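
The first two phases described above (normalization, then multiplying each attribute by a weight) can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not name the normalization scheme, so min-max scaling is assumed here; the weights are fixed hypothetical numbers, whereas the paper obtains them with JA-MVO, which is not reproduced; and the records and function names are invented.

```python
def min_max_normalize(rows):
    """Scale every attribute (column) to the range [0, 1].
    Min-max scaling is an assumption; the abstract does not specify the scheme."""
    columns = list(zip(*rows))
    lows = [min(c) for c in columns]
    highs = [max(c) for c in columns]
    return [
        [
            (x - lo) / (hi - lo) if hi > lo else 0.0  # constant columns map to 0
            for x, lo, hi in zip(row, lows, highs)
        ]
        for row in rows
    ]


def weight_features(rows, weights):
    """Multiply each attribute value by the weight assigned to its column."""
    return [[w * x for w, x in zip(weights, row)] for row in rows]


# Hypothetical records with two attributes each (values invented for illustration).
records = [[50.0, 120.0], [60.0, 180.0], [70.0, 150.0]]

# Hypothetical fixed weights; in the paper these come from the JA-MVO optimizer.
weights = [0.5, 2.0]

normalized = min_max_normalize(records)
weighted = weight_features(normalized, weights)
```

The `weighted` rows would then be the input to the prediction phase, which is outside the scope of this sketch.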

Cyberstalking Victimization Model Using Criminological Theory: A Systematic Literature Review, Taxonomies, Applications, Tools, and Validations

Electronics ◽

10.3390/electronics10141670 ◽

2021 ◽

Vol 10 (14) ◽

pp. 1670

Author(s):

Waheeb Abu-Ulbeh ◽

Maryam Altalhi ◽

Laith Abualigah ◽

Abdulwahab Ali Almazroi ◽

Putra Sumari ◽

...

Keyword(s):

Data Analysis ◽

Structural Equation ◽

Large Scale ◽

Review Paper ◽

Essential Element ◽

Routine Activities ◽

Criminological Theory ◽

Equation Modeling ◽

Future Research ◽

Proposed Model

Cyberstalking is a growing anti-social problem that is spreading on a large scale and in various forms. Cyberstalking detection has become increasingly popular in recent years and has been investigated technically by many researchers. However, cyberstalking victimization, an essential part of cyberstalking, has received less empirical attention from the research community. This paper attempts to address this gap and develops a model to understand and estimate the prevalence of cyberstalking victimization. The model is built on routine activities and lifestyle exposure theories and includes eight hypotheses. The data were collected from 757 respondents in Jordanian universities. The paper takes a quantitative approach and uses structural equation modeling for data analysis. The results revealed a modest prevalence range that depends on the cyberstalking type. The results also indicated that proximity to motivated offenders, suitable targets, and digital guardians significantly influence cyberstalking victimization. Moderation hypothesis testing demonstrated that age and residence have a significant effect on cyberstalking victimization. The proposed model is an essential element for assessing cyberstalking victimization in societies and provides a valuable understanding of its prevalence. This can assist researchers and practitioners in future research on cyberstalking victimization.

Why ability point estimates can be pointless: a primer on using skill measures from large-scale assessments in secondary analyses

Measurement Instruments for the Social Sciences ◽

10.1186/s42409-020-00020-5 ◽

2021 ◽

Vol 3 (1) ◽

Author(s):

Clemens M. Lechner ◽

Nivedita Bhaktha ◽

Katharina Groskurth ◽

Matthias Bluemke

Keyword(s):

Measurement Error ◽

Statistical Models ◽

Test Scores ◽

Large Scale ◽

Equation Modeling ◽

Model Parameters ◽

Advantages And Disadvantages ◽

Point Estimates ◽

Secondary Analyses ◽

Large Scale Assessments

Measures of cognitive or socio-emotional skills from large-scale assessment surveys (LSAS) are often based on advanced statistical models and scoring techniques unfamiliar to applied researchers. Consequently, applied researchers working with data from LSAS may be uncertain about the assumptions and computational details of these statistical models and scoring techniques and about how best to incorporate the resulting skill measures in secondary analyses. The present paper is intended as a primer for applied researchers. After a brief introduction to the key properties of skill assessments, we give an overview of the three principal methods with which secondary analysts can incorporate skill measures from LSAS in their analyses: (1) as test scores (i.e., point estimates of individual ability), (2) through structural equation modeling (SEM), and (3) in the form of plausible values (PVs). We discuss the advantages and disadvantages of each method based on three criteria: fallibility (i.e., control for measurement error and unbiasedness), usability (i.e., ease of use in secondary analyses), and immutability (i.e., consistency of test scores, PVs, or measurement model parameters across different analyses and analysts). We show that although none of the methods is optimal under all criteria, methods that result in a single point estimate of each respondent’s ability (i.e., all types of “test scores”) are rarely optimal for research purposes. Instead, approaches that avoid or correct for measurement error, especially PV methodology, stand out as the method of choice. We conclude with practical recommendations for secondary analysts and data-producing organizations.
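
To make the contrast with single point estimates concrete, here is a minimal sketch of how results from several plausible values are commonly pooled. It uses the standard combining rules for multiply imputed data (Rubin's rules); the abstract itself does not spell out this procedure, and the function name and all numbers below are hypothetical. The analysis is run once per plausible value, and the resulting estimates and sampling variances are then combined.

```python
from statistics import mean, variance


def pool_plausible_values(estimates, sampling_variances):
    """Combine per-plausible-value results with Rubin's rules.

    estimates: the same statistic computed once per plausible value.
    sampling_variances: the squared standard error from each of those runs.
    Returns the pooled estimate and its total variance."""
    m = len(estimates)
    pooled = mean(estimates)
    within = mean(sampling_variances)  # average sampling variance
    between = variance(estimates)      # variance across plausible values (divisor m - 1)
    total = within + (1 + 1 / m) * between
    return pooled, total


# Hypothetical regression coefficients from five plausible values.
estimates = [0.30, 0.34, 0.32, 0.36, 0.28]
# Hypothetical squared standard errors from the same five analyses.
sampling_variances = [0.010, 0.010, 0.010, 0.010, 0.010]

pooled_estimate, total_variance = pool_plausible_values(estimates, sampling_variances)
standard_error = total_variance ** 0.5
```

The between-PV term is what a single test score leaves out: it carries the measurement uncertainty of the skill measure into the standard error of the final estimate.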

Cloud-based intelligent self-diagnosis and department recommendation service using Chinese medical BERT

Journal of Cloud Computing Advances Systems and Applications ◽

10.1186/s13677-020-00218-2 ◽

2021 ◽

Vol 10 (1) ◽

Author(s):

Junshu Wang ◽

Guoming Zhang ◽

Wei Wang ◽

Ka Zhang ◽

Yehua Sheng

Keyword(s):

Cloud Computing ◽

Large Scale ◽

Medical Service ◽

Rapid Development ◽

Medical Knowledge ◽

Language Models ◽

Computing Environment ◽

Computing Power ◽

Cloud Computing Environment ◽

Proposed Model

With the rapid development of hospital informatization and Internet medical services in recent years, most hospitals have launched online appointment registration systems to remove patient queues and improve the efficiency of medical services. However, most patients lack professional medical knowledge and do not know which department to choose when registering. To guide patients in seeking medical care and registering effectively, we proposed CIDRS, an intelligent self-diagnosis and department recommendation framework based on Chinese medical Bidirectional Encoder Representations from Transformers (BERT) in the cloud computing environment. We also established a Chinese BERT model (CHMBERT) trained on a large-scale Chinese medical text corpus. This model was used to optimize the self-diagnosis and department recommendation tasks. To overcome the limited computing power of terminals, we deployed the proposed framework in a cloud computing environment based on container and micro-service technologies. Real-world medical datasets from hospitals were used in the experiments, and the results showed that the proposed model outperformed traditional deep learning models and other pre-trained language models.

Assessing the Response of Snow Avalanche Runout Altitudes to Climate Fluctuations Using Hierarchical Modeling: Application to 61 Winters of Data in France

Journal of Climate ◽

10.1175/2010jcli3312.1 ◽

2010 ◽

Vol 23 (12) ◽

pp. 3157-3180 ◽

Cited By ~ 42

Author(s):

N. Eckert ◽

H. Baya ◽

M. Deschatres

Keyword(s):

Climate Change ◽

Large Scale ◽

Hierarchical Modeling ◽

Control Measures ◽

Level Shift ◽

Snow Avalanches ◽

Winter Climate ◽

Climate Fluctuations ◽

High Magnitude ◽

Proposed Model

Snow avalanches are natural hazards strongly controlled by the mountain winter climate, but their recent response to climate change has thus far been poorly documented. In this paper, hierarchical modeling is used to obtain robust indexes of the annual fluctuations of runout altitudes. The proposed model includes a possible level shift, and distinguishes common large-scale signals in both mean- and high-magnitude events from the interannual variability. Application to the data available in France over the last 61 winters shows that the mean runout altitude is not different now than it was 60 yr ago, but that snow avalanches have been retreating since 1977. This trend is of particular note for high-magnitude events, which have seen their probability rates halved, a crucial result in terms of hazard assessment. Avalanche control measures, observation errors, and model limitations are insufficient explanations for these trends. On the other hand, strong similarities in the pattern of behavior of the proposed runout indexes and several climate datasets are shown, as well as a consistent evolution of the preferred flow regime. The proposed runout indexes may therefore be usable as indicators of climate change at high altitudes.

An Attention-Based Model Using Character Composition of Entities in Chinese Relation Extraction

Information ◽

10.3390/info11020079 ◽

2020 ◽

Vol 11 (2) ◽

pp. 79 ◽

Cited By ~ 2

Author(s):

Xiaoyu Han ◽

Yue Zhang ◽

Wenkai Zhang ◽

Tinglei Huang

Keyword(s):

Language Processing ◽

Large Scale ◽

Named Entity Recognition ◽

Relation Extraction ◽

Entity Recognition ◽

Additional Information ◽

Named Entity ◽

Proposed Model ◽

The Relationship ◽

Crucial Part

Relation extraction is a vital task in natural language processing. It aims to identify the relationship between two specified entities in a sentence. Besides the information contained in the sentence, additional information about the entities has been shown to help relation extraction. Additional information such as the entity type obtained by NER (Named Entity Recognition) and descriptions provided by a knowledge base both have their limitations. Nevertheless, there is another way to provide additional information that can overcome these limitations in Chinese relation extraction. Because Chinese characters usually have explicit meanings and can carry more information than English letters, we suggest that the characters that constitute the entities can provide additional information that is helpful for the relation extraction task, especially on large-scale datasets. This assumption has never been verified before. The main obstacle is the lack of large-scale Chinese relation datasets. In this paper, first, we generate a large-scale Chinese relation extraction dataset based on a Chinese encyclopedia. Second, we propose an attention-based model using the characters that compose the entities. The results on the generated dataset show that these characters can provide useful information for the Chinese relation extraction task. By using this information, the attention mechanism can recognize the crucial part of the sentence that expresses the relation. The proposed model outperforms other baseline models on our Chinese relation extraction dataset.

On the role of the anthropocentric factor in the realization of the derivational potential of names of ungulates as motivating lexical units

Sibirskiy filologicheskiy zhurnal ◽

10.17223/18137083/76/15 ◽

2021 ◽

pp. 191-210

Author(s):

Nikolay D. Golev ◽

Irina P. Falomkina

Keyword(s):

Native Speakers ◽

Russian Language ◽

Search System ◽

Proposed Model ◽

Data Capturing ◽

Building Behavior ◽

Building System ◽

Google Search ◽

The Russian Language

The paper describes the word-building system of the Russian language in terms of its vocabulary. It discusses the lexical factors that influence the formation of lexical units’ potential as motivating units of word-building processes and relations, and the realization of this potential in language activities. Of most interest to the authors are anthropocentric determinants, most of which coordinate the lexical system and, through its mediation, the word-building system with the worldview of native speakers of the Russian language. The proposed model of derivational development of vocabulary provides such coordination by studying the deep-seated process of conceptualization of the words that are the potential motivators of neologisms. This study identifies word frequency as an external manifestation of conceptualization. The frequency data were obtained from Google search statistics. Because it captures not only usual but also occasional and potential words, this source is an effective tool for studying word-building processes and their results. This study has revealed the interrelation between the language worldview of native speakers of Russian and their “word-building behavior” in language activities. The worldview has been found to be determined, first of all, by the pragmatic factor, which primarily influences the usage of a word in speech, as reflected by its frequency. Frequency ranks lexical units by their derivational potential and thereby provides a researcher with a reliable instrument for its study.
