sequence modeling
Recently Published Documents


TOTAL DOCUMENTS

195
(FIVE YEARS 115)

H-INDEX

15
(FIVE YEARS 6)

2022 ◽  
Vol 40 (3) ◽  
pp. 1-30
Author(s):  
Procheta Sen ◽  
Debasis Ganguly ◽  
Gareth J. F. Jones

Reducing user effort in finding relevant information is one of the key objectives of search systems. Existing approaches have been shown to effectively exploit the context from the current search session of users for automatically suggesting queries to reduce their search efforts. However, these approaches do not accomplish the end goal of a search system—that of retrieving a set of potentially relevant documents for the evolving information need during a search session. This article takes the problem of query prediction one step further by investigating the problem of contextual recommendation within a search session. More specifically, given the partial context information of a session in the form of a small number of queries, we investigate how a search system can effectively predict the documents that a user would have been presented with had he continued the search session by submitting subsequent queries. To address the problem, we propose a model of contextual recommendation that seeks to capture the underlying semantics of information need transitions of a current user’s search context. This model leverages information from a number of past interactions of other users with similar interactions from an existing search log. To identify similar interactions, as a novel contribution, we propose an embedding approach that jointly learns representations of both individual query terms and also those of queries (in their entirety) from a search log data by leveraging session-level containment relationships. Our experiments conducted on a large query log, namely the AOL, demonstrate that using a joint embedding of queries and their terms within our proposed framework of document retrieval outperforms a number of text-only and sequence modeling based baselines.


Author(s):  
Jamil Ahmad ◽  
Abdul Khader Jilani Saudagar ◽  
Khalid Mahmood Malik ◽  
Waseem Ahmad ◽  
Muhammad Badruddin Khan ◽  
...  

The highly rapid spread of the current pandemic has quickly overwhelmed hospitals all over the world and motivated extensive research to address a wide range of emerging problems. The unforeseen influx of COVID-19 patients to hospitals has made it inevitable to deploy a rapid and accurate triage system, monitor progression, and predict patients at higher risk of deterioration in order to make informed decisions regarding hospital resource management. Disease detection in radiographic scans, severity estimation, and progression and prognosis prediction have been extensively studied with the help of end-to-end methods based on deep learning. The majority of recent works have utilized a single scan to determine severity or predict progression of the disease. In this paper, we present a method based on deep sequence learning to predict improvement or deterioration in successive chest X-ray scans and build a mathematical model to determine individual patient disease progression profile using successive scans. A deep convolutional neural network pretrained on a diverse lung disease dataset was used as a feature extractor to generate the sequences. We devised three strategies for sequence modeling in order to obtain both fine-grained and coarse-grained features and construct sequences of different lengths. We also devised a strategy to quantify positive or negative change in successive scans, which was then combined with age-related risk factors to construct disease progression profile for COVID-19 patients. The age-related risk factors allowed us to model rapid deterioration and slower recovery in older patients. Experiments conducted on two large datasets showed that the proposed method could accurately predict disease progression. With the best feature extractor, the proposed method was able to achieve AUC of 0.98 with the features obtained from radiographs. Furthermore, the proposed patient profiling method accurately estimated the health profile of patients.


Webology ◽  
2021 ◽  
Vol 18 (2) ◽  
pp. 154-165
Author(s):  
Abbas Atwan Mhawes ◽  
Ahmed Yousif Falih Saedi ◽  
Ali Talib Qasim Al-Aqbi ◽  
Lamees Abdalhasan Salman

Data mining is characterized as a quest for useful knowledge via large quantities of data. Some basic and most common techniques for data extraction are association rules, grouping, clustering, estimation, sequence modeling. For a wide range of applications, data mining techniques are used. Techniques of data analysis are essential to the preparation and implementation of the administration of the learning system, including behavioral guidance and personal behavior appraisal. The article applies data analytical methods to the role of student classification. Several tests are used for the interpretation of the findings. In keeping with the methodology proposed in the paper, the classification using cognitive skills provides more detailed results than the findings of other study published. Five algorithms were used (J48, Naïve Bayes, Multilayer Perception, K Star and SMO). This essay discusses and measures the application of the various algorithms so that factors affecting the success and failure of students can be identified, student performance can be estimated, and the significant consequences of the mathematics system for the second university year can be identified. However the number of exams can be minimized using data mining techniques. In terms of time and consequences, this shortened analysis plays a key role.


2021 ◽  
Author(s):  
Vinayak Gupta ◽  
Srikanta Bedathur

A large fraction of data generated via human activities such as online purchases, health records, spatial mobility etc. can be represented as continuous-time event sequences (CTES) i.e. sequences of discrete events over a continuous time. Learning neural models over CTES is a non-trivial task as it involves modeling the ever-increasing event timestamps, inter-event time gaps, event types, and the influences between different events within and across different sequences. Moreover, existing sequence modeling techniques consider a complete observation scenario i.e. the event sequence being modeled is completely observed with no missing events – an ideal setting that is rarely applicable in real-world applications. In this paper, we highlight our approach[8] for modeling CTES with intermittent observations. Buoyed by the recent success of neural marked temporal point processes (MTPP) for modeling the generative distribution of CTES, we provide a novel unsupervised model and inference method for learning MTPP in presence of event sequences with missing events. Specifically, we first model the generative processes of observed events and missing events using two MTPP, where the missing events are represented as latent random variables. Then, we devise an unsupervised training method that jointly learns both the MTPP using variational inference. Experiments across real-world datasets show that our modeling framework outperforms state-of-the-art techniques for future event prediction and imputation. This work appeared in AISTATS 2021.


Author(s):  
Rana Riad K. AL-Taie ◽  
Basma Jumaa Saleh ◽  
Ahmed Yousif Falih Saedi ◽  
Lamees Abdalhasan Salman

Data mining is defined as a search through large amounts of data for valuable information. The association rules, grouping, clustering, prediction, sequence modeling is some essential and most general strategies for data extraction. The processing of data plays a major role in the healthcare industry's disease detection. A variety of disease evaluations should be required to diagnose the patient. However, using data mining strategies, the number of examinations should be decreased. This decreased examination plays a crucial role in terms of time and results. Heart disease is a death-provoking disorder. In this recent instance, health issues are immense because of the availability of health issues and the grouping of various situations. Today, secret information is important in the healthcare industry to make decisions. For the prediction of cardiovascular problems, (Weka 3.8.3) tools for this analysis are used for the prediction of data extraction algorithms like sequential minimal optimization (SMO), multilayer perceptron (MLP), random forest and Bayes net. The data collected combine the prediction accuracy results, the receiver operating characteristic (ROC) curve, and the PRC value. The performance of Bayes net (94.5%) and random forest (94%) technologies indicates optimum performance rather than the sequential minimal optimization (SMO) and multilayer perceptron (MLP) methods.


Author(s):  
Jiaxuan Wang ◽  
Chaoyi Wang ◽  
Yang Hua ◽  
Tao Song ◽  
Zhengui Xue ◽  
...  

This paper acts as a comprehensive analytical study of natural language processing (NLP) and provides a briefing of the most prominent astounding reforms of the field over a good chunk of time. It covers even the future research insights and most relevant features, which act as a result of the discussed concepts or research, until this paper's reading point. This paper starts with covering the most basic concepts of text cleaning, such as tokenization, the importance of stop words, etc., to concepts such as sequence modeling, speech recognition, the effect of quantum computing concepts in Natural Language Processing, and so on. The current development of deep neural networks, which is the current trend in artificial intelligence, always gives NLP a cutting-edge technology, also covered in this paper. This paper will also emphasize that it covers the broad area of explanations to the concepts to guide learners or researchers to have an excellent overall understanding of the field.


Sign in / Sign up

Export Citation Format

Share Document