I Know What You Need: Investigating Document Retrieval Effectiveness with Partial Session Contexts

2022 ◽  
Vol 40 (3) ◽  
pp. 1-30
Author(s):  
Procheta Sen ◽  
Debasis Ganguly ◽  
Gareth J. F. Jones

Reducing user effort in finding relevant information is one of the key objectives of search systems. Existing approaches have been shown to effectively exploit the context from a user's current search session to suggest queries automatically and so reduce search effort. However, these approaches do not accomplish the end goal of a search system, namely retrieving a set of potentially relevant documents for the evolving information need during a search session. This article takes the problem of query prediction one step further by investigating the problem of contextual recommendation within a search session. More specifically, given the partial context information of a session in the form of a small number of queries, we investigate how a search system can effectively predict the documents that a user would have been presented with had they continued the search session by submitting subsequent queries. To address the problem, we propose a model of contextual recommendation that seeks to capture the underlying semantics of information need transitions in the current user's search context. This model leverages information from past interactions of other users who had similar interactions recorded in an existing search log. To identify similar interactions, as a novel contribution, we propose an embedding approach that jointly learns representations of both individual query terms and whole queries from search log data by leveraging session-level containment relationships. Our experiments, conducted on a large query log (the AOL log), demonstrate that using a joint embedding of queries and their terms within our proposed document retrieval framework outperforms a number of text-only and sequence-modeling-based baselines.

2021 ◽  
pp. 016555152110184
Author(s):  
Gunjan Chandwani ◽  
Anil Ahlawat ◽  
Gaurav Dubey

Document retrieval plays an important role in knowledge management because it helps users discover relevant information in existing data. This article proposes a cluster-based inverted indexing algorithm for document retrieval. First, pre-processing removes unnecessary and redundant words from the documents. The documents are then indexed by the cluster-based inverted indexing algorithm, which integrates the piecewise fuzzy C-means (piFCM) clustering algorithm with inverted indexing. Once the documents are indexed, user queries are matched using the Bhattacharyya distance. Finally, query optimisation is performed using the Pearson correlation coefficient, and the relevant documents are retrieved. The performance of the proposed algorithm is analysed on the WebKB and Twenty Newsgroups data sets. The analysis shows that the proposed algorithm achieves a precision of 1, a recall of 0.70 and an F-measure of 0.8235. The proposed document retrieval system retrieves the most relevant documents and speeds up the storing and retrieval of information.
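As a quick sanity check, the reported F-measure is consistent with the harmonic mean of the reported precision and recall, and the Bhattacharyya distance named in the abstract can be sketched for two discrete term distributions. This is a minimal illustration only: the function names are hypothetical, and the abstract does not say how the authors represent queries and documents, so treating them as normalised term distributions is an assumption.

```python
import math


def f_measure(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (the F1 score)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def bhattacharyya_distance(p, q):
    """Bhattacharyya distance between two discrete distributions.

    Assumes p and q are aligned sequences of non-negative weights,
    each summing to 1 (for example, normalised term frequencies).
    """
    # Bhattacharyya coefficient: overlap between the two distributions.
    coefficient = sum(math.sqrt(pi * qi) for pi, qi in zip(p, q))
    if coefficient == 0:
        return math.inf  # no shared mass at all
    # Clamp at zero to absorb floating-point rounding when p == q.
    return max(0.0, -math.log(coefficient))


# Precision and recall as reported in the abstract.
print(round(f_measure(1.0, 0.70), 4))  # 0.8235
```

A smaller distance means the query distribution overlaps more with the document (or cluster) distribution; identical distributions give a distance of zero.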


2020 ◽  
pp. 619-637
Author(s):  
Yogesh Kumar Meena ◽  
Dinesh Gopalani

Automatic Text Summarization (ATS) saves users time when retrieving information relevant to their needs from voluminous big data. Text summaries are sensitive to scoring methods, as most methods require features to be weighted for sentence scoring. In this chapter, various statistical features proposed by researchers for extractive automatic text summarization are explored. Features that perform well under ROUGE evaluation measures are termed best features and are used to create feature combinations. The best-performing feature combinations are then identified, and their performance on short, medium and large documents is evaluated using the same ROUGE measures.


2020 ◽  
Vol 10 (12) ◽  
pp. 4316 ◽  
Author(s):  
Ivan Boban ◽  
Alen Doko ◽  
Sven Gotovac

Sentence retrieval is an information retrieval technique that aims to find sentences corresponding to an information need. It is used for tasks like question answering (QA) or novelty detection. Since it is similar to document retrieval but with a smaller unit of retrieval, methods for document retrieval, such as term frequency-inverse document frequency (TF-IDF), BM25, and language modeling-based methods, are also used for sentence retrieval. The effect of partial matching of words on sentence retrieval has not been analyzed. We think that there is substantial potential for improving sentence retrieval methods if we consider this approach. We adapted TF-ISF, BM25, and language modeling-based methods to test the partial matching of terms by combining sentence retrieval with sequence similarity, which allows matching of words that are similar but not identical. All tests were conducted using data from the novelty tracks of the Text Retrieval Conference (TREC). The scope of this paper was to find out whether such an approach is generally beneficial to sentence retrieval. However, we did not examine in depth how partial matching helps or hinders the finding of relevant sentences.
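The idea of partial matching can be sketched as a TF-ISF scorer in which a sentence word counts toward a query term whenever the two are similar enough. This is a hedged illustration, not the authors' method: the abstract does not name the similarity measure, so Python's `difflib.SequenceMatcher` is swapped in, and the scoring formula, the `threshold` parameter and the function names are all assumptions.

```python
import math
from difflib import SequenceMatcher


def is_match(query_term, word, threshold):
    """Treat two words as matching when their sequence similarity
    (difflib ratio, an assumed stand-in) reaches the threshold."""
    return SequenceMatcher(None, query_term, word).ratio() >= threshold


def tf_isf_score(query, sentence, sentences, threshold=1.0):
    """Score one tokenised sentence against a tokenised query.

    Uses one simple TF-ISF form: log(tf + 1) * log((n + 1) / (0.5 + sf)),
    where tf and sf are counted with partial matching.
    threshold=1.0 reduces to exact matching.
    """
    n = len(sentences)
    score = 0.0
    for term in query:
        # Term frequency in this sentence, counting similar words.
        tf = sum(is_match(term, w, threshold) for w in sentence)
        if tf == 0:
            continue
        # Sentence frequency: sentences containing a (partial) match.
        sf = sum(any(is_match(term, w, threshold) for w in s) for s in sentences)
        score += math.log(tf + 1) * math.log((n + 1) / (0.5 + sf))
    return score


# Toy collection, invented for illustration.
sentences = [["retrieving", "documents"], ["cooking", "pasta"], ["weather", "today"]]
query = ["retrieval"]
print(tf_isf_score(query, sentences[0], sentences, threshold=1.0))
print(tf_isf_score(query, sentences[0], sentences, threshold=0.7))
```

With exact matching the query term "retrieval" finds nothing in the first sentence, while a threshold of 0.7 lets it match "retrieving" (their similarity ratio is about 0.74), so the sentence receives a positive score.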


2018 ◽  
Vol 16 (1) ◽  
pp. 869-875
Author(s):  
Mediha İpek ◽  
Tuba Yener ◽  
Gözde Ç. Efe ◽  
Ibrahim Altınsoy ◽  
Cuma Bindal ◽  
...  

Abstract: Intermetallics are a group of materials that attract attention for features such as ordered structure, high temperature resistance, high hardness and low density. This paper aims to obtain intermetallic matrix composites while retaining some ductile Nb and Ti metallic phase, using titanium, niobium and aluminium powders of 99.5% purity and 35-44 μm particle size, in one step via a recently developed powder metallurgy processing technique, the electric current activated/assisted sintering (ECAS) system. In this way, metallic reinforced intermetallic matrix composites were produced. The dominant phases, TiAl3 and NbAl3, which were the first compounds formed by the peritectic reaction between solid titanium and niobium and molten aluminium in the Ti-Al-Nb system at a current of 2000 A and a voltage of 1.5-2.0 V for 10, 30 and 90 s, were detected by XRD and SEM-EDS analysis. Hardness values of the test samples were measured by the Vickers indentation technique; the hardness of the intermetallic phases was 411 HVN, whereas that of the ductile metallic phase was 120 HVN.


2017 ◽  
Vol 35 (3) ◽  
pp. 398-409
Author(s):  
Gracielle Mendonça Rodrigues Gomes ◽  
Beatriz Valadares Cendon

Purpose: The study aims to propose the use of the semiotics inspection method (SIM), an interpretative and qualitative method from semiotics engineering (SE) for evaluating the communicability of systems, to evaluate digital libraries and information retrieval systems (IRS). The paper presents the results of applying this method to evaluate the quality of the communicability of the interface and search system of the Coordination for the Improvement of Higher Education Personnel (CAPES) Portal of e-Journals, a major scientific digital library in Brazil. Proposed solutions to improve this system are included. Design/methodology/approach: The study used the SIM to evaluate the system. Two evaluators inspected the system. They performed the comparison and the analysis of three types of metamessages (metalinguistic, static and dynamic). The metamessages generated by the evaluators were contrasted to find inconsistencies and ambiguities in the CAPES Portal of e-Journals. The last step of the method was the final assessment of the inspection. Findings: The evaluators identified 52 problems of communicability. These problems were ranked according to the severity ratings established by Nielsen (1994). They were grouped into ten types of problems present in the interface and in the search system of the CAPES Portal of e-Journals. Originality/value: This research contributes theoretically to the field of information retrieval and to the area of human–computer interaction and, in particular, to the theory of SE by adapting SE methods that allow the evaluation of communicability to the context of scientific IRS. Results obtained through scientific methods should contribute to the development of the interface and search tools of IRS to better support query formulation and retrieval of relevant information and to satisfy the information needs of individuals more efficiently.


Nanoscale ◽  
2014 ◽  
Vol 6 (9) ◽  
pp. 4864-4873 ◽  
Author(s):  
Partha Khanra ◽  
Chang-No Lee ◽  
Tapas Kuila ◽  
Nam Hoon Kim ◽  
Min Jun Park ◽  
...  

Water-dispersible functionalized graphene via one-step electrochemical exfoliation of graphite was prepared using 7,7,8,8-tetracyanoquinodimethane (TCNQ) anions as surface modifiers and electrolytes. The specific capacitance value of TCNQ-modified graphene measured with electrolytes (1 M KOH) was 324 F g−1 at a current density of 1 A g−1.


2017 ◽  
Vol 35 (3) ◽  
pp. 447-453
Author(s):  
Ying Tao ◽  
Danqin Yi ◽  
Baojun Zhu ◽  
Wenpeng Cao

Abstract: Diamond-like carbon (DLC) thin films were prepared by a hydrothermal electrochemical method in a one-step process. The structural characterization of these films was carried out by scanning electron microscopy (SEM), Raman spectroscopy, and infrared reflectance spectroscopy (IR). It was found that there was an increased sp2 carbon content but decreased sp3 carbon and hydrogen contents with an increase in current density. The flexibility and internal stresses of the DLC films were affected by hydrogen, sp3 amorphous carbon and ordered crystalline sp2 carbon contents. The highly flexible DLC films with high sp3 carbon and hydrogen contents were prepared at a current density of 0.001 mA/cm2.


2020 ◽  
Vol 18 (4) ◽  
pp. 209-221
Author(s):  
N. G. Neznanov ◽  
U. V. Lebedeva ◽  
O. Rida ◽  
V. B. Petrova ◽  
E. I. Palchikova ◽  
...  

The aim is to study the influence and assessment of mental and emotional states in patients with arrhythmias. Materials and methods. The literature search was performed using the following resources: PubMed, Web of Science and Scopus, as well as the Google Scholar search system, with the key words “psychoarrhythmology”, “neural-cardiac axis”, “psychocardiology”, “arrhythmogenesis”, and “stress-induced arrhythmia”. Articles had to be freely available and represent the most relevant information on the topic. Studies were selected by the largest sample and citation index. Results. This review of studies on the correlation of psychosocial factors and constitutional features of personality in patients with arrhythmias presents the available data on the pathogenesis of cardiac pathology, including the main arrhythmological disorders in nervous excitation caused by negative emotions and stress. The article also reflects the importance of a multidisciplinary approach to risk prediction, potential risk modifiers and approaches to the treatment of cardiac pathology, taking into account the psycho-emotional state of the patient. Conclusion. Reducing the severity of the disease requires a comprehensive approach, in particular psychodiagnostics, psychocorrection, psychotherapy and psychopharmacotherapy. Further development of this approach will lead to the creation of new programs for early diagnosis, prevention and treatment of cardiac pathology.


Author(s):  
Mario Diván ◽  
María de los Ángeles Martín

In this work, an update of the C-INCAMI (Context-Information Need, Concept model, Attribute, Metric and Indicator) conceptual framework for Measurement and Evaluation projects was proposed. The update incorporated better support for measure stream processing. Accordingly, a new version of the measurement interchange schema based on the updated C-INCAMI framework was introduced. This new schema incorporated the concept of “complementary data”, linking them with geographic information. The complementary data could be associated with the measures and allowed us to incorporate video, geographic information, plain text, audio or pictures jointly with the quantitative measures (deterministic or estimated). A practical case associated with the Weather Radar of the Experimental Agricultural Station (EAS) INTA Anguil (Province of La Pampa, Argentina) was shown, indicating the advantages of the new schema.

