Clear and Private Ad Hoc Retrieval Models on Web Data

Author(s):  
Souria Ortiga

During the 1980s, and despite its maturity, the search information (RI) was only intended for librarians and experts in the field of information. Such tendentious vision prevailed for many years. Since the mid-90s, the web has become an increasingly crucial source of information , which has a renewed interest in IR. In the last decade, the popularization of computers, the terrible explosion in the amount of unstructured data, internal documents, and corporate collections, and the huge and growing number of internet document sources have deeply shaken the relationship between man and information. Today, a great change has taken place, and the RI is often used by billions of people around the world. Simply, the need for automated methods for efficient access to this huge amount of digital information has become more important, and appears as a necessity.

2022 ◽  
Vol 40 (3) ◽  
pp. 1-37
Author(s):  
Edward Kai Fung Dang ◽  
Robert Wing Pong Luk ◽  
James Allan

In Information Retrieval, numerous retrieval models or document ranking functions have been developed in the quest for better retrieval effectiveness. Apart from some formal retrieval models formulated on a theoretical basis, various recent works have applied heuristic constraints to guide the derivation of document ranking functions. While many recent methods are shown to improve over established and successful models, comparison among these new methods under a common environment is often missing. To address this issue, we perform an extensive and up-to-date comparison of leading term-independence retrieval models implemented in our own retrieval system. Our study focuses on the following questions: (RQ1) Is there a retrieval model that consistently outperforms all other models across multiple collections; (RQ2) What are the important features of an effective document ranking function? Our retrieval experiments performed on several TREC test collections of a wide range of sizes (up to the terabyte-sized Clueweb09 Category B) enable us to answer these research questions. This work also serves as a reproducibility study for leading retrieval models. While our experiments show that no single retrieval model outperforms all others across all tested collections, some recent retrieval models, such as MATF and MVD, consistently perform better than the common baselines.


Author(s):  
Ana Gabriela Maguitman ◽  
Carlos M. Lorenzetti ◽  
Rocío L. Cecchini

Performance evaluation plays a crucial role in the development and improvement of search systems in general and context-based systems in particular. In order to evaluate search systems, test collections are needed. These test collections typically involve a corpus of documents, a set of queries and a series of relevance assessments. In traditional approaches users or hired evaluators provide manual assessments of relevance. However this is difficult and expensive, and does not scale with the complexity and heterogeneity of available digital information. This chapter proposes a semantic evaluation framework that takes advantages of topic ontologies and semantic similarity data derived from these ontologies. The structure and content of the Open Directory Project topic ontology is used to derive semantic relations among a massive number of topics and to implement classical and ad hoc retrieval performance evaluation metrics. In addition, this chapter describes an incremental method for context-based retrieval, which is based on the notions of topic descriptors and topic discriminators. The incremental context-based retrieval method is used to illustrate the application of the proposed semantic evaluation framework. Finally, the chapter discusses the advantages of applying the proposed framework.


Author(s):  
Mandeep Kaur ◽  
Manpreet Kaur

Internet is a very powerful communication device to disclose financial and non-financial information. Almost every company today maintains its website and disseminates their information voluntarily. Internet is very exciting medium to disclose information in the form of presentation. It has become most frequently used source of information. This paper tries to examine the web home page disclosure practices of top public and private Indian banks and try to find out the relationship between the disclosure score and size of bank by using the sample of 20 banks which constitute of top public and private sector banks. The results show that there is positive relationship between the disclosure score and size of bank.


Author(s):  
Katherine H. Rogers

When forming impressions of an other’s personality, people often rely on information not directly related to the individual at hand. One source of information that can influence people’s impressions of others is the personality of the average person (i.e., normative profile). This relationship between the normative profile and an impression is called normative accuracy or normativity. In this chapter, you will learn about the average personality, why it is important, the relationship to social desirability and what it means to have a normative impression, as well as correlates and moderators of normativity. More broadly, you will learn about current research and views regarding the normative profile and normative impressions as well as concrete steps for incorporating this approach into your future research on interpersonal perception.


Molecules ◽  
2021 ◽  
Vol 26 (13) ◽  
pp. 3895
Author(s):  
Marica Baldoni ◽  
Alessandra Nardi ◽  
Flavio De Angelis ◽  
Olga Rickards ◽  
Cristina Martínez-Labarga

The present research investigates the relationship between dietary habits and mortality patterns in the Roman Imperial and Medieval periods. The reconstructions of population dynamics and subsistence strategies provide a fascinating source of information for understanding our history. This is particularly true given that the changes in social, economic, political, and religious aspects related to the transition from the Roman period to the Middle Ages have been widely discussed. We analyzed the isotopic and mortality patterns of 616 individuals from 18 archeological sites (the Medieval Latium sites of Colonna, Santa Severa, Allumiere, Cencelle, and 14 Medieval and Imperial funerary contexts from Rome) to compile a survivorship analysis. A semi-parametric approach was applied, suggesting variations in mortality patterns between sexes in the Roman period. Nitrogen isotopic signatures influenced mortality in both periods, showing a quadratic and a linear effect for Roman Imperial and Medieval populations, respectively. No influence of carbon isotopic signatures has been detected for Roman Imperial populations. Conversely, increased mortality risk for rising carbon isotopic values was observed in Medieval samples.


Author(s):  
Mª del Carmen Pérez-Fuentes ◽  
José J. Gázquez ◽  
Mª del Mar Molero ◽  
Fernando Cardila ◽  
África Martos ◽  
...  

Adolescence is characterized by premature experimentation with new experiences and sensations. These experiences sometimes include drugs, which even though legal and socially accepted, begin to have noticeable negative consequences to the adolescent’s development. In recent years, a decrease in use of tobacco by Spanish adolescents has been observed, but not in alcohol. One of the causes of initiation in drug use is impulsive personality or behavior. Thus the purpose of this study was to analyze the relationship between impulsiveness and frequency of use of alcohol and tobacco in 822 students aged 13 to 18 years of age. The State Impulsivity Scale (SIS) and an ad hoc questionnaire on demographic characteristics and use of alcohol and tobacco were used for this. The results showed that students who stated they were users scored significantly higher on impulsivity. Thus detailed analysis of the profile of individuals with this risk factor could favor more adequate intervention program design.


Sign in / Sign up

Export Citation Format

Share Document