A Neural Passage Model for Ad-hoc Document Retrieval

The main goal of search engines is ad hoc retrieval: ranking documents in a corpus by their relevance to the information need expressed by a query. The Probability Ranking Principle (PRP) --- ranking the documents by their relevance probabilities --- is the theoretical foundation of most existing ad hoc document retrieval methods. A key observation that motivates our work is that the PRP does not account for potential post-ranking effects; specifically, changes to documents that result from a given ranking. Yet, in adversarial retrieval settings such as the Web, authors may consistently try to promote their documents in rankings by changing them. We prove that, indeed, the PRP can be sub-optimal in adversarial retrieval settings. We do so by presenting a novel game theoretic analysis of the adversarial setting. The analysis is performed for different types of documents (single-topic and multi-topic) and is based on different assumptions about the writing qualities of documents' authors. We show that in some cases, introducing randomization into the document ranking function yields an overall user utility that transcends that of applying the PRP.

Download Full-text

Graph-based entity-oriented search

ACM SIGIR Forum ◽

10.1145/3476415.3476430 ◽

2021 ◽

Vol 55 (1) ◽

pp. 1-2

Author(s):

José Devezas

Keyword(s):

Random Walk ◽

Ad Hoc ◽

Evaluation Process ◽

Document Retrieval ◽

Knowledge Bases ◽

Ranking Function ◽

Test Collection ◽

Information Need ◽

Keyword Query ◽

Walk Score

Entity-oriented search has revolutionized search engines. In the era of Google Knowledge Graph and Microsoft Satori, users demand an effortless process of search. Whether they express an information need through a keyword query, expecting documents and entities, or through a clicked entity, expecting related entities, there is an inherent need for the combination of corpora and knowledge bases to obtain an answer. Such integration frequently relies on independent signals extracted from inverted indexes, and from quad indexes indirectly accessed through queries to a triplestore. However, relying on two separate representation models inhibits the effective cross-referencing of information, discarding otherwise available relations that could lead to a better ranking. Moreover, different retrieval tasks often demand separate implementations, although the problem is, at its core, the same. With the goal of harnessing all available information to optimize retrieval, we explore joint representation models of documents and entities, while taking a step towards the definition of a more general retrieval approach. Specifically, we propose that graphs should be used to incorporate explicit and implicit information derived from the relations between text found in corpora and entities found in knowledge bases. We also take advantage of this framework to elaborate a general model for entity-oriented search, proposing a universal ranking function for the tasks of ad hoc document retrieval (leveraging entities), ad hoc entity retrieval, and entity list completion. At a conceptual stage, we begin by proposing the graph-of-entity, based on the relations between combinations of term and entity nodes. We introduce the entity weight as the corresponding ranking function, relying on the idea of seed nodes for representing the query, either directly through term nodes, or based on the expansion to adjacent entity nodes. The score is computed based on a series of geodesic distances to the remaining nodes, providing a ranking for the documents (or entities) in the graph. In order to improve on the low scalability of the graph-of-entity, we then redesigned this model in a way that reduced the number of edges in relation to the number of nodes, by relying on the hypergraph data structure. The resulting model, which we called hypergraph-of-entity, is the main contribution of this thesis. The obtained reduction was achieved by replacing binary edges with n -ary relations based on sets of nodes and entities (undirected document hyperedges), sets of entities (undirected hyperedges, either based on cooccurrence or a grouping by semantic subject), and pairs of a set of terms and a set of one entity (directed hyperedges, mapping text to an object). We introduce the random walk score as the corresponding ranking function, relying on the same idea of seed nodes, similar to the entity weight in the graph-of-entity. Scoring based on this function is highly reliant on the structure of the hypergraph, which we call representation-driven retrieval. As such, we explore several extensions of the hypergraph-of-entity, including relations of synonymy, or contextual similarity, as well as different weighting functions per node and hyperedge type. We also propose TF-bins as a discretization for representing term frequency in the hypergraph-of-entity. For the random walk score, we propose and explore several parameters, including length and repeats, with or without seed node expansion, direction, or weights, and with or without a certain degree of node and/or hyperedge fatigue, a concept that we also propose. For evaluation, we took advantage of TREC 2017 OpenSearch track, which relied on an online evaluation process based on the Living Labs API, and we also participated in TREC 2018 Common Core track, which was based on the newly introduced TREC Washington Post Corpus. Our main experiments were supported on the INEX 2009 Wikipedia collection, which proved to be a fundamental test collection for assessing retrieval effectiveness across multiple tasks. At first, our experiments solely focused on ad hoc document retrieval, ensuring that the model performed adequately for a classical task. We then expanded the work to cover all three entity-oriented search tasks. Results supported the viability of a general retrieval model, opening novel challenges in information retrieval, and proposing a new path towards generality in this area.

Download Full-text

Learning Deep Relevance Couplings for Ad-hoc Document Retrieval

Expert Systems with Applications ◽

10.1016/j.eswa.2021.115335 ◽

2021 ◽

pp. 115335

Author(s):

Shufeng Hao ◽

Chongyang Shi ◽

Longbing Cao ◽

Zhendong Niu ◽

Ping Guo

Keyword(s):

Ad Hoc ◽

Document Retrieval

Download Full-text

United States and Russia Promote Ad Hoc Global Initiative to Enhance Capacity to Deal with Events of Nuclear Terrorism

The American Journal of International Law ◽

10.1017/s000293000075408x ◽

2008 ◽

Vol 102 (4) ◽

pp. 870-872

Author(s):

John R. Crook

Keyword(s):

United States ◽

Ad Hoc ◽

Nuclear Terrorism ◽

Global Initiative

Download Full-text

Committees: Standing and Ad Hoc

Kappa Delta Pi Record ◽

10.1080/00228958.1965.10517088 ◽

1965 ◽

Vol 1 (3) ◽

pp. 68-68

Keyword(s):

Ad Hoc

Download Full-text

Die COVID-19-Pandemie im akutstationären Setting aus Sicht von Führungspersonen und Hygienefachkräften in der Pflege

Pflege ◽

10.1024/1012-5302/a000756 ◽

2020 ◽

Vol 33 (5) ◽

pp. 289-298

Author(s):

Katharina Silies ◽

Angelika Schley ◽

Janna Sill ◽

Steffen Fleischer ◽

Martin Müller ◽

...

Keyword(s):

Ad Hoc ◽

Qualitative Studie ◽

Skill Mix ◽

Task Forces

Zusammenfassung. Hintergrund: Die COVID-19-Pandemie ist eine Ausnahmesituation ohne Präzedenz und erforderte zahlreiche Ad-hoc-Anpassungen in den Strukturen und Prozessen der akutstationären Versorgung. Ziel: Ziel war es zu untersuchen, wie aus Sicht von Führungspersonen und Hygienefachkräften in der Pflege die stationäre Akutversorgung durch die Pandemiesituation beeinflusst wurde und welche Implikationen sich daraus für die Zukunft ergeben. Methoden: Qualitative Studie bestehend aus semistrukturierten Interviews mit fünf Verantwortlichen des leitenden Pflegemanagements und drei Hygienefachkräften in vier Krankenhäusern in Deutschland. Die Interviews wurden mittels qualitativer Inhaltsanalyse ausgewertet. Ergebnisse: Die Befragten beschrieben den auf die prioritäre Versorgung von COVID-19-Fällen hin umstrukturierten Klinikalltag. Herausforderungen waren Unsicherheit und Angst bei den Mitarbeiter_innen, relative Ressourcenknappheit von Material und Personal und die schnelle Umsetzung neuer Anforderungen an die Versorgungleistung. Dem wurde durch gezielte Kommunikation und Information, massive Anstrengungen zur Sicherung der Ressourcen und koordinierte Steuerung aller Prozesse durch bereichsübergreifende, interprofessionelle Task Forces begegnet. Schlussfolgerungen: Die in der COVID-19-Pandemie vorgenommenen Anpassungen zeigen Entwicklungspotenziale für die zukünftige Routineversorgung auf, z. B. könnten neue Arbeits- und Skill Mix-Modelle aufgegriffen werden. Für die Konkretisierung praktischer Implikationen sind vertiefende Analysen der Daten mit zeitlichem Abstand erforderlich.

Download Full-text