Compression and full-text indexing for Digital Libraries

Author(s):  
Ian H. Witten ◽  
Alistair Moffat ◽  
Timothy C. Bell
1994 ◽  
Vol 15 (1) ◽  
pp. 11-13 ◽  
Author(s):  
Ian H. Witten ◽  
Alistair Moffat ◽  
Timothy C. Bell

Author(s):  
Jason Strate ◽  
Grant Fritchey
Keyword(s):  

Author(s):  
Namik Delilovic

Searching for contents in present digital libraries is still very primitive; most websites provide a search field where users can enter information such as book title, author name, or terms they expect to be found in the book. Some platforms provide advanced search options, which allow the users to narrow the search results by specific parameters such as year, author name, publisher, and similar. Currently, when users find a book which might be of interest to them, this search process ends; only a full-text search or references at the end of the book may provide some additional pointers. In this chapter, the author is going to give an example of how a user could permanently get recommendations for additional contents even while reading the article, using present machine learning and artificial intelligence techniques.


2017 ◽  
Vol 65 (4) ◽  
pp. 407-418
Author(s):  
S. Grabowski ◽  
M. Raniszewski

AbstractFull-text indexing aims at building a data structure over a given text capable of efficiently finding arbitrary text patterns, and possibly requiring little space. We propose two suffix array inspired full-text indexes. One, called SA-hash, augments the suffix array with a hash table to speed up pattern searches due to significantly narrowed search interval before the binary search phase. The other, called FBCSA, is a compact data structure, similar to Mäkinen’s compact suffix array (MakCSA), but working on fixed size blocks. Experiments on the widely used Pizza & Chili datasets show that SA-hash is about 2–3 times faster in pattern searches (counts) than the standard suffix array, for the price of requiring 0.2n–1.1nbytes of extra space, wherenis the text length. FBCSA, in one of the presented variants, reduces the suffix array size by a factor of about 1.5–2, while it gets close in search times, winning in speed with its competitors known from the literature, MakCSA and LCSA.


Author(s):  
Shuigeng Zhou ◽  
Jihong Guan ◽  
Yunfa Hu ◽  
Jiangtao Hu ◽  
Aoying Zhou

2014 ◽  
Vol 8 (1) ◽  
pp. 321-326 ◽  
Author(s):  
Wei-Zhe Zhang ◽  
Hui-Xiang Chen ◽  
Hui He ◽  
Gui Chen

Sign in / Sign up

Export Citation Format

Share Document