THE EVOLUTION OF THE CORNELL SPIRES SLIDE INDEXING SYSTEM

Author(s):  
Ingeborg Wald
Keyword(s):  
10.28945/371 ◽  
2008 ◽  
Vol 4 ◽  
pp. 137-149 ◽  
Author(s):  
Doina Ana Cernea ◽  
Esther Del Moral-Pérez ◽  
Jose E. Labra Gayo

Terminology ◽  
2005 ◽  
Vol 11 (1) ◽  
pp. 199-224 ◽  
Author(s):  
Adeline Nazarenko ◽  
Touria Aït El Mekki

This paper presents an original natural language processing (NLP) approach for building of back-of-the-book indexes. Our indexing system, IndDoc, exploits some terminological tools and automatically builds an index draft of the analysis of the document text. The indexer then has to validate that index draft through a dedicated interface. This approach has been tested on several documents with promising results. Relying on our experience in developing and testing the IndDoc indexing system, we aim at assessing the contribution of terminological analysis as well as the level of maturity that computational terminology has reached in the indexing perspective.


2007 ◽  
Vol 45 (4) ◽  
pp. 839-852 ◽  
Author(s):  
Chi-Ren Shyu ◽  
Matt Klaric ◽  
Grant J. Scott ◽  
Adrian S. Barb ◽  
Curt H. Davis ◽  
...  

2007 ◽  
Vol 37 (2) ◽  
pp. 135-167 ◽  
Author(s):  
Ardhendu Behera ◽  
Denis Lalanne ◽  
Rolf Ingold
Keyword(s):  

2018 ◽  
Vol 12 (02) ◽  
pp. 191-213
Author(s):  
Nan Zhu ◽  
Yangdi Lu ◽  
Wenbo He ◽  
Hua Yu ◽  
Jike Ge

The sheer volume of contents generated by today’s Internet services is stored in the cloud. The effective indexing method is important to provide the content to users on demand. The indexing method associating the user-generated metadata with the content is vulnerable to the inaccuracy caused by the low quality of the metadata. While the content-based indexing does not depend on the error-prone metadata, the state-of-the-art research focuses on developing descriptive features and misses the system-oriented considerations when incorporating these features into the practical cloud computing systems. We propose an Update-Efficient and Parallel-Friendly content-based indexing system, called Partitioned Hash Forest (PHF). The PHF system incorporates the state-of-the-art content-based indexing models and multiple system-oriented optimizations. PHF contains an approximate content-based index and leverages the hierarchical memory system to support the high volume of updates. Additionally, the content-aware data partitioning and lock-free concurrency management module enable the parallel processing of the concurrent user requests. We evaluate PHF in terms of indexing accuracy and system efficiency by comparing it with the state-of-the-art content-based indexing algorithm and its variances. We achieve the significantly better accuracy with less resource consumption, around 37% faster in update processing and up to 2.5[Formula: see text] throughput speedup in a multi-core platform comparing to other parallel-friendly designs.


2005 ◽  
Vol 13 (4) ◽  
pp. 299 ◽  
Author(s):  
Mladen Kolar ◽  
Igor Vukmirovi� ◽  
Bojana Dalbelo Ba�i� ◽  
Jan �najder

Sign in / Sign up

Export Citation Format

Share Document