Score-consistent algebraic optimization of full-text search queries with GRAFT

SMART SEARCH IN THE DATABASE OF PHYSIC-TECHNICAL INFORMATION

ITNOU: Information technologies in education, science and management ◽

10.47501/itnou.2020.2.28-32 ◽

2020 ◽

pp. 28-32

Author(s):

Dmitry Mikhailovich Korobkin ◽

Stanislav Alekseevich Avdosev ◽

Sergei Alekseevich Fomenkov ◽

Sergei Grigorievich Kolesnikov

Keyword(s):

Full Text ◽

Technical Information ◽

Text Search ◽

Search System ◽

Full Text Search ◽

Search Queries ◽

Intelligent Search ◽

Physical Effects

The article describes the process of developing an automated intelligent search system based on physical effects. The developed system performs descriptor, full-text search, logging of search queries, displaying physical effects and other functions.

Get full-text (via PubEx)

Big Data Full-Text Search Index Minimization Using Text Summarization

Information Technology And Control ◽

10.5755/j01.itc.50.2.25470 ◽

2021 ◽

Vol 50 (2) ◽

pp. 375-389

Author(s):

Waheed Iqbal ◽

Waqas Ilyas Malik ◽

Faisal Bukhari ◽

Khaled Mohamad Almustafa ◽

Zubiar Nawaz

Keyword(s):

Big Data ◽

Full Text ◽

Text Summarization ◽

Text Search ◽

Full Text Search ◽

Search Queries ◽

Search Results ◽

Real World Datasets ◽

Search Index ◽

Index Size

An efficient full-text search is achieved by indexing the raw data with an additional 20 to 30 percent storagecost. In the context of Big Data, this additional storage space is huge and introduces challenges to entertainfull-text search queries with good performance. It also incurs overhead to store, manage, and update the largesize index. In this paper, we propose and evaluate a method to minimize the index size to offer full-text searchover Big Data using an automatic extractive-based text summarization method. To evaluate the effectivenessof the proposed approach, we used two real-world datasets. We indexed actual and summarized datasets usingApache Lucene and studied average simple overlapping, Spearman’s rho correlation, and average rankingscore measures of search results obtained using different search queries. Our experimental evaluation showsthat automatic text summarization is an effective method to reduce the index size significantly. We obtained amaximum of 82% reduction in index size with 42% higher relevance of the search results using the proposedsolution to minimize the full-text index size.

Get full-text (via PubEx)

Improving full text search performance through textual analysis

Information Processing & Management ◽

10.1016/0306-4573(93)90083-p ◽

1993 ◽

Vol 29 (5) ◽

pp. 615-632 ◽

Cited By ~ 2

Author(s):

Mavis Molto

Keyword(s):

Full Text ◽

Textual Analysis ◽

Search Performance ◽

Text Search ◽

Full Text Search

Get full-text (via PubEx)

GPU Computation for Online Realtime Multi-Pattern Matching

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.284-287.3428 ◽

2013 ◽

Vol 284-287 ◽

pp. 3428-3432 ◽

Cited By ~ 2

Author(s):

Yu Hsiu Huang ◽

Richard Chun Hung Lin ◽

Ying Chih Lin ◽

Cheng Yi Lin

Keyword(s):

Parallel Computation ◽

Pattern Matching ◽

Full Text ◽

Network Performance ◽

Text Search ◽

Full Text Search ◽

Network Intrusion ◽

Speed Up ◽

Set Up ◽

Gpu Implementation

Most applications of traditional full-text search, e.g., webpage search, are offline which exploit text search engine to preview the texts and set up related index. However, applications of online realtime full-text search, e.g., network Intrusion detection and prevention systems (IDPS) are too hard to implementation by using commodity hardware. They are expensive and inflexible for more and more occurrences of new virus patterns and the text cannot be previewed and the search must be complete realtime online. Additionally, IDPS needs multi-pattern matching, and then malicious packets can be removed immediately from normal ones without degrading the network performance. Considering the problem of realtime multi-pattern matching, we implement two sequential algorithms, Wu-Manber and Aho-Corasick, respectively over GPU parallel computation platform. Both pattern matching algorithms are quite suitable for the cases with a large amount of patterns. In addition, they are also easier extendable over GPU parallel computation platform to satisfy realtime requirement. Our experimental results show that the throughput of GPU implementation is about five to seven times faster than CPU. Therefore, pattern matching over GPU offers an attractive solution of IDPS to speed up malicious packets detection among the normal traffic by considering the lower cost, easy expansion and better performance.

Get full-text (via PubEx)

Application of Full Text Search Engine Based on Lucene

Advances in Internet of Things ◽

10.4236/ait.2012.24013 ◽

2012 ◽

Vol 02 (04) ◽

pp. 106-109 ◽

Cited By ~ 6

Author(s):

Rujia Gao ◽

Danying Li ◽

Wanlong Li ◽

Yaze Dong

Keyword(s):

Search Engine ◽

Full Text ◽

Text Search ◽

Full Text Search

Get full-text (via PubEx)

Development of the Multilingual Collaboration System for Farmers of Several Countries (2) : Multilingual Full Text Search System

Journal of the Faculty of Agriculture, Kyushu University ◽

10.5109/4605 ◽

2004 ◽

Vol 49 (2) ◽

pp. 441-448

Author(s):

Kang Oh Lee ◽

Kei Nakaji ◽

Yoichi Nada

Keyword(s):

Full Text ◽

Text Search ◽

Search System ◽

Full Text Search

Get full-text (via PubEx)

Experimental simulation on incremental three-gram index for two-gram full-text search systems

SMC'03 Conference Proceedings. 2003 IEEE International Conference on Systems, Man and Cybernetics. Conference Theme - System Security and Assurance (Cat. No.03CH37483) ◽

10.1109/icsmc.2003.1245750 ◽

2004 ◽

Cited By ~ 1

Author(s):

H. Yamamoto ◽

S. Ohmi ◽

H. Tsuji

Keyword(s):

Full Text ◽

Experimental Simulation ◽

Text Search ◽

Full Text Search ◽

Search Systems

Get full-text (via PubEx)

“Dynamic” Syntax Model in Automated Language Analysis Systems for Increasing Full-Text Search Systems Efficiency

Emerging Intelligent Technologies in Industry - Studies in Computational Intelligence ◽

10.1007/978-3-642-22732-5_14 ◽

2011 ◽

pp. 157-166

Author(s):

Marcin Karwinski

Keyword(s):

Full Text ◽

Text Search ◽

Full Text Search ◽

Language Analysis ◽

Dynamic Syntax ◽

Search Systems

Get full-text (via PubEx)

Recommender Systems in Digital Libraries Using Artificial Intelligence and Machine Learning

Advances in Systems Analysis, Software Engineering, and High Performance Computing - Handbook of Research on Methodologies and Applications of Supercomputing ◽

10.4018/978-1-7998-7156-9.ch012 ◽

2021 ◽

pp. 162-178

Author(s):

Namik Delilovic

Keyword(s):

Artificial Intelligence ◽

Machine Learning ◽

Digital Libraries ◽

Full Text ◽

Text Search ◽

Full Text Search ◽

Artificial Intelligence Techniques ◽

Search Results ◽

Advanced Search ◽

Search Field

Searching for contents in present digital libraries is still very primitive; most websites provide a search field where users can enter information such as book title, author name, or terms they expect to be found in the book. Some platforms provide advanced search options, which allow the users to narrow the search results by specific parameters such as year, author name, publisher, and similar. Currently, when users find a book which might be of interest to them, this search process ends; only a full-text search or references at the end of the book may provide some additional pointers. In this chapter, the author is going to give an example of how a user could permanently get recommendations for additional contents even while reading the article, using present machine learning and artificial intelligence techniques.

Get full-text (via PubEx)

ASH: A New Tool for Automated and Full-Text Search in Systematic Literature Reviews

Computational Science – ICCS 2021 - Lecture Notes in Computer Science ◽

10.1007/978-3-030-77967-2_30 ◽

2021 ◽

pp. 362-369

Author(s):

Marek Sośnicki ◽

Lech Madeyski

Keyword(s):

Full Text ◽

Text Search ◽

Full Text Search ◽

Literature Reviews

Get full-text (via PubEx)