string searching Latest Research Papers

Word matching is a string searching technique for information retrieval in Natural Language Processing (NLP). There are several algorithms have been used for string search and matching such as Knuth Morris Pratt, Boyer Moore, Horspool, Intelligent Predictive and many other. However, there some issues need to be considered in measuring the performance of the algorithms such as the efficiency for searching small alphabets, time taken in processing the pattern of the text and extra space to support a huge table or state machines. Intelligent Predictive (IP) algorithm capable to solve several word matching issues discovered in other string searching algorithms especially with abilities to skip the pre-processing of the pattern, uses simple rules during matching process and does not involved complex computations. Due to those reasons,<strong> </strong>IP algorithm is used in this study due to the ability of this algorithm to produce a good result in string searching process. This article aims to apply IP algorithm together with Optical Character Recognition (OCR) tool for mobile-based word matching detection. There are four phases in this study consists of data preparation, mobile based system design, algorithm implementation and result analysis. The efficiency of the proposed algorithm was evaluated based on the execution time of searching process among the selected algorithms. The result shows that the IP algorithm for string searching process is more efficient in execution time compared to well-known algorithm i.e. Boyer Moore algorithm. In future work, the performance of string searching process can be enhanced by using other suitable optimization searching techniques such as Genetic Algorithm, Particle Swarm Optimization, Ant Colony Optimization and many others.

Download Full-text

Searching Process using Bayer Moore Algorithm in Medicinal Information Media

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.b1007.0782s719 ◽

2019 ◽

Vol 8 (2S7) ◽

pp. 30-32

Keyword(s):

Medicinal Plants ◽

String Matching ◽

Botanical Garden ◽

Visual Simulation ◽

String Searching ◽

Information Media ◽

Search For Information

Indonesia is one of the giant centers of biodiversity, there are many types of plants that can be used as a medicine. Based on medicinal plants collection of Bogor Botanical Garden which has the potential as medicine consists of 764 species, 465 genus and 135 families. So in search of herbs manually becomes difficult. Therefore, to facilitate the search for information about the plume of drugs, required an effective and easy to use information media by the community. In this research we used Boyer Moore algorithm to execute the searching process. This research shows the process of string matching using a Boyer-Moore algorithm in the form of visual simulation by doing string searching of a pattern string, The results show that the algorithm can be used to the searching process

Download Full-text

Fast String Searching on PISA

Proceedings of the 2019 ACM Symposium on SDN Research - SOSR '19 ◽

10.1145/3314148.3314356 ◽

2019 ◽

Cited By ~ 5

Author(s):

Theo Jepsen ◽

Daniel Alvarez ◽

Nate Foster ◽

Changhoon Kim ◽

Jeongkeun Lee ◽

...

Keyword(s):

String Searching

Download Full-text

Approximate string searching with fast fourier transforms and simplexes

10.7287/peerj.preprints.27615 ◽

2019 ◽

Cited By ~ 1

Author(s):

Daniel Liu

Keyword(s):

Fourier Transform ◽

Fast Fourier Transform ◽

Fourier Transforms ◽

Hamming Distance ◽

Dimensional Space ◽

Approximate String Matching ◽

String Searching ◽

Novel Approach ◽

Sigma 1 ◽

Distance Problem

Previous algorithms for solving the approximate string matching with Hamming distance problem with wildcard ("don't care") characters have been shown to take \(O(|\Sigma| N \log M)\) time, where \(N\) is the length of the text, \(M\) is the length of the pattern, and \(|\Sigma|\) is the size of the alphabet. They make use of the Fast Fourier Transform for efficiently calculating convolutions. We describe a novel approach of the problem, which makes use of special encoding schemes that depend on \((|\Sigma| - 1)\)-simplexes in \((|\Sigma| - 1)\)-dimensional space.

Download Full-text

Approximate string searching with fast fourier transforms and simplexes

10.7287/peerj.preprints.27615v1 ◽

2019 ◽

Cited By ~ 2

Author(s):

Daniel Liu

Keyword(s):

Fourier Transform ◽

Fast Fourier Transform ◽

Fourier Transforms ◽

Hamming Distance ◽

Dimensional Space ◽

Approximate String Matching ◽

String Searching ◽

Novel Approach ◽

Sigma 1 ◽

Distance Problem

Previous algorithms for solving the approximate string matching with Hamming distance problem with wildcard ("don't care") characters have been shown to take \(O(|\Sigma| N \log M)\) time, where \(N\) is the length of the text, \(M\) is the length of the pattern, and \(|\Sigma|\) is the size of the alphabet. They make use of the Fast Fourier Transform for efficiently calculating convolutions. We describe a novel approach of the problem, which makes use of special encoding schemes that depend on \((|\Sigma| - 1)\)-simplexes in \((|\Sigma| - 1)\)-dimensional space.

Download Full-text

Detecting Implicit Security Exceptions Using an Improved Variable-Length Sequential Pattern Mining Method

International Journal of Software Engineering and Knowledge Engineering ◽

10.1142/s0218194017500462 ◽

2017 ◽

Vol 27 (08) ◽

pp. 1235-1268

Author(s):

Jinfu Chen ◽

Saihua Cai ◽

Dave Towey ◽

Lili Zhu ◽

Rubing Huang ◽

...

Keyword(s):

Visual Inspection ◽

Pattern Mining ◽

Sequential Pattern Mining ◽

Variable Length ◽

Sequential Pattern ◽

Sequential Patterns ◽

Mining Method ◽

Security Testing ◽

String Searching ◽

Correct Execution

The process of component security testing can produce massive amounts of monitor logs. Current approaches to detect implicit security exceptions (those which cannot be identified by visual inspection alone) compare correct execution sequences with fixed patterns mined from the execution of sequential patterns in the monitor logs. However, this is not efficient and is not suitable for mining large monitor logs. To enable effective mining of implicit security exceptions from large monitor logs, this paper proposes a method based on improved variable-length sequential pattern mining. The proposed method first mines the variable-length sequential patterns from correct execution sequences and from actual execution sequences, thus reducing the number of patterns. The sequential patterns are then detected using the Sunday string-searching algorithm. We conducted an experimental study based on this method, the results of which show that the proposed method can efficiently detect the implicit security exceptions of components.

Download Full-text

GPU Based N-Gram String Matching Algorithm with Score Table Approach for String Searching in Many Documents

Journal of The Institution of Engineers (India) Series B ◽

10.1007/s40031-017-0295-3 ◽

2017 ◽

Vol 98 (5) ◽

pp. 467-476 ◽

Cited By ~ 1

Author(s):

K. G. Srinivasa ◽

B. N. Shree Devi

Keyword(s):

String Matching ◽

Matching Algorithm ◽

String Searching ◽

N Gram ◽

Score Table

Download Full-text

Perbandingan Algoritma Horspool dan Algoritma Zhu-Takaoka dalam Pencarian String Berbasis Desktop

Jurnal ULTIMA Computing ◽

10.31937/sk.v9i1.568 ◽

2017 ◽

Vol 9 (1) ◽

Cited By ~ 1

Author(s):

Adhi Kusnadi ◽

Abraham Khrisnandi Wicaksono

Keyword(s):

Processing Time ◽

Search Process ◽

Text File ◽

C Programming Language ◽

Retrieval Systems ◽

String Searching ◽

C Programming ◽

Information Retrieval Systems ◽

Index Terms ◽

Time Required

String searching is the search process by using the index to find text that can help in information retrieval systems. Continuing previous research, this study uses an algorithm Horspool and Zhu-Takaoka to find the performance of each of these algorithms in the search for a pattern in the text. So they make a useful desktop-based application to measure performance of both algorithm, particulary the time required to perform string searching process. Using the method of prototyping and Microsoft Visual Studio with C# programming language implementation. Result obtained from this application is the number of words found, and the processing time of each algorithm. From this study, the Horspool algorithm is 19,82845 percent faster in first test with the pattern "swan" in a text file 50 multiples 1000 words and 15.9442 percent in the second trial using text files 70000 words with different pattern than the number of characters Zhu algorithm -Takaoka in the process of searching string. Index Terms-String searching, Horspool, Zhu-Takaoka, Microsoft Visual Studio, application, processing time.

Download Full-text

string searching
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Modification of Valiant’s Parsing Algorithm for the String-Searching Problem

An Algorithm for String Searching

Mobile-Based Word Matching Detection using Intelligent Predictive Algorithm

Searching Process using Bayer Moore Algorithm in Medicinal Information Media

Fast String Searching on PISA

Approximate string searching with fast fourier transforms and simplexes

Approximate string searching with fast fourier transforms and simplexes

Detecting Implicit Security Exceptions Using an Improved Variable-Length Sequential Pattern Mining Method

GPU Based N-Gram String Matching Algorithm with Score Table Approach for String Searching in Many Documents

Perbandingan Algoritma Horspool dan Algoritma Zhu-Takaoka dalam Pencarian String Berbasis Desktop

Export Citation Format

string searchingRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Modification of Valiant’s Parsing Algorithm for the String-Searching Problem

An Algorithm for String Searching

Mobile-Based Word Matching Detection using Intelligent Predictive Algorithm

Searching Process using Bayer Moore Algorithm in Medicinal Information Media

Fast String Searching on PISA

Approximate string searching with fast fourier transforms and simplexes

Approximate string searching with fast fourier transforms and simplexes

Detecting Implicit Security Exceptions Using an Improved Variable-Length Sequential Pattern Mining Method

GPU Based N-Gram String Matching Algorithm with Score Table Approach for String Searching in Many Documents

Perbandingan Algoritma Horspool dan Algoritma Zhu-Takaoka dalam Pencarian String Berbasis Desktop

string searching
Recently Published Documents