Fast Convolutions of Packed Strings and Pattern Matching with Wildcards

We give faster methods to compute discrete convolutions. We assume that all the inputs are packed, that is, strings are packed into words such that each word is packed with [Formula: see text] characters, where w is the length of a machine word and Σ is the alphabet. The output of our methods is also packed, that is, each word of the output contains more than one element of the result. The approach is based on the word-level parallelism and the FFT. Given two strings with m and n ( n ≥ m ) characters that are packed into [Formula: see text] and [Formula: see text] words respectively, the convolution of them can be computed in [Formula: see text] time, where [Formula: see text] by the FFT. Experiments show that our method is three times faster than the convolution using the standard trick. We consider the problem of pattern matching with wildcards on packed strings. It finds all the occurrences of a pattern in a text, both of which may contain wildcards. By the convolution of packed strings, we present algorithms that are faster than the previous [Formula: see text]-time algorithm, where m is the length of the pattern and n the length of the text. The algorithm runs in [Formula: see text] time, where occ is the number of occurrences of the pattern in the input. Experiments show that the method is faster than the bit-parallel wildcard matching algorithm for long patterns.
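
The link between convolution and wildcard matching can be illustrated with a minimal Python sketch. It is not the paper's packed-string FFT algorithm. It assumes the well-known sum-of-squares formulation: for each alignment i, the sum over j of p_j · t_(i+j) · (p_j − t_(i+j))² is zero exactly when the pattern matches, with wildcards encoded as 0. To stay dependency-free and exact, it swaps the FFT for one big-integer multiplication per convolution. The function names (`pack`, `convolve`, `wildcard_match`) and the `?` wildcard symbol are hypothetical choices made for this illustration.

```python
def pack(values, width):
    """Pack non-negative integers into one big integer, `width` bits per field."""
    packed = 0
    for k, v in enumerate(values):
        packed |= v << (k * width)
    return packed


def convolve(a, b):
    """Exact linear convolution of two non-negative integer sequences.

    Assumption for this sketch: a single big-integer multiplication stands in
    for the FFT. Each field is wide enough that no coefficient can overflow
    into its neighbour.
    """
    if not a or not b:
        return []
    bound = min(len(a), len(b)) * max(a) * max(b)  # largest possible coefficient
    width = max(1, bound.bit_length())
    product = pack(a, width) * pack(b, width)
    mask = (1 << width) - 1
    return [(product >> (k * width)) & mask for k in range(len(a) + len(b) - 1)]


def wildcard_match(text, pattern, wildcard="?"):
    """Return all start positions where `pattern` matches `text`.

    The wildcard may appear in the pattern, in the text, or in both.
    """
    n, m = len(text), len(pattern)
    if m == 0 or m > n:
        return []

    def code(c):
        # Wildcards map to 0, every other character to a positive integer.
        return 0 if c == wildcard else ord(c) + 1

    t = [code(c) for c in text]
    p = [code(c) for c in reversed(pattern)]  # reversed: convolution -> correlation

    def corr(p_exp, t_exp):
        return convolve([x ** p_exp for x in p], [y ** t_exp for y in t])

    c31, c22, c13 = corr(3, 1), corr(2, 2), corr(1, 3)
    # Expanding p*t*(p - t)^2 gives p^3*t - 2*p^2*t^2 + p*t^3.
    return [
        i
        for i in range(n - m + 1)
        if c31[i + m - 1] - 2 * c22[i + m - 1] + c13[i + m - 1] == 0
    ]


if __name__ == "__main__":
    print(wildcard_match("abcabdab", "ab?"))
```

Three convolutions suffice regardless of alphabet size, which is why faster convolution translates directly into faster wildcard matching.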

Exploiting word-level parallelism for fast convolutions and their applications in approximate string matching

European Journal of Combinatorics ◽

10.1016/j.ejc.2012.07.013 ◽

2013 ◽

Vol 34 (1) ◽

pp. 38-51 ◽

Cited By ~ 5

Author(s):

Kimmo Fredriksson ◽

Szymon Grabowski

Keyword(s):

String Matching ◽

Approximate String Matching ◽

Word Level ◽

Fast Convolutions ◽

Level Parallelism

Double Line Clustering based Colour Image Segmentation Technique for Plant Disease Detection

Current Medical Imaging (formerly Current Medical Imaging Reviews) ◽

10.2174/1573405614666180322130242 ◽

2019 ◽

Vol 15 (8) ◽

pp. 769-776

Author(s):

Kalaivani Subramani ◽

Shantharajah Periyasamy ◽

Padma Theagarajan

Keyword(s):

Pattern Matching ◽

Error Rate ◽

Median Filter ◽

Identification System ◽

Double Line ◽

Cultivation System ◽

Matching Algorithm ◽

Non Local ◽

Mining Methods ◽

Blight Disease

Background: Agriculture is one of the most essential industries; it fulfills people's needs and plays an important role in the economic development of the nation. However, there is a gap between the agriculture sector and the technology industry, and agricultural plants are frequently affected by bacterial, fungal and viral diseases that lead to loss in crop yield. The affected parts of the plants need to be identified at an early stage to avoid large losses in productivity. Methods: At present, crop cultivation systems depend on the farmer's experience and on manpower, which consumes more time and increases the error rate. To overcome this issue, the proposed system introduces a disease identification system based on the Double Line Clustering technique, using image processing and data mining methods. The introduced method analyzes Anthracnose and blight diseases in grapes, tomato and cucumber. The leaf images are captured, noise is removed by a non-local median filter, and segmentation is done by the double line clustering method. The segmented part is compared with a diseased leaf using a pattern matching algorithm. Conclusion: The clustering algorithm achieved high accuracy, sensitivity, and specificity. Feature extraction is applied after the clustering process, which produces a minimum error rate.

An Optimized Aho-Corasick Multi-Pattern Matching Algorithm for Fast Pattern Matching

2020 IEEE 17th India Council International Conference (INDICON) ◽

10.1109/indicon49873.2020.9342041 ◽

2020 ◽

Author(s):

Uday Trivedi

Keyword(s):

Pattern Matching ◽

Matching Algorithm ◽

Pattern Matching Algorithm

A Flexible Pattern-Matching Algorithm for Network Intrusion Detection Systems Using Multi-Core Processors

Algorithms ◽

10.3390/a10020058 ◽

2017 ◽

Vol 10 (2) ◽

pp. 58 ◽

Cited By ~ 1

Keyword(s):

Intrusion Detection ◽

Pattern Matching ◽

Intrusion Detection Systems ◽

Network Intrusion Detection ◽

Matching Algorithm ◽

Detection Systems ◽

Network Intrusion ◽

Network Intrusion Detection Systems ◽

Pattern Matching Algorithm

Improving Wu-Manber: A Multi-pattern Matching Algorithm

2008 IEEE International Conference on Networking, Sensing and Control ◽

10.1109/icnsc.2008.4525327 ◽

2008 ◽

Cited By ~ 6

Author(s):

Chen Zhen ◽

Wu Di

Keyword(s):

Pattern Matching ◽

Matching Algorithm ◽

Pattern Matching Algorithm

Clustering oriented hashing based multiple string pattern matching algorithm

2015 International Conference on Circuits, Power and Computing Technologies [ICCPCT-2015] ◽

10.1109/iccpct.2015.7159288 ◽

2015 ◽

Author(s):

Punit Kanuga

Keyword(s):

Pattern Matching ◽

Matching Algorithm ◽

Pattern Matching Algorithm ◽

String Pattern

A simple pattern matching algorithm for weighted sequences

Proceedings of the 2012 ACM Research in Applied Computation Symposium on - RACS '12 ◽

10.1145/2401603.2401616 ◽

2012 ◽

Author(s):

Inbok Lee

Keyword(s):

Pattern Matching ◽

Simple Pattern ◽

Matching Algorithm ◽

Pattern Matching Algorithm ◽

Weighted Sequences

A FAST PATTERN-MATCHING ALGORITHM ON MODULAR MESH-CONNECTED COMPUTERS WITH MULTIPLE BUSES

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001495000195 ◽

1995 ◽

Vol 09 (02) ◽

pp. 411-419

Author(s):

KUO-LIANG CHUNG

Keyword(s):

Parallel Algorithm ◽

Pattern Matching ◽

Matching Algorithm ◽

Pattern Matching Algorithm ◽

Multiple Buses

Given a pattern of length m and a text of length n, commonly m ≪ n, this paper presents a randomized parallel algorithm for pattern matching in O(n^{1/10}) (= O(n^{1/10} + (n − m)^{1/10})) time on a newly proposed n^{3/5} × n^{2/5} modular mesh-connected computer with multiple buses. Furthermore, the time bound of our parallel algorithm can be reduced to O(n^{1/11}) if fewer processors are used.

EPMA: Efficient pattern matching algorithm for DNA sequences

Expert Systems with Applications ◽

10.1016/j.eswa.2017.03.026 ◽

2017 ◽

Vol 80 ◽

pp. 162-170 ◽

Cited By ~ 6

Author(s):

Muhammad Tahir ◽

Muhammad Sardaraz ◽

Ataul Aziz Ikram

Keyword(s):

Pattern Matching ◽

Dna Sequences ◽

Matching Algorithm ◽

Pattern Matching Algorithm

A Pattern Matching Algorithm for Double-Type Characters

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.571-572.461 ◽

2014 ◽

Vol 571-572 ◽

pp. 461-464

Author(s):

Hai Yan Zhou

Keyword(s):

Pattern Matching ◽

Chinese Characters ◽

Cross Link ◽

Data Link ◽

Matching Algorithm ◽

Order Of Magnitude ◽

Link Data ◽

Efficient Matching ◽

Link Table ◽

Double Cross

A fast and efficient matching algorithm is proposed for multi-pattern matching of double-byte strings, for example Chinese characters, which differs substantially from single-byte string matching. The algorithm uses a double cross-link data list and two finite prefix automata to match a double-byte character, so as to solve the storage expansion problem that a double-byte cross data link table causes. The method requires less storage than a double-byte cross data link table, and its efficiency is of the same order of magnitude as that of a single-byte cross-link table approach.
