Matching Regular Expressions on uncertain data

Algorithmica ◽

10.1007/s00453-021-00906-8 ◽

2022 ◽

Author(s):

José Arturo Gil ◽

Simone Santini

Keyword(s):

Shortest Path ◽

Regular Expression ◽

Uncertain Data ◽

Regular Expressions ◽

Shortest Path Algorithm ◽

Regular Expression Matching

AbstractIn this paper we study regular expression matching in cases in which the identity of the symbols received is subject to uncertainty. We develop a model of symbol emission and uses a modification of the shortest path algorithm to find optimal matches on the Cartesian Graph of an expression provided that the input is a finite list. In the case of infinite streams, we show that the problem is in general undecidable but, if each symbols is received with probability 0 infinitely often, then with probability 1 the problem is decidable.

Download Full-text

Software Toolchain for Large-Scale RE-NFA Construction on FPGA

International Journal of Reconfigurable Computing ◽

10.1155/2009/301512 ◽

2009 ◽

Vol 2009 ◽

pp. 1-10 ◽

Cited By ~ 3

Author(s):

Yi-Hua E. Yang ◽

Viktor K. Prasanna

Keyword(s):

High Performance ◽

Large Scale ◽

Regular Expression ◽

Finite Automata ◽

Fixed Number ◽

Regular Expressions ◽

Pattern Complexity ◽

Regular Expression Matching ◽

Area Increase ◽

Prototype Software

We present a software toolchain for constructing large-scaleregular expression matching(REM) on FPGA. The software automates the conversion of regular expressions into compact and high-performance nondeterministic finite automata (RE-NFA). Each RE-NFA is described as an RTL regular expression matching engine (REME) in VHDL for FPGA implementation. Assuming a fixed number of fan-out transitions per state, ann-statem-bytes-per-cycle RE-NFA can be constructed inO(n×m)time andO(n×m)memory by our software. A large number of RE-NFAs are placed onto a two-dimensionalstaged pipeline, allowing scalability to thousands of RE-NFAs with linear area increase and little clock rate penalty due to scaling. On a PC with a 2 GHz Athlon64 processor and 2 GB memory, our prototype software constructs hundreds of RE-NFAs used by Snort in less than 10 seconds. We also designed a benchmark generator which can produce RE-NFAs with configurable pattern complexity parameters, including state count, state fan-in, loop-back and feed-forward distances. Several regular expressions with various complexities are used to test the performance of our RE-NFA construction software.

Download Full-text

Designing efficient algorithms for querying large corpora

Oslo Studies in Language ◽

10.5617/osla.8504 ◽

2021 ◽

Vol 11 (2) ◽

pp. 283-302

Author(s):

Paul Meurer

Keyword(s):

Regular Expression ◽

Linear Time ◽

Suffix Array ◽

Efficient Algorithms ◽

Regular Expressions ◽

Efficient Treatment ◽

Suffix Arrays ◽

Regular Expression Matching ◽

Finite State ◽

Query System

I describe several new efficient algorithms for querying large annotated corpora. The search algorithms as they are implemented in several popular corpus search engines are less than optimal in two respects: regular expression string matching in the lexicon is done in linear time, and regular expressions over corpus positions are evaluated starting in those corpus positions that match the constraints of the initial edges of the corresponding network. To address these shortcomings, I have developed an algorithm for regular expression matching on suffix arrays that allows fast lexicon lookup, and a technique for running finite state automata from edges with lowest corpus counts. The implementation of the lexicon as suffix array also lends itself to an elegant and efficient treatment of multi-valued and set-valued attributes. The described techniques have been implemented in a fully functional corpus management system and are also used in a treebank query system.

Download Full-text

Proof-directed program transformation: A functional account of efficient regular expression matching

Journal of Functional Programming ◽

10.1017/s0956796820000295 ◽

2021 ◽

Vol 31 ◽

Author(s):

ANDRZEJ FILINSKI

Keyword(s):

Program Transformation ◽

Formal Language ◽

Regular Expression ◽

State Machine ◽

Automata Theory ◽

Regular Expressions ◽

Transformation Techniques ◽

Standard Specification ◽

Correctness Proofs ◽

Regular Expression Matching

Abstract We show how to systematically derive an efficient regular expression (regex) matcher using a variety of program transformation techniques, but very little specialized formal language and automata theory. Starting from the standard specification of the set-theoretic semantics of regular expressions, we proceed via a continuation-based backtracking matcher, to a classical, table-driven state machine. All steps of the development are supported by self-contained (and machine-verified) equational correctness proofs.

Download Full-text

One dynamic shortest path algorithm in a traffic network based on a genetic algorithm

Advances in Civil, Transportation and Environmental Engineering ◽

10.2495/ctee120281 ◽

2013 ◽

Author(s):

Shuijian Zhang

Keyword(s):

Genetic Algorithm ◽

Shortest Path ◽

Traffic Network ◽

Shortest Path Algorithm

Download Full-text

ynamic traffic model under emergency incident and bidirectional dynamic shortest path algorithm

Journal of Computer Applications ◽

10.3724/sp.j.1087.2008.02955 ◽

2009 ◽

Vol 28 (11) ◽

pp. 2955-2957

Author(s):

Zi-hui REN ◽

Jian WANG

Keyword(s):

Shortest Path ◽

Traffic Model ◽

Shortest Path Algorithm

Download Full-text

Routing Military Aircraft with a Constrained Shortest-Path Algorithm

10.21236/ada486703 ◽

2007 ◽

Cited By ~ 3

Author(s):

W. M. Carlyle ◽

Johannes O. Royset ◽

R. K. Wood

Keyword(s):

Shortest Path ◽

Military Aircraft ◽

Shortest Path Algorithm ◽

Constrained Shortest Path

Download Full-text

NFA split architecture for fast regular expression matching

Proceedings of the 6th ACM/IEEE Symposium on Architectures for Networking and Communications Systems - ANCS '10 ◽

10.1145/1872007.1872024 ◽

2010 ◽

Cited By ~ 1

Author(s):

Jan Kořenek ◽

Vlastimil Košař

Keyword(s):

Regular Expression ◽

Regular Expression Matching ◽

Split Architecture

Download Full-text

Shortest path algorithm of a network via picture fuzzy digraphs and its application

Materials Today Proceedings ◽

10.1016/j.matpr.2020.12.006 ◽

2021 ◽

Author(s):

Parimala Mani ◽

Biju Vasudevan ◽

Murali Sivaraman

Keyword(s):

Shortest Path ◽

Shortest Path Algorithm

Download Full-text

Utilizing Restricted Direction Strategy and Binary Heap Technology to Optimize Dijkstra Algorithm in WebGIS

Key Engineering Materials ◽

10.4028/www.scientific.net/kem.419-420.557 ◽

2009 ◽

Vol 419-420 ◽

pp. 557-560 ◽

Cited By ~ 2

Author(s):

Rui Li

Keyword(s):

Shortest Path ◽

Operating Efficiency ◽

Dijkstra Algorithm ◽

Shortest Path Algorithm ◽

Research Focus ◽

Network Information ◽

The Core ◽

Highway Network ◽

Core Issue ◽

Storage Structures

Shortest path is the core issue in application of WebGIS. Improving the efficiency of the algorithm is an urgent requirement to be resolved at present. By the lossy algorithm analyzing, which is the current research focus of the shortest path algorithm to optimize, utilizing adjacency table of storage structures, restricted direction strategy and binary heap technology to optimize the algorithm, thereby reduce the scale of algorithm to improve the operating efficiency of algorithm. This scheme has been applied in the simulation of the data downloaded from the Guangdong Provincial Highway Network Information System and satisfactory results have been obtained.

Download Full-text

Study on the shortest path algorithm based on fluid neural network of in-vehicle traffic flow guidance system

Proceedings of the IEEE International Vehicle Electronics Conference (IVEC'99) (Cat. No.99EX257) ◽

10.1109/ivec.1999.830636 ◽

2003 ◽

Cited By ~ 1

Author(s):

Wen Huimin ◽

Yang Zhaosheng

Keyword(s):

Neural Network ◽

Traffic Flow ◽

Shortest Path ◽

Guidance System ◽

Shortest Path Algorithm ◽

Vehicle Traffic

Download Full-text