Beliefs We Can Believe in: Replacing Assumptions with Data in Real-Time Search

Maximilian Fickert; Tianyi Gu; Leonhard Staut; Wheeler Ruml; Joerg Hoffmann; Marek Petrik

doi:10.1609/aaai.v34i06.6535

Beliefs We Can Believe in: Replacing Assumptions with Data in Real-Time Search

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i06.6535 ◽

2020 ◽

Vol 34 (06) ◽

pp. 9827-9834

Author(s):

Maximilian Fickert ◽

Tianyi Gu ◽

Leonhard Staut ◽

Wheeler Ruml ◽

Joerg Hoffmann ◽

...

Keyword(s):

Real Time ◽

Heuristic Search ◽

Search Algorithm ◽

Search Algorithms ◽

Experimental Results ◽

Data Driven ◽

Heuristic Search Algorithms ◽

Original Presentation ◽

Performance Statistics ◽

The Cost

Suboptimal heuristic search algorithms can benefit from reasoning about heuristic error, especially in a real-time setting where there is not enough time to search all the way to a goal. However, current reasoning methods implicitly or explicitly incorporate assumptions about the cost-to-go function. We consider a recent real-time search algorithm, called Nancy, that manipulates explicit beliefs about the cost-to-go. The original presentation of Nancy assumed that these beliefs are Gaussian, with parameters following a certain form. In this paper, we explore how to replace these assumptions with actual data. We develop a data-driven variant of Nancy, DDNancy, that bases its beliefs on heuristic performance statistics from the same domain. We extend Nancy and DDNancy with the notion of persistence and prove their completeness. Experimental results show that DDNancy can perform well in domains in which the original assumption-based Nancy performs poorly.

Download Full-text

Avoiding and Escaping Depressions in Real-Time Heuristic Search

Journal of Artificial Intelligence Research ◽

10.1613/jair.3590 ◽

2012 ◽

Vol 43 ◽

pp. 523-570 ◽

Cited By ~ 9

Author(s):

C. Hernandez ◽

J. A. Baier

Keyword(s):

Real Time ◽

Heuristic Search ◽

Search Algorithm ◽

Search Space ◽

Search Algorithms ◽

Search Spaces ◽

Heuristic Search Algorithms ◽

Order Of Magnitude ◽

Improved Performance ◽

Hard Real Time

Heuristics used for solving hard real-time search problems have regions with depressions. Such regions are bounded areas of the search space in which the heuristic function is inaccurate compared to the actual cost to reach a solution. Early real-time search algorithms, like LRTA*, easily become trapped in those regions since the heuristic values of their states may need to be updated multiple times, which results in costly solutions. State-of-the-art real-time search algorithms, like LSS-LRTA* or LRTA*(k), improve LRTA*'s mechanism to update the heuristic, resulting in improved performance. Those algorithms, however, do not guide search towards avoiding depressed regions. This paper presents depression avoidance, a simple real-time search principle to guide search towards avoiding states that have been marked as part of a heuristic depression. We propose two ways in which depression avoidance can be implemented: mark-and-avoid and move-to-border. We implement these strategies on top of LSS-LRTA* and RTAA*, producing 4 new real-time heuristic search algorithms: aLSS-LRTA*, daLSS-LRTA*, aRTAA*, and daRTAA*. When the objective is to find a single solution by running the real-time search algorithm once, we show that daLSS-LRTA* and daRTAA* outperform their predecessors sometimes by one order of magnitude. Of the four new algorithms, daRTAA* produces the best solutions given a fixed deadline on the average time allowed per planning episode. We prove all our algorithms have good theoretical properties: in finite search spaces, they find a solution if one exists, and converge to an optimal after a number of trials.

Download Full-text

A Unifying View on Individual Bounds and Heuristic Inaccuracies in Bidirectional Search

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i03.5611 ◽

2020 ◽

Vol 34 (03) ◽

pp. 2327-2334

Author(s):

Vidal Alcázar ◽

Pat Riddle ◽

Mike Barley

Keyword(s):

Lower Bound ◽

Heuristic Search ◽

Search Algorithm ◽

Substantial Improvement ◽

Full Potential ◽

The Past ◽

Heuristic Search Algorithms ◽

Bidirectional Search ◽

Order Of Magnitude ◽

The Cost

In the past few years, new very successful bidirectional heuristic search algorithms have been proposed. Their key novelty is a lower bound on the cost of a solution that includes information from the g values in both directions. Kaindl and Kainz (1997) proposed measuring how inaccurate a heuristic is while expanding nodes in the opposite direction, and using this information to raise the f value of the evaluated nodes. However, this comes with a set of disadvantages and remains yet to be exploited to its full potential. Additionally, Sadhukhan (2013) presented BAE∗, a bidirectional best-first search algorithm based on the accumulated heuristic inaccuracy along a path. However, no complete comparison in regards to other bidirectional algorithms has yet been done, neither theoretical nor empirical. In this paper we define individual bounds within the lower-bound framework and show how both Kaindl and Kainz's and Sadhukhan's methods can be generalized thus creating new bounds. This overcomes previous shortcomings and allows newer algorithms to benefit from these techniques as well. Experimental results show a substantial improvement, up to an order of magnitude in the number of necessarily-expanded nodes compared to state-of-the-art near-optimal algorithms in common benchmarks.

Download Full-text

Quantum dialogue protocol based on Grover’s search algorithms

Modern Physics Letters A ◽

10.1142/s0217732319501694 ◽

2019 ◽

Vol 34 (21) ◽

pp. 1950169

Author(s):

Aihan Yin ◽

Kemeng He ◽

Ping Fan

Keyword(s):

Heuristic Search ◽

Search Algorithm ◽

Search Algorithms ◽

Quantum Dialogue ◽

Quantum Search ◽

Decoy State ◽

Quantum Search Algorithm ◽

Heuristic Search Algorithms ◽

Security Information

Among many classic heuristic search algorithms, the Grover quantum search algorithm (QSA) can play a role of secondary acceleration. Based on the properties of the two-qubit Grover QSA, a quantum dialogue (QD) protocol is proposed. In addition, our protocol also utilizes the unitary operations and single-particle measurements. The transmitted quantum state (except for the decoy state used for detection) can transmit two-bits of security information simultaneously. Theoretical analysis shows that the proposed protocol has high security.

Download Full-text

Machine Learning Based Heuristic Search Algorithms to Solve Birds of a Feather Card Game

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33019656 ◽

2019 ◽

Vol 33 ◽

pp. 9656-9661

Author(s):

Bryon Kucharski ◽

Azad Deihim ◽

Mehmet Ergezer

Keyword(s):

Machine Learning ◽

Undergraduate Students ◽

Heuristic Search ◽

Heuristic Algorithms ◽

Search Algorithm ◽

Search Algorithms ◽

Perfect Information ◽

Monte Carlo Tree Search ◽

Heuristic Search Algorithms ◽

Heuristic Search Algorithm

This research was conducted by an interdisciplinary team of two undergraduate students and a faculty to explore solutions to the Birds of a Feather (BoF) Research Challenge. BoF is a newly-designed perfect-information solitaire-type game. The focus of the study was to design and implement different algorithms and evaluate their effectiveness. The team compared the provided depth-first search (DFS) to heuristic algorithms such as Monte Carlo tree search (MCTS), as well as a novel heuristic search algorithm guided by machine learning. Since all of the studied algorithms converge to a solution from a solvable deal, effectiveness of each approach was measured by how quickly a solution was reached, and how many nodes were traversed until a solution was reached. The employed methods have a potential to provide artificial intelligence enthusiasts with a better understanding of BoF and novel ways to solve perfect-information games and puzzles in general. The results indicate that the proposed heuristic search algorithms guided by machine learning provide a significant improvement in terms of number of nodes traversed over the provided DFS algorithm.

Download Full-text

Hardness measures for gridworld benchmarks and performance analysis of real-time heuristic search algorithms

Journal of Heuristics ◽

10.1007/s10732-008-9084-0 ◽

2008 ◽

Vol 16 (1) ◽

pp. 23-36

Author(s):

Masataka Mizusawa ◽

Masahito Kurihara

Keyword(s):

Performance Analysis ◽

Real Time ◽

Heuristic Search ◽

Search Algorithms ◽

Heuristic Search Algorithms ◽

And Performance

Download Full-text

Evolving Real-time Heuristic Search Algorithms

Proceedings of the Artificial Life Conference 2016 ◽

10.7551/978-0-262-33936-0-ch024 ◽

2016 ◽

Author(s):

Vadim Bulitko

Keyword(s):

Real Time ◽

Heuristic Search ◽

Search Algorithms ◽

Heuristic Search Algorithms

Download Full-text

Goal Probability Analysis in Probabilistic Planning: Exploring and Enhancing the State of the Art

Journal of Artificial Intelligence Research ◽

10.1613/jair.5153 ◽

2016 ◽

Vol 57 ◽

pp. 229-271 ◽

Cited By ~ 6

Author(s):

Marcel Steinmetz ◽

Jörg Hoffmann ◽

Olivier Buffet

Keyword(s):

Heuristic Search ◽

Resource Constraints ◽

State Of The Art ◽

Search Algorithm ◽

Search Algorithms ◽

The State ◽

Probabilistic Planning ◽

Large Space ◽

Heuristic Search Algorithms ◽

Wide Range

Unavoidable dead-ends are common in many probabilistic planning problems, e.g. when actions may fail or when operating under resource constraints. An important objective in such settings is MaxProb, determining the maximal probability with which the goal can be reached, and a policy achieving that probability. Yet algorithms for MaxProb probabilistic planning are severely underexplored, to the extent that there is scant evidence of what the empirical state of the art actually is. We close this gap with a comprehensive empirical analysis. We design and explore a large space of heuristic search algorithms, systematizing known algorithms and contributing several new algorithm variants. We consider MaxProb, as well as weaker objectives that we baptize AtLeastProb (requiring to achieve a given goal probabilty threshold) and ApproxProb (requiring to compute the maximum goal probability up to a given accuracy). We explore both the general case where there may be 0-reward cycles, and the practically relevant special case of acyclic planning, such as planning with a limited action-cost budget. We design suitable termination criteria, search algorithm variants, dead-end pruning methods using classical planning heuristics, and node selection strategies. We design a benchmark suite comprising more than 1000 instances adapted from the IPPC, resource-constrained planning, and simulated penetration testing. Our evaluation clarifies the state of the art, characterizes the behavior of a wide range of heuristic search algorithms, and demonstrates significant benefits of our new algorithm variants.

Download Full-text

Exploiting Obstacle Geometry to Reduce Search Time in Grid-Based Pathfinding

Symmetry ◽

10.3390/sym12071186 ◽

2020 ◽

Vol 12 (7) ◽

pp. 1186

Author(s):

Fahed Jubair ◽

Mohammed Hawa

Keyword(s):

Heuristic Search ◽

Search Algorithm ◽

Search Time ◽

Search Tree ◽

Search Algorithms ◽

Binary Search ◽

Binary Search Tree ◽

Grid Map ◽

Heuristic Search Algorithms ◽

The Impact

Pathfinding is the problem of finding the shortest path between a pair of nodes in a graph. In the context of uniform-cost undirected grid maps, heuristic search algorithms, such as A ★ and weighted A ★ ( W A ★ ), have been dominantly used for pathfinding. However, the lack of knowledge about obstacle shapes in a gird map often leads heuristic search algorithms to unnecessarily explore areas where a viable path is not available. We refer to such areas in a grid map as blocked areas (BAs). This paper introduces a preprocessing algorithm that analyzes the geometry of obstacles in a grid map and stores knowledge about blocked areas in a memory-efficient balanced binary search tree data structure. During actual pathfinding, a search algorithm accesses the binary search tree to identify blocked areas in a grid map and therefore avoid exploring them. As a result, the search time is significantly reduced. The scope of the paper covers maps in which obstacles are represented as horizontal and vertical line-segments. The impact of using the blocked area knowledge during pathfinding in A ★ and W A ★ is evaluated using publicly available benchmark set, consisting of sixty grid maps of mazes and rooms. In mazes, the search time for both A ★ and W A ★ is reduced by 28 % , on average. In rooms, the search time for both A ★ and W A ★ is reduced by 30 % , on average. This is achieved while preserving the search optimality of A ★ and the search sub-optimality of W A ★ .

Download Full-text

Case-Based Subgoaling in Real-Time Heuristic Search for Video Game Pathfinding

Journal of Artificial Intelligence Research ◽

10.1613/jair.3076 ◽

2010 ◽

Vol 39 ◽

pp. 269-300 ◽

Cited By ~ 11

Author(s):

V. Bulitko ◽

Y. Björnsson ◽

R. Lawrence

Keyword(s):

Video Games ◽

Real Time ◽

Video Game ◽

Heuristic Search ◽

Scale Up ◽

Computation Time ◽

Search Algorithms ◽

Problem Size ◽

Solution Quality ◽

Heuristic Search Algorithms

Real-time heuristic search algorithms satisfy a constant bound on the amount of planning per action, independent of problem size. As a result, they scale up well as problems become larger. This property would make them well suited for video games where Artificial Intelligence controlled agents must react quickly to user commands and to other agents' actions. On the downside, real-time search algorithms employ learning methods that frequently lead to poor solution quality and cause the agent to appear irrational by re-visiting the same problem states repeatedly. The situation changed recently with a new algorithm, D LRTA*, which attempted to eliminate learning by automatically selecting subgoals. D LRTA* is well poised for video games, except it has a complex and memory-demanding pre-computation phase during which it builds a database of subgoals. In this paper, we propose a simpler and more memory-efficient way of pre-computing subgoals thereby eliminating the main obstacle to applying state-of-the-art real-time search methods in video games. The new algorithm solves a number of randomly chosen problems off-line, compresses the solutions into a series of subgoals and stores them in a database. When presented with a novel problem on-line, it queries the database for the most similar previously solved case and uses its subgoals to solve the problem. In the domain of pathfinding on four large video game maps, the new algorithm delivers solutions eight times better while using 57 times less memory and requiring 14% less pre-computation time.

Download Full-text

Heuristic Search Algorithms for Constructing Optimal Latin Hypercube Designs

Recent Advances in Information and Communication Technology 2016 - Advances in Intelligent Systems and Computing ◽

10.1007/978-3-319-40415-8_18 ◽

2016 ◽

pp. 183-193 ◽

Cited By ~ 1

Author(s):

Anamai Na-udom ◽

Jaratsri Rungrattanaubol

Keyword(s):

Heuristic Search ◽

Search Algorithms ◽

Latin Hypercube ◽

Heuristic Search Algorithms ◽

Latin Hypercube Designs

Download Full-text