scholarly journals Chemotaxis: Active Degradation of Attractant Enables Optimal Maze Navigation

2020 ◽  
Vol 30 (23) ◽  
pp. R1436-R1438
Author(s):  
Cornelis J. Weijer
Keyword(s):  
2021 ◽  
pp. 1-34
Author(s):  
Joost Huizinga ◽  
Jeff Clune

Abstract An important challenge in reinforcement learning is to solve multimodal problems, where agents have to act in qualitatively different ways depending on the circumstances. Because multimodal problems are often too difficult to solve directly, it is often helpful to define a curriculum, which is an ordered set of sub-tasks that can serve as the stepping stones for solving the overall problem. Unfortunately, choosing an effective ordering for these subtasks is difficult, and a poor ordering can reduce the performance of the learning process. Here, we provide a thorough introduction and investigation of the Combinatorial Multi-Objective Evolutionary Algorithm (CMOEA), which allows all combinations of subtasks to be explored simultaneously. We compare CMOEA against three algorithms that can similarly optimize on multiple subtasks simultaneously: NSGA-II, NSGA-III and ϵ-Lexicase Selection. The algorithms are tested on a function-optimization problem with two subtasks, a simulated multimodal robot locomotion problem with six subtasks and a simulated robot maze navigation problem where a hundred random mazes are treated as subtasks. On these problems, CMOEA either outperforms or is competitive with the controls. As a separate contribution, we show that adding a linear combination over all objectives can improve the ability of the control algorithms to solve these multimodal problems. Lastly, we show that CMOEA can leverage auxiliary objectives more effectively than the controls on the multimodal locomotion task. In general, our experiments suggest that CMOEA is a promising algorithm for solving multimodal problems.


2008 ◽  
Vol 21 (1) ◽  
pp. 18-25 ◽  
Author(s):  
Brian R. Ott ◽  
Elena K. Festa ◽  
Melissa M. Amick ◽  
Janet Grace ◽  
Jennifer D. Davis ◽  
...  

Author(s):  
Ho Ching ◽  
Wayne J. Book

In a conventional bilateral teleoperation, transmission delay over the Internet can potentially cause instability. The wave variables algorithm can solve this problem at the cost of poor transient response. The wave variables algorithm with adaptive predictor and drift control based on our previous work [24] has been proposed to provide stability under time delay with improved performance. The effectiveness of this algorithm is fully evaluated using human subjects with no previous experience in haptics. Three algorithms are tested using Phantom haptic devices as master and slave: conventional bilateral teleoperation with no transmission delay as control, wave variables with 200-300 ms transmission delay one way, and wave variables with adaptive predictor and direct drift control (WAPD) also with 200-300 ms delay one way. For each algorithm the human subjects are asked to perform three simple tasks: free space trajectory tracking, surface contour identification, and maze navigation. The results show WAPD to be superior to regular wave variables algorithm with higher subject ratings.


2005 ◽  
Vol 158 (1) ◽  
pp. 175-182 ◽  
Author(s):  
Nikolai V. Lukoyanov ◽  
Elena A. Lukoyanova ◽  
José P. Andrade ◽  
Manuel M. Paula-Barbosa

2014 ◽  
Vol 25 (9) ◽  
pp. 3122-3131 ◽  
Author(s):  
Ana M. Daugherty ◽  
Peng Yuan ◽  
Cheryl L. Dahle ◽  
Andrew R. Bender ◽  
Yiqin Yang ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document