Safe Intermittent Reinforcement Learning With Static and Dynamic Event Generators

2020 ◽  
Vol 31 (12) ◽  
pp. 5441-5455 ◽  
Author(s):  
Yongliang Yang ◽  
Kyriakos G. Vamvoudakis ◽  
Hamidreza Modares ◽  
Yixin Yin ◽  
Donald C. Wunsch
Author(s):  
Yongliang Yang ◽  
Kyriakos G. Vamvoudakis ◽  
Hamidreza Modares ◽  
Wei He ◽  
Yixin Yin ◽  
...  

2001 ◽  
Vol 7 (S2) ◽  
pp. 1012-1013
Author(s):  
Uyen Tram ◽  
William Sullivan

Embryonic development is a dynamic event and is best studied in live animals in real time. Much of our knowledge of the early events of embryogenesis, however, comes from immunofluourescent analysis of fixed embryos. While these studies provide an enormous amount of information about the organization of different structures during development, they can give only a static glimpse of a very dynamic event. More recently real-time fluorescent studies of living embryos have become much more routine and have given new insights to how different structures and organelles (chromosomes, centrosomes, cytoskeleton, etc.) are coordinately regulated. This is in large part due to the development of commercially available fluorescent probes, GFP technology, and newly developed sensitive fluorescent microscopes. For example, live confocal fluorescent analysis proved essential in determining the primary defect in mutations that disrupt early nuclear divisions in Drosophila melanogaster. For organisms in which GPF transgenics is not available, fluorescent probes that label DNA, microtubules, and actin are available for microinjection.


1985 ◽  
Vol 28 (4) ◽  
pp. 539-547 ◽  
Author(s):  
Michael A. Primus ◽  
Gary Thompson

An operant conditioning discrimination paradigm was evaluated in terms of relationships between response behavior of young children and two stimulus components of the paradigm, the discriminative stimulus (DS) and the reinforcing stimulus (RS). Experiment I measured response performance in normal 1-year-old subjects as a function of differences in intensity and/or complexity among three DSs. Results showed no significant differences in conditioning rate, habituation, or consistency of the conditioned response relative to variable properties of the DS. Experiment II examined response performance of normal 2-year-old children as a function of two modifications in the RS, reinforcement schedule and reinforcement novelty. Subjects reinforced on a variable-ratio schedule of intermittent reinforcement and subjects reinforced on a 100% schedule demonstrated equivalent response habituation and consistency. In the second part of the experiment, subjects receiving novel RSs showed significantly greater response recovery than subjects reinforced with familiar RSs. Comparison of normal 1- and 2-year-old children revealed similar rates of conditioning and response consistency. However, 2-year-olds habituated more rapidly than 1-year-olds.


Decision ◽  
2016 ◽  
Vol 3 (2) ◽  
pp. 115-131 ◽  
Author(s):  
Helen Steingroever ◽  
Ruud Wetzels ◽  
Eric-Jan Wagenmakers

Sign in / Sign up

Export Citation Format

Share Document