About Profit Sharing Considering Infatuate Actions

Author(s):  
Wataru Uemura ◽  

In reinforcement learning systems based on trial-and error, the agent, that is the subject or the system that perceives its environment and takes actions which maximize its chances of success, is rewarded when it attains the target level of learning of the learning exercise. In Profit Sharing, the reinforcement learning process is pursued for the accumulation of such rewards. In order to continue the process of reward accumulation, the agent insists upon the repetition of the particular actions that are being learned and avoids selecting other actions, making the agent less adaptable to changes in the environment. In view of the above, this paper attempts to propose the introduction of the concept of infatuation to eliminate the reluctance of the agent to adapt to new environments. If the agent is a living being, when a single particular reinforcement learning process is repeated, the stimulus the agent perceives in each of the processes gradually loses its intensity due to familiarization. However, if the agent encounters a set of rules that are different from those of the particular repeated learning process, then the agent reverts to the previous particular learning process, and the stimulus the agent receives after the said reversion recovers its intensity. The intention here is to apply the concept of assimilation infatuation to Profit Sharing, and to confirm its effects through experiments.

2020 ◽  
Author(s):  
Felipe Leno Da Silva ◽  
Anna Helena Reali Costa

Reinforcement Learning (RL) is a powerful tool that has been used to solve increasingly complex tasks. RL operates through repeated interactions of the learning agent with the environment, via trial and error. However, this learning process is extremely slow, requiring many interactions. In this thesis, we leverage previous knowledge so as to accelerate learning in multiagent RL problems. We propose knowledge reuse both from previous tasks and from other agents. Several flexible methods are introduced so that each of these two types of knowledge reuse is possible. This thesis adds important steps towards more flexible and broadly applicable multiagent transfer learning methods.


2021 ◽  
Vol 11 (1) ◽  
pp. 6637-6644
Author(s):  
H. El Fazazi ◽  
M. Elgarej ◽  
M. Qbadou ◽  
K. Mansouri

Adaptive e-learning systems are created to facilitate the learning process. These systems are able to suggest the student the most suitable pedagogical strategy and to extract the information and characteristics of the learners. A multi-agent system is a collection of organized and independent agents that communicate with each other to resolve a problem or complete a well-defined objective. These agents are always in communication and they can be homogeneous or heterogeneous and may or may not have common objectives. The application of the multi-agent approach in adaptive e-learning systems can enhance the learning process quality by customizing the contents to students’ needs. The agents in these systems collaborate to provide a personalized learning experience. In this paper, a design of an adaptative e-learning system based on a multi-agent approach and reinforcement learning is presented. The main objective of this system is the recommendation to the students of a learning path that meets their characteristics and preferences using the Q-learning algorithm. The proposed system is focused on three principal characteristics, the learning style according to the Felder-Silverman learning style model, the knowledge level, and the student's possible disabilities. Three types of disabilities were taken into account, namely hearing impairments, visual impairments, and dyslexia. The system will be able to provide the students with a sequence of learning objects that matches their profiles for a personalized learning experience.


Author(s):  
Kazuteru Miyazaki

Exploitation-oriented Learning XoL is a new framework of reinforcement learning. XoL aims to learn a rational policy whose expected reward per an action is larger than zero, and does not require a sophisticated design of the value of a reward signal. In this chapter, as examples of learning systems that belongs in XoL, we introduce the rationality theorem of profit Sharing (PS), the rationality theorem of reward sharing in multi-agent PS, and PS-r*. XoL has several features. (1) Though traditional RL systems require appropriate reward and penalty values, XoL only requires an order of importance among them. (2) XoL can learn more quickly since it traces successful experiences very strongly. (3) XoL may be unsuitable for pursuing an optimal policy. The optimal policy can be acquired by the multi-start method that needs to reset all memories to get a better policy. (4) XoL is effective on the classes beyond MDPs, since it is a Bellman-free method that does not depend on DP. We show several numerical examples to confirm these features.


2021 ◽  
Author(s):  
Milena Rmus ◽  
Amy Zou ◽  
Anne G.E. Collins

AbstractIn reinforcement learning (RL) experiments, participants learn to make rewarding choices in response to different stimuli; RL models use outcomes to estimate stimulus-response values which change incrementally. RL models consider any response type indiscriminately, ranging from less abstract choices (e.g. pressing a key with the index finger), to more abstract choices that can be executed in a number of ways (e.g. getting dinner at the restaurant). But does the learning process vary as a function of how abstract the choices are? In Experiment 1, we show that choice abstraction impacts learning: participants were slower and less accurate in learning to select a more abstract choice. Using computational modeling, we show that two mechanisms contribute to this. First, the values of motor actions interfered with the values of more abstract responses, resulting in more incorrect choices; second, information integration for relevant abstract choices was slower. In Experiment 2, we replicate the findings from Experiment 1, and further extend the results by investigating whether slowed learning is attributable to working memory (WM) or RL contributions. We find that the impairment in more abstract/flexible choices is driven primarily by a weaker contribution of WM. We conclude that defining a more abstract choice space used by multiple learning systems recruits limited executive resources, limiting how much such processes then contribute to fast learning.


Author(s):  
Putri Denaya Side Ayu ◽  
Heri Hidayatullah ◽  
Sri Ariani

This Collaborative Classroom Action Research aimed at enhancing students’ speaking skill through the application of Circle Game. It was conducted in one cycle consisting of two meetings. The subject was the seventh of C class of SMPN 2 Alas Barat consisting of 22 students. The types of data were qualitative (acquiring observation during the learning process) and quantitative (speaking tests). The results qualitatively showed that the implementation of Circle Game enhanced the students’ speaking skills. Such improvement could be seen from their enthusiasm, their interesting feeling in various materials presented by the teacher, their active involvement. In the quantitative findings, the result also showed a higher value of the mean score of the post-test (77.34) rather than the pre-test (75.75). In conclusion, the use of Circle Game can improve students’ speaking skill.


2018 ◽  
Vol 2 (2) ◽  
pp. 173
Author(s):  
Jaimah Jaimah

The background of this research is the low of mathematics learning result of grade 1 student of SDN 004Tembilahan Kota, for that done perbaiakan learning process by applying method of discussion. This researchwas conducted in SDN 004 Tembilahan Kota with the subject of research is the first grade students with 36students. This research is a classroom action research conducted two cycles. Based on the results of theresearch, it is found that the use of the method of disuksi can improve the students' learning outcomes aboutknowing and determining the length and weight with nonstandard units using concrete objects / concretesituation in the initial data, the total number of completed students is 12 students (33.3%) average 55.83. In thefirst cycle has increased the number of students who complete is 26 students (77.22%) with an average value of80.00. In the second cycle has increased with the total number of students who completed is 32 students (88.89)with an average value of 88.06.


2019 ◽  
Vol 3 (3) ◽  
pp. 660
Author(s):  
Ranirizal Ranirizal

Performance is the performance shown by educators, both in quality and quantity in carrying out their duties in accordance with the responsibilities given to them professionally. Educator performance development is a very decisive factor in the success of the education and learning process. In fact, in Kindergarten Rayon IV, Dumai City, there is still a low level of competency standards possessed by educators. The intended competency standard is from the standard academic qualifications and four competencies that must be possessed by a kindergarten educator, namely pedagogic, professional, social and personality competencies. This is evidenced by educators not yet mastering learning material with the maximum known when the learning process educators are not able to explain well the subject matter, and educators have not shown maximum performance in carrying out their duties and functions. The purpose of this study was to see whether there was an influence on teacher professionalism on teacher performance in Dumai IV Rayon Kindergarten. The results of the study prove that there is a significant relationship between the professionalism of Kindergarten educators and the performance of educators in Kindergarten Rayon IV, Dumai City. This is evidenced by the value of Sig (2-tailed) professionalism on educator's performance of 0,000, so the calculation shows 0,000 <0.05. This means that Ha is accepted, that is, there is a significant relationship between the professionalism of Kindergarten educators and the Performance of Educators in Kindergarten Rayon IV, Dumai City.


Author(s):  
DesiYusnanda Sari And Sri MindaMurni

This study attempted to improve the students’ reading comprehension achievement in descriptive text through Team Games Tournament (TGT) method. This study was conducted by using classroom action research. The subject of the research was class IX SMP PAB 10 Medan Estate which consisted of 25 students. The research was conducted in two cycles and the first cycle consisted of four meetings the second cycle consisted of two meetings. The instruments for collecting the data were quantitative data (reading evaluation) and qualitative data (diary note, observation sheet and interview sheet). Based on reading scores, students’ score kept improving in every evaluation. In the test I the mean was 65,33, in the test II the mean was 71,72 and the test III the mean was 84,54.Based on diary note, observation sheet and interview sheet, it was found that teaching-learning process ran well. Students were active, enthusiastic, and interested in reading. The result of the research showed that Team Games Tournament (TGT) method significantly improved students’ achievement in reading comprehension especially in reading descriptive text.


Author(s):  
Yohanna Situmeang And Busmin Gurning

This study was attempted to improve students’ achievement in writing narrativetext by watching movies. This study was conducted by using classroom action researchin two cycles. Cycle I consisted of four meetings while Cycle II consisted of twomeetings. The subject of the research was class VIII-A SMP SWASTA IMMANUELMEDAN. The number of the students was 34. To collect the data, the instruments werequantitative and qualitative data. The quantitative data was the scores of the studentsand the qualitative data was interview sheets, observation sheets, and diary notes. Basedon the writing score, students’ score were improving in every test. It can be seen inOrientation test, Test I in Cycle I, and Test II in Cycle II. In Orientation test, the meanscore was 37.94 then it was improved to 65.85 in Test I. In the end of cycle II, the meanscore for test II was again improved to 83.41. Based on interview sheets, observationsheets and diary notes, it was found that teaching-learning process ran well. In line withthe data, it showed that the application of watching movies significantly improvedstudent’s achievement in writing narrative text. English teachers were suggested to givemovies to be watched as guidance on students’ narrative writing in the class.


Author(s):  
Sri Wuri Handayani And Masitowarni Siregar

This study attempts to Improving Students’ Achievement Through the Process GenreApproach. The objective of the study is to discover whether the use of the ProcessGenre approach could significantly improve students’ writing achievement. This studyconduct by applying Classroom Action Research which was carried out in two cycles inten meetings. The subject of this study of SMA Panca Budi Medan which consisted of39 students of class X – 2 reguler.. Orientation test was done in the first meeting, test IIwas done in the fourth meeting, and test III was done in the last meeting. The mean ofstudents’ score test III (76,242) was higher than the test II (66,878) and test I (57,606).Besides, the qualitative data was taken from observation sheet, interview, questionnaire,and diary notes. It was found that teaching learning process ran well. By applying theProcess Genre Approach in the classroom, the students understood the content ofdescriptive text easily. They also were active and enthusiastic during the teachinglearning process. The result of the research showed that the application of the ProcessGenre approach significantly improved students’ achievement in writing descriptive text


Sign in / Sign up

Export Citation Format

Share Document