scholarly journals APPLICATION OF LEARNING REINFORCEMENT METHOD IN ROBOTIZED AND AUTOMATED FORESTRY SYSTEMS

2020 ◽  
Vol 10 (1) ◽  
pp. 256-265
Author(s):  
Andrey Tolstyh ◽  
D Stupnikov ◽  
Sergey Malyukov ◽  
Aleksandr Luk'yanov ◽  
Yuriy Lunev

Abstract Currently, most large enterprises are actively using industrial robots and other automated solutions. This allows a significant increase in productivity and quality of work performed. This article gave a brief overview of modern industrial robots, their operating principle, basic components and systems. A reinforcement learning algorithm was developed and tested. The task of constructing a learning algorithm with reinforcement was divided into two stages: modeling the environment and description and optimization of the cost function. Since industrial robotic systems operate in the real world, the environment model should reflect basic physical laws. Therefore, the pyBullet library of the physical environment was chosen as the physical environment for testing. After modeling the manipulator in the selected physical medium, it was given the trivial task of touching a given object with the capture of the manipulator. An artificial neural network was used as an agent interacting with the environment. The inputs were the coordinates of the object and the existing angles of rotation of the articulated joints of the robot. Outputs - angle of rotation of joints at this step. This network was trained using the back propagation method, Adam modification. The system was trained for about 12 hours. Success is achieved in 95% of cases when testing the stability of the system (random position of the cylinder). In future, it is planned to test the obtained models on bench samples

2022 ◽  
Vol 12 (2) ◽  
pp. 682
Author(s):  
Yuzhan Wu ◽  
Chenlong Li ◽  
Changshun Yuan ◽  
Meng Li ◽  
Hao Li

Tracking control of Small Unmanned Ground Vehicles (SUGVs) is easily affected by the nonlinearity and time-varying characteristics. An improved predictive control scheme based on the multi-dimensional Taylor network (MTN) is proposed for tracking control of SUGVs. First, a MTN model is used as a predictive model to construct a SUGV model and back propagation (BP) is taken as its learning algorithm. Second, the predictive control law is designed and the traditional objective function is improved to obtain a predictive objective function with a differential term. The optimal control quantity is given in real time through iterative optimization. Meanwhile, the stability of the closed-loop system is proved by the Lyapunov stability theorem. Finally, a tracking control experiment on the SUGV model is used to verify the effectiveness of the proposed scheme. For comparison, traditional MTN and Radial Basis Function (RBF) predictive control schemes are introduced. Moreover, a noise disturbance is considered. Experimental results show that the proposed scheme is effective, which ensures that the vehicle can quickly and accurately track the desired yaw velocity signal with good real-time, robustness, and convergence performance, and is superior to other comparison schemes.


2020 ◽  
Vol 26 (7) ◽  
pp. 1469-1495
Author(s):  
A.L. Sabinina ◽  
V.V. Sokolovskii ◽  
N.A. Shul'zhenko ◽  
N.A. Sychova

Subject. The article describes the findings of the authors of fundamental strategic decisions on the formation of multifunctional urban complexes, using the housing demand and supply criterion. Objectives. We undertake a comprehensive study aimed at perfecting the methodology for evaluating the options for city infrastructure development at two stages, i.e. strategic, when general targets of feasible commissioning are determined, and current, when parameters of demand for facilities are taken into account. Methods. The study employs methods of expert survey, statistical data processing, predictive and investigative analysis. Results. We explored factors of creating amenities and comfort in residential construction areas, developed an algorithm to calculate the volume of new living space commissioning on the basis of evaluating demands in the Smart City paradigm. Conclusions. The study shows the cost increase depending on the built-up area, number of floors, and the balance between the type of capacity and the number of residents in the quarter (linear relationship).


2019 ◽  
Author(s):  
Andrew Medford ◽  
Shengchun Yang ◽  
Fuzhu Liu

Understanding the interaction of multiple types of adsorbate molecules on solid surfaces is crucial to establishing the stability of catalysts under various chemical environments. Computational studies on the high coverage and mixed coverages of reaction intermediates are still challenging, especially for transition-metal compounds. In this work, we present a framework to predict differential adsorption energies and identify low-energy structures under high- and mixed-adsorbate coverages on oxide materials. The approach uses Gaussian process machine-learning models with quantified uncertainty in conjunction with an iterative training algorithm to actively identify the training set. The framework is demonstrated for the mixed adsorption of CH<sub>x</sub>, NH<sub>x</sub> and OH<sub>x</sub> species on the oxygen vacancy and pristine rutile TiO<sub>2</sub>(110) surface sites. The results indicate that the proposed algorithm is highly efficient at identifying the most valuable training data, and is able to predict differential adsorption energies with a mean absolute error of ~0.3 eV based on <25% of the total DFT data. The algorithm is also used to identify 76% of the low-energy structures based on <30% of the total DFT data, enabling construction of surface phase diagrams that account for high and mixed coverage as a function of the chemical potential of C, H, O, and N. Furthermore, the computational scaling indicates the algorithm scales nearly linearly (N<sup>1.12</sup>) as the number of adsorbates increases. This framework can be directly extended to metals, metal oxides, and other materials, providing a practical route toward the investigation of the behavior of catalysts under high-coverage conditions.


2020 ◽  
Vol 12 (7) ◽  
pp. 2767 ◽  
Author(s):  
Víctor Yepes ◽  
José V. Martí ◽  
José García

The optimization of the cost and CO 2 emissions in earth-retaining walls is of relevance, since these structures are often used in civil engineering. The optimization of costs is essential for the competitiveness of the construction company, and the optimization of emissions is relevant in the environmental impact of construction. To address the optimization, black hole metaheuristics were used, along with a discretization mechanism based on min–max normalization. The stability of the algorithm was evaluated with respect to the solutions obtained; the steel and concrete values obtained in both optimizations were analyzed. Additionally, the geometric variables of the structure were compared. Finally, the results obtained were compared with another algorithm that solved the problem. The results show that there is a trade-off between the use of steel and concrete. The solutions that minimize CO 2 emissions prefer the use of concrete instead of those that optimize the cost. On the other hand, when comparing the geometric variables, it is seen that most remain similar in both optimizations except for the distance between buttresses. When comparing with another algorithm, the results show a good performance in optimization using the black hole algorithm.


2021 ◽  
Vol 11 (6) ◽  
pp. 803
Author(s):  
Jie Chai ◽  
Xiaogang Ruan ◽  
Jing Huang

Neurophysiological studies have shown that the hippocampus, striatum, and prefrontal cortex play different roles in animal navigation, but it is still less clear how these structures work together. In this paper, we establish a navigation learning model based on the hippocampal–striatal circuit (NLM-HS), which provides a possible explanation for the navigation mechanism in the animal brain. The hippocampal model generates a cognitive map of the environment and performs goal-directed navigation by using a place cell sequence planning algorithm. The striatal model performs reward-related habitual navigation by using the classic temporal difference learning algorithm. Since the two models may produce inconsistent behavioral decisions, the prefrontal cortex model chooses the most appropriate strategies by using a strategy arbitration mechanism. The cognitive and learning mechanism of the NLM-HS works in two stages of exploration and navigation. First, the agent uses a hippocampal model to construct the cognitive map of the unknown environment. Then, the agent uses the strategy arbitration mechanism in the prefrontal cortex model to directly decide which strategy to choose. To test the validity of the NLM-HS, the classical Tolman detour experiment was reproduced. The results show that the NLM-HS not only makes agents show environmental cognition and navigation behavior similar to animals, but also makes behavioral decisions faster and achieves better adaptivity than hippocampal or striatal models alone.


2021 ◽  
Vol 2 (2) ◽  
pp. 325-334
Author(s):  
Neda Javadi ◽  
Hamed Khodadadi Tirkolaei ◽  
Nasser Hamdan ◽  
Edward Kavazanjian

The stability (longevity of activity) of three crude urease extracts was evaluated in a laboratory study as part of an effort to reduce the cost of urease for applications that do not require high purity enzyme. A low-cost, stable source of urease will greatly facilitate engineering applications of urease such as biocementation of soil. Inexpensive crude extracts of urease have been shown to be effective at hydrolyzing urea for carbonate precipitation. However, some studies have suggested that the activity of a crude extract may decrease with time, limiting the potential for its mass production for commercial applications. The stability of crude urease extracts shown to be effective for biocementation was studied. The crude extracts were obtained from jack beans via a simple extraction process, stored at room temperature and at 4 ℃, and periodically tested to evaluate their stability. To facilitate storage and transportation of the extracted enzyme, the longevity of the enzyme following freeze drying (lyophilization) to reduce the crude extract to a powder and subsequent re-hydration into an aqueous solution was evaluated. In an attempt to improve the shelf life of the lyophilized extract, dextran and sucrose were added during lyophilization. The stability of purified commercial urease following rehydration was also investigated. Results of the laboratory tests showed that the lyophilized crude extract maintained its activity during storage more effectively than either the crude extract solution or the rehydrated commercial urease. While incorporating 2% dextran (w/v) prior to lyophilization of the crude extract increased the overall enzymatic activity, it did not enhance the stability of the urease during storage.


Games ◽  
2021 ◽  
Vol 12 (3) ◽  
pp. 53
Author(s):  
Roberto Rozzi

We consider an evolutionary model of social coordination in a 2 × 2 game where two groups of players prefer to coordinate on different actions. Players can pay a cost to learn their opponent’s group: if they pay it, they can condition their actions concerning the groups. We assess the stability of outcomes in the long run using stochastic stability analysis. We find that three elements matter for the equilibrium selection: the group size, the strength of preferences, and the information’s cost. If the cost is too high, players never learn the group of their opponents in the long run. If one group is stronger in preferences for its favorite action than the other, or its size is sufficiently large compared to the other group, every player plays that group’s favorite action. If both groups are strong enough in preferences, or if none of the groups’ sizes is large enough, players play their favorite actions and miscoordinate in inter-group interactions. Lower levels of the cost favor coordination. Indeed, when the cost is low, in inside-group interactions, players always coordinate on their favorite action, while in inter-group interactions, they coordinate on the favorite action of the group that is stronger in preferences or large enough.


2021 ◽  
Vol 15 (3) ◽  
pp. 1-28
Author(s):  
Xueyan Liu ◽  
Bo Yang ◽  
Hechang Chen ◽  
Katarzyna Musial ◽  
Hongxu Chen ◽  
...  

Stochastic blockmodel (SBM) is a widely used statistical network representation model, with good interpretability, expressiveness, generalization, and flexibility, which has become prevalent and important in the field of network science over the last years. However, learning an optimal SBM for a given network is an NP-hard problem. This results in significant limitations when it comes to applications of SBMs in large-scale networks, because of the significant computational overhead of existing SBM models, as well as their learning methods. Reducing the cost of SBM learning and making it scalable for handling large-scale networks, while maintaining the good theoretical properties of SBM, remains an unresolved problem. In this work, we address this challenging task from a novel perspective of model redefinition. We propose a novel redefined SBM with Poisson distribution and its block-wise learning algorithm that can efficiently analyse large-scale networks. Extensive validation conducted on both artificial and real-world data shows that our proposed method significantly outperforms the state-of-the-art methods in terms of a reasonable trade-off between accuracy and scalability. 1


Sign in / Sign up

Export Citation Format

Share Document