A ship hull block is generally a grillage structure composed of skin plates, longitudinals, and transverse webs, with the longitudinals and transverse webs joined to the skin plate panel by fillet welds. Because there are many weld lines, this welding work requires considerable labor time. Simple automatic welding machines using a truck system are now widely applied, alongside semiautomatic CO2 welding and gravity welding. Since the automatic welding machine needs the help of workers for initial setting, turning, and shifting, efficient routing has to be investigated to improve productivity. However, an optimal weld sequence is difficult to find, because the number of possible sequences grows rapidly as the number of weld lines increases. Such a problem is called a combinatorial optimization problem. This paper examines how to reduce the work time using reinforcement learning, a method that imitates the behavior patterns of animals.
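To make the growth in the number of weld sequences concrete, the following is a minimal Python sketch, not the method used in this paper. It assumes that each weld line is welded exactly once in a fixed direction, so that n weld lines admit n! orderings, and that the cost of an ordering is the idle travel distance between the end of one weld line and the start of the next. The function names (`sequence_count`, `travel_length`, `best_order_brute_force`) and the coordinates are hypothetical and chosen only for illustration.

```python
from itertools import permutations
from math import dist, factorial


def sequence_count(n_lines: int) -> int:
    """Number of orderings of n weld lines, each welded exactly once (assumed model)."""
    return factorial(n_lines)


def travel_length(order, starts, ends):
    """Idle travel: distance from the end of each weld line to the start of the next."""
    return sum(dist(ends[a], starts[b]) for a, b in zip(order, order[1:]))


def best_order_brute_force(starts, ends):
    """Enumerate every ordering; feasible only for a handful of weld lines."""
    n = len(starts)
    return min(
        permutations(range(n)),
        key=lambda order: travel_length(order, starts, ends),
    )


# Hypothetical layout: three collinear weld lines on a skin plate panel.
starts = [(0.0, 0.0), (2.0, 0.0), (4.0, 0.0)]
ends = [(1.0, 0.0), (3.0, 0.0), (5.0, 0.0)]

best = best_order_brute_force(starts, ends)
print(best, travel_length(best, starts, ends))

# The count of orderings grows factorially with the number of weld lines.
for n in (5, 10, 15):
    print(n, sequence_count(n))
```

Exhaustive enumeration of this kind is practical only for a few weld lines, which is why a learning-based search such as reinforcement learning becomes attractive as the number of weld lines grows.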