Design of Friends-Making Recommendation System Based on Reinforcement Learning and Generative Adversarial Network

Widely used recommendation systems do not meet all industry requirements, so the search for more advanced methods for creating recommendations continues. The proposed new methods based on Generative Adversarial Networks (GAN) have a theoretical comparison with other recommendation algorithms; however, real-world comparisons are needed to introduce new methods in the industry. In our work, we compare recommendations from the Generative Adversarial Network with recommendation from the Deep Semantic Similarity Model (DSSM) on real-world case of airflight tickets. We found a way to train the GAN so that users receive appropriate recommendations, and during A/B testing, we noted that the GAN-based recommendation system can successfully compete with other neural networks in generating recommendations. One of the advantages of the proposed approach is that the GAN training process avoids a negative sampling, which causes a number of distortions in the final ratings of recommendations. Due to the ability of the GAN to generate new objects from the distribution of the training set, we assume that the Conditional GAN is able to solve the cold start problem.

Download Full-text

Deep inverse reinforcement learning for structural evolution of small molecules

Briefings in Bioinformatics ◽

10.1093/bib/bbaa364 ◽

2020 ◽

Author(s):

Brighter Agyemang ◽

Wei-Ping Wu ◽

Daniel Addo ◽

Michael Y Kpiebaareh ◽

Ebenezer Nanor ◽

...

Keyword(s):

Reinforcement Learning ◽

High Throughput Screening ◽

Structural Evolution ◽

Search Space ◽

New Drugs ◽

Inverse Reinforcement Learning ◽

Generative Adversarial Network ◽

Entropy Maximization ◽

Reward Function ◽

Adversarial Network

Abstract The size and quality of chemical libraries to the drug discovery pipeline are crucial for developing new drugs or repurposing existing drugs. Existing techniques such as combinatorial organic synthesis and high-throughput screening usually make the process extraordinarily tough and complicated since the search space of synthetically feasible drugs is exorbitantly huge. While reinforcement learning has been mostly exploited in the literature for generating novel compounds, the requirement of designing a reward function that succinctly represents the learning objective could prove daunting in certain complex domains. Generative adversarial network-based methods also mostly discard the discriminator after training and could be hard to train. In this study, we propose a framework for training a compound generator and learn a transferable reward function based on the entropy maximization inverse reinforcement learning (IRL) paradigm. We show from our experiments that the IRL route offers a rational alternative for generating chemical compounds in domains where reward function engineering may be less appealing or impossible while data exhibiting the desired objective is readily available.

Download Full-text

Recommendation System Based on Generative Adversarial Network with Graph Convolutional Layers

Journal of Advanced Computational Intelligence and Intelligent Informatics ◽

10.20965/jaciii.2021.p0389 ◽

2021 ◽

Vol 25 (4) ◽

pp. 389-396

Author(s):

Takato Sasagawa ◽

◽

Shin Kawai ◽

Hajime Nobuhara

Keyword(s):

Bipartite Graph ◽

Recommendation System ◽

Sufficient Data ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Latent Features ◽

Information Recommendation ◽

Domain Information

A graph convolutional generative adversarial network (GCGAN) is proposed to provide recommendations for new users or items. To maintain scalability, the discriminator was improved to capture the latent features of users and items, using graph convolution from a minibatch-sized bipartite graph. In the experiment using MovieLens, it was confirmed that the proposed GCGAN had better performance than the conventional CFGAN, when MovieLens 1M was employed with sufficient data. The proposed method is characterized in such a manner that it can learn domain information of both, users and items, and it does not require to relearn a model for a new node. Further, it can be developed for any service having such conditions, in the information recommendation field.

Download Full-text

A Generative Adversarial Network Enabled Deep Distributional Reinforcement Learning for Transmission Scheduling in Internet of Vehicles

IEEE Transactions on Intelligent Transportation Systems ◽

10.1109/tits.2020.3033577 ◽

2020 ◽

pp. 1-10

Author(s):

Faisal Naeem ◽

Sattar Seifollahi ◽

Zhenyu Zhou ◽

Muhammad Tariq

Keyword(s):

Reinforcement Learning ◽

Transmission Scheduling ◽

Generative Adversarial Network ◽

Internet Of Vehicles ◽

Adversarial Network

Download Full-text

Generative Adversarial Network and Reinforcement Learning to Estimate Channel Coefficients

Lecture Notes in Electrical Engineering - Machine Learning, Deep Learning and Computational Intelligence for Wireless Communication ◽

10.1007/978-981-16-0289-4_4 ◽

2021 ◽

pp. 49-58

Author(s):

Pranav Mani ◽

E. S. Gopi ◽

Hrishikesh Shekhar ◽

Sharan Chandra

Keyword(s):

Reinforcement Learning ◽

Generative Adversarial Network ◽

Adversarial Network

Download Full-text

Network Flow Generation Based on Reinforcement Learning Powered Generative Adversarial Network

10.1109/ic-nidc54101.2021.9660491 ◽

2021 ◽

Author(s):

Jianxue Li ◽

Yang Xiao ◽

Jiawei Wu ◽

Jialong Feng ◽

Jun Liu

Keyword(s):

Reinforcement Learning ◽

Network Flow ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Flow Generation

Download Full-text

Deep reinforcement learning for diagnosing various types of cancer by TP53 mutation patterns

10.21203/rs.3.rs-744748/v1 ◽

2021 ◽

Author(s):

Armaqan Rahmani ◽

Behrouz Minaei-Bidgoli ◽

Meysam Ahangaran

Keyword(s):

Reinforcement Learning ◽

Tp53 Mutation ◽

Short Term Memory ◽

Disease Risk ◽

P53 Mutation ◽

Multiple Cancer ◽

Generative Adversarial Network ◽

Cancer Subtypes ◽

Adversarial Network ◽

Cancer Types

Abstract One of the key challenges for classifying multiple cancer types is the complexity of Tumor Protein p53 mutation patterns and its individual effects on tumors. However, far too little attention has been paid to Deep reinforcement Learning on TP53 mutation patterns because of its extremely difficult result interpretations. We introduce a critic network by a long-short term memory, which is appropriated for discriminating the noise samples from a Feedback Generative Adversarial Network and analyzing the actor network. The correlation and analysis of the results in a belief network demonstrates significant relations between mutations and disease risk in cancer subtypes identification. In other words, the results indicate statically significant differences between the primary and secondary subtype groups of the most probable tumor.

Download Full-text

A Method of Offline Reinforcement Learning Virtual Reality Satellite Attitude Control Based on Generative Adversarial Network

Wireless Communications and Mobile Computing ◽

10.1155/2021/4238125 ◽

2021 ◽

Vol 2021 ◽

pp. 1-9

Author(s):

Jian Zhang ◽

Fengge Wu

Keyword(s):

Virtual Reality ◽

Reinforcement Learning ◽

Real World ◽

Attitude Control ◽

Control Method ◽

Sensor Data ◽

Generative Adversarial Network ◽

Goal State ◽

Safety Issues ◽

Adversarial Network

Virtual reality satellites give people an immersive experience of exploring space. The intelligent attitude control method using reinforcement learning to achieve multiaxis synchronous control is one of the important tasks of virtual reality satellites. In real-world systems, methods based on reinforcement learning face safety issues during exploration, unknown actuator delays, and noise in the raw sensor data. To improve the sample efficiency and avoid safety issues during exploration, this paper proposes a new offline reinforcement learning method to make full use of samples. This method learns a policy set with imitation learning and a policy selector using a generative adversarial network (GAN). The performance of the proposed method was verified in a real-world system (reaction-wheel-based inverted pendulum). The results showed that the agent trained with our method reached and maintained a stable goal state in 10,000 steps, whereas the behavior cloning method only remained stable for 500 steps.

Download Full-text

Multimodal News Feed Evaluation System with Deep Reinforcement Learning Approaches

ACM Transactions on Asian and Low-Resource Language Information Processing ◽

10.1145/3414523 ◽

2021 ◽

Vol 20 (1) ◽

pp. 1-12

Author(s):

S. Rakesh Kumar ◽

S. Muthuramalingam ◽

Fadi Al-Turjman

Keyword(s):

Deep Learning ◽

Reinforcement Learning ◽

Evaluation System ◽

Language Models ◽

Learning Approaches ◽

Generative Adversarial Network ◽

Vast Number ◽

News Analysis ◽

Adversarial Network ◽

Learning Techniques

Multilingual and multimodal data analysis is the emerging news feed evaluation system. News feed analysis and evaluations are interrelated processes, which are useful in understanding the news factors. The news feed evaluation system can be implemented for single or multilingual language models. Classification techniques used on multilingual news analysis require deep layered learning techniques rather than conventional approaches. In this proposed work, a hierarchical structure of deep learning algorithms is implemented for making an effective complex news evaluation system. Deep learning techniques such as the Deep Cooperative Multilingual Reinforcement Learning Model, the Multidimensional Genetic Algorithm, and the Multilingual Generative Adversarial Network are developed to evaluate a vast number of news feeds. The proposed tech-niques collaborate in a pipeline order to build a deep news feed evaluation system. The implementation details project that the newly proposed system performs 5% to 12% better than the other news evaluation systems.

Download Full-text