Missing Link Prediction Using Non-Overlapped Features and Multiple Sources of Social Networks

Social network is a hot topic of interest for researchers in the field of computer science in recent years. These social networks such as Facebook, Twitter, Instagram play an important role in information diffusion. Social network data are created by its users. Users’ online activities and behavior have been studied in various past research efforts in order to get a better understanding on how information is diffused on social networks. In this study, we focus on Twitter and we explore the impact of user behavior on their retweet activity. To represent a user’s behavior for predicting their retweet decision, we introduce 10-dimentional emotion and 35-dimensional personality related features. We consider the difference of a user being an author and a retweeter in terms of their behaviors, and propose a machine learning based retweet prediction model considering this difference. We also propose two approaches for matrix factorization retweet prediction model which learns the latent relation between users and tweets to predict the user’s retweet decision. In the experiment, we have tested our proposed models. We find that models based on user behavior related features provide good improvement (3% - 6% in terms of F1- score) over baseline models. By only considering user’s behavior as a retweeter, the data processing time is reduced while the prediction accuracy is comparable to the case when both retweeting and posting behaviors are considered. In the proposed matrix factorization models, we include tweet features into the basic factorization model through newly defined regularization terms and improve the performance by 3% - 4% in terms of F1-score. Finally, we compare the performance of machine learning and matrix factorization models for retweet prediction and find that none of the models is superior to the other in all occasions. Therefore, different models should be used depending on how prediction results will be used. Machine learning model is preferable when a model’s performance quality is important such as for tweet re-ranking and tweet recommendation. Matrix factorization is a preferred option when model’s positive retweet prediction capability is more important such as for marketing campaign and finding potential retweeters.

Download Full-text

Retweet Prediction Based on User Behavior

10.32920/ryerson.14657001.v1 ◽

2021 ◽

Author(s):

Syeda Nadia Firdaus

Keyword(s):

Machine Learning ◽

Social Networks ◽

Social Network ◽

Prediction Model ◽

Matrix Factorization ◽

Information Diffusion ◽

User Behavior ◽

Past Research ◽

The Difference ◽

The Impact

Social network is a hot topic of interest for researchers in the field of computer science in recent years. These social networks such as Facebook, Twitter, Instagram play an important role in information diffusion. Social network data are created by its users. Users’ online activities and behavior have been studied in various past research efforts in order to get a better understanding on how information is diffused on social networks. In this study, we focus on Twitter and we explore the impact of user behavior on their retweet activity. To represent a user’s behavior for predicting their retweet decision, we introduce 10-dimentional emotion and 35-dimensional personality related features. We consider the difference of a user being an author and a retweeter in terms of their behaviors, and propose a machine learning based retweet prediction model considering this difference. We also propose two approaches for matrix factorization retweet prediction model which learns the latent relation between users and tweets to predict the user’s retweet decision. In the experiment, we have tested our proposed models. We find that models based on user behavior related features provide good improvement (3% - 6% in terms of F1- score) over baseline models. By only considering user’s behavior as a retweeter, the data processing time is reduced while the prediction accuracy is comparable to the case when both retweeting and posting behaviors are considered. In the proposed matrix factorization models, we include tweet features into the basic factorization model through newly defined regularization terms and improve the performance by 3% - 4% in terms of F1-score. Finally, we compare the performance of machine learning and matrix factorization models for retweet prediction and find that none of the models is superior to the other in all occasions. Therefore, different models should be used depending on how prediction results will be used. Machine learning model is preferable when a model’s performance quality is important such as for tweet re-ranking and tweet recommendation. Matrix factorization is a preferred option when model’s positive retweet prediction capability is more important such as for marketing campaign and finding potential retweeters.

Download Full-text

Trust-Based Missing Link Prediction in Signed Social Networks with Privacy Preservation

Wireless Communications and Mobile Computing ◽

10.1155/2020/8849536 ◽

2020 ◽

Vol 2020 ◽

pp. 1-10

Author(s):

Huaizhen Kou ◽

Fan Wang ◽

Chao Lv ◽

Zhaoan Dong ◽

Wanli Huang ◽

...

Keyword(s):

Social Networks ◽

Social Network ◽

Privacy Protection ◽

Link Prediction ◽

Social Relationship ◽

Hamming Distance ◽

Mobile Internet ◽

User Privacy ◽

Missing Link ◽

Signed Social Networks

With the development of mobile Internet, more and more individuals and institutions tend to express their views on certain things (such as software and music) on social platforms. In some online social network services, users are allowed to label users with similar interests as “trust” to get the information they want and use “distrust” to label users with opposite interests to avoid browsing content they do not want to see. The networks containing such trust relationships and distrust relationships are named signed social networks (SSNs), and some real-world complex systems can be also modeled with signed networks. However, the sparse social relationships seriously hinder the expansion of users’ social circle in social networks. In order to solve this problem, researchers have done a lot of research on link prediction. Although these studies have been proved to be effective in the unsigned social network, the prediction of trust and distrust in SSN has not achieved good results. In addition, the existing link prediction research does not consider the needs of user privacy protection, so most of them do not add privacy protection measures. To solve these problems, we propose a trust-based missing link prediction method (TMLP). First, we use the simhash method to create a hash index for each user. Then, we calculate the Hamming distance between the two users to determine whether they can establish a new social relationship. Finally, we use the fuzzy computing model to determine the type of their new social relationship (e.g., trust or distrust). In the paper, we gradually explain our method through a case study and prove our method’s feasibility.

Download Full-text

Performance Analysis of Naïve Bayes Classifier Over Similarity Score-Based Techniques for Missing Link Prediction in Ego Networks

Journal of Information Technology Research ◽

10.4018/jitr.2021010107 ◽

2021 ◽

Vol 14 (1) ◽

pp. 110-122

Author(s):

Anand Kumar Gupta ◽

Neetu Sardana

Keyword(s):

Machine Learning ◽

Social Network ◽

Performance Analysis ◽

Link Prediction ◽

Similarity Score ◽

Learning Performance ◽

Bayes Classifier ◽

Missing Link ◽

Ego Networks ◽

Network Link

Keywords Ego Network, Link Prediction, Machine Learning, Performance Analysis, Similarity Score, Social Network

Download Full-text

I Know Where You Are Coming From: On the Impact of Social Media Sources on AI Model Performance (Student Abstract)

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i10.7258 ◽

2020 ◽

Vol 34 (10) ◽

pp. 13971-13972

Author(s):

Yang Qi ◽

Farseev Aleksandr ◽

Filchenkov Andrey

Keyword(s):

Machine Learning ◽

Social Networks ◽

Social Media ◽

Social Network ◽

Model Performance ◽

User Profiling ◽

Personalized Recommendation ◽

Modal Data ◽

Social Media Networks ◽

The Impact

Nowadays, social networks play a crucial role in human everyday life and no longer purely associated with spare time spending. In fact, instant communication with friends and colleagues has become an essential component of our daily interaction giving a raise of multiple new social network types emergence. By participating in such networks, individuals generate a multitude of data points that describe their activities from different perspectives and, for example, can be further used for applications such as personalized recommendation or user profiling. However, the impact of the different social media networks on machine learning model performance has not been studied comprehensively yet. Particularly, the literature on modeling multi-modal data from multiple social networks is relatively sparse, which had inspired us to take a deeper dive into the topic in this preliminary study. Specifically, in this work, we will study the performance of different machine learning models when being learned on multi-modal data from different social networks. Our initial experimental results reveal that social network choice impacts the performance and the proper selection of data source is crucial.

Download Full-text

Group Relationship Mining of College Students Based on Predictive Social Network

Security and Communication Networks ◽

10.1155/2022/5443655 ◽

2022 ◽

Vol 2022 ◽

pp. 1-9

Author(s):

Huazhang Liu

Keyword(s):

College Students ◽

Social Networks ◽

Social Network ◽

Complex Network ◽

Link Prediction ◽

Rapid Development ◽

Combination Method ◽

Prediction Algorithm ◽

The Social ◽

Random Block

With the rapid development of the Internet, social networks have shown an unprecedented development trend among college students. Closer social activities among college students have led to the emergence of college students with new social characteristics. The traditional method of college students’ group classification can no longer meet the current demand. Therefore, this paper proposes a social network link prediction method-combination algorithm, which combines neighbor information and a random block. By mining the social networks of college students’ group relationships, the classification of college students’ groups can be realized. Firstly, on the basis of complex network theory, the essential relationship of college student groups under a complex network is analyzed. Secondly, a new combination algorithm is proposed by using the simplest linear combination method to combine the proximity link prediction based on neighbor information and the likelihood analysis link prediction based on a random block. Finally, the proposed combination algorithm is verified by using the social data of college students’ networks. Experimental results show that, compared with the traditional link prediction algorithm, the proposed combination algorithm can effectively dig out the group characteristics of social networks and improve the accuracy of college students’ association classification.

Download Full-text

A novel similarity measure for missing link prediction in social networks

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v19.i2.pp1071-1077 ◽

2020 ◽

Vol 19 (2) ◽

pp. 1071

Author(s):

Gogulamudi Naga Chandrika ◽

E. Srinivasa Reddy

Keyword(s):

Social Networks ◽

Complex Networks ◽

Similarity Measure ◽

Network Theory ◽

Link Prediction ◽

Prediction Methods ◽

Missing Link ◽

The Social ◽

Hidden Patterns ◽

Over Time

Social Networks progress over time by the addition of new nodes and links, form associations with one community to the other community. Over a few decades, the fast expansion of Social Networks has attracted many researchers to pay more attention towards complex networks, the collection of social data, understand the social behaviors of complex networks and predict future conflicts. Thus, Link prediction is imperative to do research with social networks and network theory. The objective of this research is to find the hidden patterns and uncovered missing links over complex networks. Here, we developed a new similarity measure to predict missing links over social networks. The new method is computed on common neighbors with node-to-node distance to get better accuracy of missing link prediction. We tested the proposed measure on a variety of real-world linked datasets which are formed from various linked social networks. The proposed approach performance is compared with contemporary link prediction methods. Our measure makes very effective and intuitive in predicting disappeared links in linked social networks.

Download Full-text

Link Prediction in Social Networks

Advances in Wireless Technologies and Telecommunication - Graph Theoretic Approaches for Analyzing Large-Scale Social Networks ◽

10.4018/978-1-5225-2814-2.ch010 ◽

2018 ◽

pp. 164-172 ◽

Cited By ~ 1

Author(s):

Sovan Samanta ◽

Madhumangal Pal

Keyword(s):

Social Networks ◽

Social Network ◽

Link Prediction ◽

Research Work ◽

Dense Networks ◽

Friend Recommendation ◽

Neighborhood Method

Social network is a topic of current research. Relations are broken and new relations are increased. This chapter will discuss the scope or predictions of new links in social networks. Here different approaches for link predictions are described. Among them friend recommendation model is latest. There are some other methods like common neighborhood method which is also analyzed here. The comparison among them to predict links in social networks is described. The significance of this research work is to find strong dense networks in future.

Download Full-text

Hidden Link Prediction in Criminal Networks Using the Deep Reinforcement Learning Technique

Computers ◽

10.3390/computers8010008 ◽

2019 ◽

Vol 8 (1) ◽

pp. 8 ◽

Cited By ~ 7

Author(s):

Marcus Lim ◽

Azween Abdullah ◽

NZ Jhanjhi ◽

Mahadevan Supramaniam

Keyword(s):

Machine Learning ◽

Reinforcement Learning ◽

Network Analysis ◽

Prediction Model ◽

Supervised Learning ◽

Link Prediction ◽

Supervised Machine Learning ◽

Criminal Networks ◽

Criminal Network ◽

Learning Technique

Criminal network activities, which are usually secret and stealthy, present certain difficulties in conducting criminal network analysis (CNA) because of the lack of complete datasets. The collection of criminal activities data in these networks tends to be incomplete and inconsistent, which is reflected structurally in the criminal network in the form of missing nodes (actors) and links (relationships). Criminal networks are commonly analyzed using social network analysis (SNA) models. Most machine learning techniques that rely on the metrics of SNA models in the development of hidden or missing link prediction models utilize supervised learning. However, supervised learning usually requires the availability of a large dataset to train the link prediction model in order to achieve an optimum performance level. Therefore, this research is conducted to explore the application of deep reinforcement learning (DRL) in developing a criminal network hidden links prediction model from the reconstruction of a corrupted criminal network dataset. The experiment conducted on the model indicates that the dataset generated by the DRL model through self-play or self-simulation can be used to train the link prediction model. The DRL link prediction model exhibits a better performance than a conventional supervised machine learning technique, such as the gradient boosting machine (GBM) trained with a relatively smaller domain dataset.

Download Full-text

Constructed Link Prediction Model by Relation Pattern on the Social Network

Proceedings of the Fifth Euro-China Conference on Intelligent Data Analysis and Applications - Advances in Intelligent Systems and Computing ◽

10.1007/978-3-030-03766-6_30 ◽

2018 ◽

pp. 264-271

Author(s):

Jimmy Ming-Tai Wu ◽

Meng-Hsiun Tsai ◽

Tu-Wei Li ◽

Hsien-Chung Huang

Keyword(s):

Social Network ◽

Prediction Model ◽

Link Prediction ◽

The Social

Download Full-text