Research on Web Service Clustering Method Based on Word Embedding and Topic Model

Author(s): Yanping Chen, Xin Wang, Hong Xia, Zhongmin Wang, Zhong Yv
2018, Vol 15 (4), pp. 29-44

Author(s): Yi Zhao, Chong Wang, Jian Wang, Keqing He

With the rapid growth of web services on the internet, web service discovery has become a hot topic in services computing. Faced with heterogeneous and unstructured service descriptions, many service clustering approaches have been proposed to facilitate web service discovery, and many others leverage auxiliary features to enhance the classical LDA model and achieve better clustering performance. However, these extended LDA approaches still have limitations in handling data sparsity and noise words. This article proposes a novel web service clustering approach that incorporates word embedding into LDA, leveraging semantically relevant words obtained from the embeddings to improve clustering performance. Specifically, Word2vec is used to train word embeddings, and the semantically relevant words of service keywords derived from these embeddings are then incorporated into the LDA training process. Finally, experiments conducted on a real-world dataset published on ProgrammableWeb show that the authors' proposed approach achieves better clustering performance than several classical approaches.
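The following is a minimal sketch of the general pipeline this abstract describes, not the authors' exact method: it trains Word2vec on tokenized service descriptions, expands each description with semantically relevant words, then trains LDA and clusters services by their topic distributions. The toy data, the expansion threshold, and the use of document expansion as a stand-in for modifying LDA training itself are all illustrative assumptions.

```python
# Sketch: word-embedding-assisted LDA clustering of web services (illustrative only).
import numpy as np
from gensim.models import Word2Vec, LdaModel
from gensim.corpora import Dictionary
from sklearn.cluster import KMeans

# Tokenized web service descriptions (toy data, assumed preprocessing).
descriptions = [
    ["map", "geolocation", "route", "api"],
    ["weather", "forecast", "temperature", "api"],
    ["payment", "credit", "card", "checkout"],
]

# Step 1: learn word embeddings from the service description corpus.
w2v = Word2Vec(sentences=descriptions, vector_size=50, min_count=1, epochs=50)

# Step 2: expand each description with semantically relevant words
# found via embedding similarity (threshold is an assumption).
def expand(tokens, topn=2, min_sim=0.3):
    extra = []
    for t in tokens:
        for word, sim in w2v.wv.most_similar(t, topn=topn):
            if sim >= min_sim:
                extra.append(word)
    return tokens + extra

expanded = [expand(doc) for doc in descriptions]

# Step 3: train LDA on the expanded corpus.
dictionary = Dictionary(expanded)
corpus = [dictionary.doc2bow(doc) for doc in expanded]
lda = LdaModel(corpus=corpus, id2word=dictionary, num_topics=3,
               passes=20, random_state=0)

# Step 4: cluster services by their document-topic distributions.
topic_vecs = np.array([
    [dict(lda.get_document_topics(bow, minimum_probability=0.0))[k] for k in range(3)]
    for bow in corpus
])
labels = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(topic_vecs)
print(labels)
```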


2018, Vol 6 (3), pp. 67-78
Author(s): Tian Nie, Yi Ding, Chen Zhao, Youchao Lin, Takehito Utsuro

The background of this article is the issue of how to overview the knowledge related to a given query keyword. Specifically, the authors focus on the concerns of those who search for web pages with a given query keyword. The web search information needs associated with a query keyword are collected through search engine suggests. Given a query keyword, the authors collect up to around 1,000 suggests, many of which are redundant, and classify the redundant search engine suggests based on a topic model. However, one limitation of topic-model-based classification of search engine suggests is that the granularity of the topics, i.e., the clusters of search engine suggests, is too coarse. To overcome this coarse-grained classification, this article further applies the word embedding technique to the webpages used during the training of the topic model, in addition to the text of the whole Japanese version of Wikipedia. The authors then examine the word-embedding-based similarity between search engine suggests and classify the suggests within a single topic into finer-grained subtopics based on that similarity. Evaluation results show that the proposed approach performs well in the task of subtopic classification of search engine suggests.
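As a rough illustration of the subtopic refinement step, the sketch below assumes a pre-trained word-embedding model (here a toy Word2vec model standing in for one trained on Wikipedia and the topic-model webpages) and a set of search engine suggests already assigned to one coarse topic. Each suggest is embedded by averaging its word vectors and the topic is split into finer subtopics by cosine-distance clustering; all names, data, and the number of subtopics are illustrative assumptions, not details from the paper.

```python
# Sketch: splitting one coarse topic of suggests into finer subtopics (illustrative only).
import numpy as np
from gensim.models import Word2Vec
from sklearn.cluster import AgglomerativeClustering

# Toy training corpus standing in for Wikipedia plus the topic-model webpages.
corpus = [
    ["hotel", "booking", "cheap", "price"],
    ["hotel", "review", "rating", "stars"],
    ["flight", "booking", "ticket", "price"],
]
w2v = Word2Vec(sentences=corpus, vector_size=50, min_count=1, epochs=50)

# Search engine suggests belonging to a single coarse topic (tokenized toy data).
suggests = [
    ["hotel", "cheap"],
    ["hotel", "price"],
    ["hotel", "review"],
    ["hotel", "rating"],
]

def embed(tokens):
    """Average the embeddings of the in-vocabulary words of one suggest."""
    vecs = [w2v.wv[t] for t in tokens if t in w2v.wv]
    return np.mean(vecs, axis=0)

X = np.vstack([embed(s) for s in suggests])

# Split the coarse topic into finer-grained subtopics by embedding similarity.
subtopics = AgglomerativeClustering(
    n_clusters=2, metric="cosine", linkage="average"
).fit_predict(X)
print(subtopics)
```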

