PLSA-Based Personalized Information Retrieval with Network Regularization

2019 ◽  
Vol 12 (1) ◽  
pp. 105-116
Author(s):  
Qiuyu Zhu ◽  
Dongmei Li ◽  
Cong Dai ◽  
Qichen Han ◽  
Yi Lin

With the rapid development of the Internet, the information retrieval model based on the keywords matching algorithm has not met the requirements of users, because people with various query history always have different retrieval intentions. User query history often implies their interests. Therefore, it is of great importance to enhance the recall ratio and the precision ratio by applying query history into the judgment of retrieval intentions. For this sake, this article does research on user query history and proposes a method to construct user interest model utilizing query history. Coordinately, the authors design a model called PLSA-based Personalized Information Retrieval with Network Regularization. Finally, the model is applied into academic information retrieval and the authors compare it with Baidu Scholar and the personalized information retrieval model based on the probabilistic latent semantic analysis topic model. The experiment results prove that this model can effectively extract topics and retrieves back results more satisfied for users' requirements. Also, this model improves the effect of retrieval results apparently. In addition, the retrieval model can be utilized not only in the academic information retrieval, but also in the personalized information retrieval on microblog search, associate recommendation, etc.

Author(s):  
Radha Guha

Background:: In the era of information overload it is very difficult for a human reader to make sense of the vast information available in the internet quickly. Even for a specific domain like college or university website it may be difficult for a user to browse through all the links to get the relevant answers quickly. Objective:: In this scenario, design of a chat-bot which can answer questions related to college information and compare between colleges will be very useful and novel. Methods:: In this paper a novel conversational interface chat-bot application with information retrieval and text summariza-tion skill is designed and implemented. Firstly this chat-bot has a simple dialog skill when it can understand the user query intent, it responds from the stored collection of answers. Secondly for unknown queries, this chat-bot can search the internet and then perform text summarization using advanced techniques of natural language processing (NLP) and text mining (TM). Results:: The advancement of NLP capability of information retrieval and text summarization using machine learning tech-niques of Latent Semantic Analysis(LSI), Latent Dirichlet Allocation (LDA), Word2Vec, Global Vector (GloVe) and Tex-tRank are reviewed and compared in this paper first before implementing them for the chat-bot design. This chat-bot im-proves user experience tremendously by getting answers to specific queries concisely which takes less time than to read the entire document. Students, parents and faculty can get the answers for variety of information like admission criteria, fees, course offerings, notice board, attendance, grades, placements, faculty profile, research papers and patents etc. more effi-ciently. Conclusion:: The purpose of this paper was to follow the advancement in NLP technologies and implement them in a novel application.


2003 ◽  
Vol 18 (2) ◽  
pp. 251-265 ◽  
Author(s):  
Silvia Acid ◽  
Luis M. De Campos ◽  
Juan M. Fernández-Luna ◽  
Juan F. Huete

2013 ◽  
Vol 303-306 ◽  
pp. 1420-1425
Author(s):  
Qiang Pu ◽  
Ahmed Lbath ◽  
Da Qing He

Mobile personalized web search has been introduced for the purpose of distinguishing mobile user's personal different search interest. We first take the user's location information into account to do a geographic query expansion, then present an approach to personalizing web search for mobile users within language modeling framework. We estimate a user mixed model estimated according to both activated ontological topic model-based feedback and user interest model to re-rank the results from geographic query expansion. Experiments show that language model based re-ranking method is effective in presenting more relevant documents on the top retrieved results to mobile users. The main contribution of the improvements comes from the consideration of geographic information, ontological topic information and user interests together to find more relevant documents for satisfying their personal information need.


2019 ◽  
Vol 52 (9-10) ◽  
pp. 1289-1298 ◽  
Author(s):  
Lei Shi ◽  
Gang Cheng ◽  
Shang-ru Xie ◽  
Gang Xie

The aim of topic detection is to automatically identify the events and hot topics in social networks and continuously track known topics. Applying the traditional methods such as Latent Dirichlet Allocation and Probabilistic Latent Semantic Analysis is difficult given the high dimensionality of massive event texts and the short-text sparsity problems of social networks. The problem also exists of unclear topics caused by the sparse distribution of topics. To solve the above challenge, we propose a novel word embedding topic model by combining the topic model and the continuous bag-of-words mode (Cbow) method in word embedding method, named Cbow Topic Model (CTM), for topic detection and summary in social networks. We conduct similar word clustering of the target social network text dataset by introducing the classic Cbow word vectorization method, which can effectively learn the internal relationship between words and reduce the dimensionality of the input texts. We employ the topic model-to-model short text for effectively weakening the sparsity problem of social network texts. To detect and summarize the topic, we propose a topic detection method by leveraging similarity computing for social networks. We collected a Sina microblog dataset to conduct various experiments. The experimental results demonstrate that the CTM method is superior to the existing topic model method.


2014 ◽  
Vol 926-930 ◽  
pp. 2160-2163
Author(s):  
Ming Xu ◽  
Yun Ke

The common information retrieval technology is mainly based on keyword matching and this kind of method only focuse on the optimization algorithm, ignoring the semantic research. This does not solve the fundamental semantic multiplicity, retrieve diversity, related web undetected, sort unstandardized. This paper is a study of these problems arise for the current proposed MIRSA information retrieval model based on semantic analysis. This model consists of the following four main key points: disambiguation method, semantic expansion algorithm, the search terms match strategy, web sorting algorithms. This model can effectively solve the problem of semantic multiplicity, avoid missed relevant pages and reasonably improve the sor of related pages.


2014 ◽  
Vol 519-520 ◽  
pp. 853-856
Author(s):  
Zeinab E. Al-Arab ◽  
Ahmed M. Gadallah ◽  
Hesham M. Hefny

The paper proposes a linguistic based fuzzy ontology information retrieval model. The model deals with linguistic based queries in multi domains. Such linguistics are user defined, reflecting his subjective view. The model also proposes a ranking algorithm that ranks the set of relevant documents according to some criteria such as their relevance degree, confidence degree, and updating degree.


Sign in / Sign up

Export Citation Format

Share Document