PLSA-Based Personalized Information Retrieval with Network Regularization

With the rapid development of the Internet, the information retrieval model based on the keywords matching algorithm has not met the requirements of users, because people with various query history always have different retrieval intentions. User query history often implies their interests. Therefore, it is of great importance to enhance the recall ratio and the precision ratio by applying query history into the judgment of retrieval intentions. For this sake, this article does research on user query history and proposes a method to construct user interest model utilizing query history. Coordinately, the authors design a model called PLSA-based Personalized Information Retrieval with Network Regularization. Finally, the model is applied into academic information retrieval and the authors compare it with Baidu Scholar and the personalized information retrieval model based on the probabilistic latent semantic analysis topic model. The experiment results prove that this model can effectively extract topics and retrieves back results more satisfied for users' requirements. Also, this model improves the effect of retrieval results apparently. In addition, the retrieval model can be utilized not only in the academic information retrieval, but also in the personalized information retrieval on microblog search, associate recommendation, etc.

Download Full-text

The Personalized Information Retrieval Model Based on User Interest

Physics Procedia ◽

10.1016/j.phpro.2012.02.122 ◽

2012 ◽

Vol 24 ◽

pp. 817-821 ◽

Cited By ~ 2

Author(s):

Songjie Gong

Keyword(s):

Information Retrieval ◽

Retrieval Model ◽

User Interest ◽

Model Based

Download Full-text

XML document information retrieval model based on four-layered Bayesian network

Journal of Computer Applications ◽

10.3724/sp.j.1087.2009.02791 ◽

2009 ◽

Vol 29 (10) ◽

pp. 2791-2795

Author(s):

Xiao-long ZHANG ◽

Xing-chen HENG

Keyword(s):

Information Retrieval ◽

Bayesian Network ◽

Retrieval Model ◽

Model Based ◽

Xml Document

Download Full-text

Designing a Chat-Bot for College Information using Information Retrieval and Automatic Text Summarization Techniques

Current Chinese Computer Science ◽

10.2174/2665997201999201022191540 ◽

2020 ◽

Vol 01 ◽

Author(s):

Radha Guha

Keyword(s):

Information Retrieval ◽

Language Processing ◽

Latent Dirichlet Allocation ◽

Semantic Analysis ◽

Text Summarization ◽

The Internet ◽

Specific Domain ◽

User Query ◽

College Information ◽

Chat Bot

Background:: In the era of information overload it is very difficult for a human reader to make sense of the vast information available in the internet quickly. Even for a specific domain like college or university website it may be difficult for a user to browse through all the links to get the relevant answers quickly. Objective:: In this scenario, design of a chat-bot which can answer questions related to college information and compare between colleges will be very useful and novel. Methods:: In this paper a novel conversational interface chat-bot application with information retrieval and text summariza-tion skill is designed and implemented. Firstly this chat-bot has a simple dialog skill when it can understand the user query intent, it responds from the stored collection of answers. Secondly for unknown queries, this chat-bot can search the internet and then perform text summarization using advanced techniques of natural language processing (NLP) and text mining (TM). Results:: The advancement of NLP capability of information retrieval and text summarization using machine learning tech-niques of Latent Semantic Analysis(LSI), Latent Dirichlet Allocation (LDA), Word2Vec, Global Vector (GloVe) and Tex-tRank are reviewed and compared in this paper first before implementing them for the chat-bot design. This chat-bot im-proves user experience tremendously by getting answers to specific queries concisely which takes less time than to read the entire document. Students, parents and faculty can get the answers for variety of information like admission criteria, fees, course offerings, notice board, attendance, grades, placements, faculty profile, research papers and patents etc. more effi-ciently. Conclusion:: The purpose of this paper was to follow the advancement in NLP technologies and implement them in a novel application.

Download Full-text

An information retrieval model based on simple Bayesian networks

International Journal of Intelligent Systems ◽

10.1002/int.10088 ◽

2003 ◽

Vol 18 (2) ◽

pp. 251-265 ◽

Cited By ~ 28

Author(s):

Silvia Acid ◽

Luis M. De Campos ◽

Juan M. Fernández-Luna ◽

Juan F. Huete

Keyword(s):

Information Retrieval ◽

Bayesian Networks ◽

Retrieval Model ◽

Model Based

Download Full-text

Mobile Geographic Web Search Personalization with Language Model

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.303-306.1420 ◽

2013 ◽

Vol 303-306 ◽

pp. 1420-1425

Author(s):

Qiang Pu ◽

Ahmed Lbath ◽

Da Qing He

Keyword(s):

Query Expansion ◽

Web Search ◽

Mixed Model ◽

Topic Model ◽

Language Model ◽

Mobile Users ◽

Information Need ◽

User Interest ◽

Modeling Framework ◽

Model Based

Mobile personalized web search has been introduced for the purpose of distinguishing mobile user's personal different search interest. We first take the user's location information into account to do a geographic query expansion, then present an approach to personalizing web search for mobile users within language modeling framework. We estimate a user mixed model estimated according to both activated ontological topic model-based feedback and user interest model to re-rank the results from geographic query expansion. Experiments show that language model based re-ranking method is effective in presenting more relevant documents on the top retrieved results to mobile users. The main contribution of the improvements comes from the consideration of geographic information, ontological topic information and user interests together to find more relevant documents for satisfying their personal information need.

Download Full-text

A word embedding topic model for topic detection and summary in social networks

Measurement and Control ◽

10.1177/0020294019865750 ◽

2019 ◽

Vol 52 (9-10) ◽

pp. 1289-1298 ◽

Cited By ~ 1

Author(s):

Lei Shi ◽

Gang Cheng ◽

Shang-ru Xie ◽

Gang Xie

Keyword(s):

Social Networks ◽

Social Network ◽

Latent Dirichlet Allocation ◽

Semantic Analysis ◽

Topic Model ◽

Word Embedding ◽

Probabilistic Latent Semantic Analysis ◽

Topic Detection ◽

Short Text ◽

Internal Relationship

The aim of topic detection is to automatically identify the events and hot topics in social networks and continuously track known topics. Applying the traditional methods such as Latent Dirichlet Allocation and Probabilistic Latent Semantic Analysis is difficult given the high dimensionality of massive event texts and the short-text sparsity problems of social networks. The problem also exists of unclear topics caused by the sparse distribution of topics. To solve the above challenge, we propose a novel word embedding topic model by combining the topic model and the continuous bag-of-words mode (Cbow) method in word embedding method, named Cbow Topic Model (CTM), for topic detection and summary in social networks. We conduct similar word clustering of the target social network text dataset by introducing the classic Cbow word vectorization method, which can effectively learn the internal relationship between words and reduce the dimensionality of the input texts. We employ the topic model-to-model short text for effectively weakening the sparsity problem of social network texts. To detect and summarize the topic, we propose a topic detection method by leveraging similarity computing for social networks. We collected a Sina microblog dataset to conduct various experiments. The experimental results demonstrate that the CTM method is superior to the existing topic model method.

Download Full-text

The Research of Information Retrieval Technology Based on Semantic Analysis

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.926-930.2160 ◽

2014 ◽

Vol 926-930 ◽

pp. 2160-2163

Author(s):

Ming Xu ◽

Yun Ke

Keyword(s):

Information Retrieval ◽

Optimization Algorithm ◽

Semantic Analysis ◽

Retrieval Model ◽

Common Information ◽

Search Terms ◽

Sorting Algorithms ◽

Key Points ◽

The Common ◽

Semantic Expansion

The common information retrieval technology is mainly based on keyword matching and this kind of method only focuse on the optimization algorithm, ignoring the semantic research. This does not solve the fundamental semantic multiplicity, retrieve diversity, related web undetected, sort unstandardized. This paper is a study of these problems arise for the current proposed MIRSA information retrieval model based on semantic analysis. This model consists of the following four main key points: disambiguation method, semantic expansion algorithm, the search terms match strategy, web sorting algorithms. This model can effectively solve the problem of semantic multiplicity, avoid missed relevant pages and reasonably improve the sor of related pages.

Download Full-text

An Enhanced Fuzzy Information Retrieval Model Based on Linguistics

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.519-520.853 ◽

2014 ◽

Vol 519-520 ◽

pp. 853-856

Author(s):

Zeinab E. Al-Arab ◽

Ahmed M. Gadallah ◽

Hesham M. Hefny

Keyword(s):

Information Retrieval ◽

Ranking Algorithm ◽

Retrieval Model ◽

Fuzzy Information ◽

Model Based ◽

Fuzzy Ontology ◽

Confidence Degree ◽

Subjective View ◽

Fuzzy Information Retrieval

The paper proposes a linguistic based fuzzy ontology information retrieval model. The model deals with linguistic based queries in multi domains. Such linguistics are user defined, reflecting his subjective view. The model also proposes a ranking algorithm that ranks the set of relevant documents according to some criteria such as their relevance degree, confidence degree, and updating degree.

Download Full-text