Enhanced Frequent Itemsets Based on Topic Modeling in Information Filtering

2017 ◽  
Vol 5 (4) ◽  
pp. 33-43
Author(s):  
Than Than Wai ◽  
Sint Sint Aung

In order to generate users' information needs from a collection of documents, many term-based and pattern-based approaches have been used in Information Filtering. In these approaches, the documents in the collection are all about one topic. However, users' interests can be diverse, and the documents in the collection often involve multiple topics. Topic modeling is useful in machine learning and text mining: it generates models to discover the hidden multiple topics in a collection of documents, and each of these topics is represented by a distribution over words. However, its effectiveness in information filtering has not been well explored. Patterns are generally thought to be more discriminative than single terms for describing documents. The major challenge in frequent pattern mining is the large number of result patterns: as the minimum support threshold is lowered, an exponentially large number of patterns is generated. To deal with the above limitations and problems, this paper proposes a novel information filtering model, EFITM (Enhanced Frequent Itemsets based on Topic Model). Experimental results on the CRANFIELD dataset for the task of information filtering show that the proposed model outperforms state-of-the-art models.
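The pattern-explosion effect described in the abstract can be seen in a small sketch. The transaction set below is a toy example (not the CRANFIELD data), and brute-force enumeration is used purely for illustration:

```python
from itertools import combinations

# Toy transaction database (hypothetical, for illustration only).
transactions = [
    {"a", "b", "c"},
    {"a", "b", "d"},
    {"a", "c", "d"},
    {"b", "c", "d"},
    {"a", "b", "c", "d"},
]

def frequent_itemsets(transactions, min_support):
    """Brute-force enumeration of every itemset meeting min_support."""
    items = sorted(set().union(*transactions))
    result = []
    for k in range(1, len(items) + 1):
        for cand in combinations(items, k):
            support = sum(1 for t in transactions if set(cand) <= t) / len(transactions)
            if support >= min_support:
                result.append(frozenset(cand))
    return result

# Lowering the minimum support threshold inflates the number of result patterns.
print(len(frequent_itemsets(transactions, 0.8)))  # 4 patterns (singletons only)
print(len(frequent_itemsets(transactions, 0.2)))  # 15 patterns (every non-empty subset)
```

Even on five tiny transactions, dropping the threshold from 0.8 to 0.2 multiplies the output size, which is the limitation the proposed model targets.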


2021 ◽  
Vol 9 (2) ◽  
pp. 404-409
Author(s):  
K. Prashant Gokul, et al.

Topic models provide a useful method for dimensionality reduction and exploratory data analysis in large text corpora. Most approaches to topic model learning have been based on a maximum likelihood objective. Efficient algorithms exist that attempt to approximate this objective, but they have no provable guarantees. Recently, algorithms have been introduced that do provide provable bounds, but they are not practical because they are inefficient and not robust to violations of model assumptions. In this work, we propose to combine statistical topic modeling with pattern mining techniques to produce pattern-based topic models that enrich the semantic representations of conventional word-based topic models. Using the proposed pattern-based topic model, users' preferences can be modeled with multiple topics, each of which is represented by semantically rich patterns. A novel information filtering model is proposed here, in which user information needs are generated in terms of multiple topics, each represented by patterns. The algorithm produces results comparable to the best implementations while running orders of magnitude faster.
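The core idea of representing each topic by patterns rather than single words can be sketched as follows. The grouping of documents by topic is assumed to be given here (in the paper it would come from a statistical topic model), and all names and data are illustrative:

```python
from itertools import combinations
from collections import Counter

# Hypothetical topic-to-documents assignment; each document is a set of words.
topic_docs = {
    0: [{"mine", "pattern", "data"}, {"mine", "pattern"}, {"mine", "data"}],
    1: [{"filter", "user", "topic"}, {"filter", "topic"}, {"filter", "user"}],
}

def topic_patterns(docs, min_count=2, max_size=2):
    """Mine frequent word patterns (up to max_size words) within one topic."""
    counts = Counter()
    for doc in docs:
        words = sorted(doc)
        for k in range(1, max_size + 1):
            for pat in combinations(words, k):
                counts[pat] += 1
    return {pat for pat, c in counts.items() if c >= min_count}

# Each topic is now represented by semantically rich patterns, not just words.
models = {t: topic_patterns(docs) for t, docs in topic_docs.items()}
```

A pattern such as `("data", "mine")` carries more semantics for filtering than either word alone, which is the motivation stated in the abstract.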



2012 ◽  
Vol 195-196 ◽  
pp. 984-986
Author(s):  
Ming Ru Zhao ◽  
Yuan Sun ◽  
Jian Guo ◽  
Ping Ping Dong

Frequent itemset mining is an important data mining task and a focused theme in data mining research. The Apriori algorithm is one of the most important algorithms for mining frequent itemsets. However, Apriori scans the database many times, so its efficiency is relatively low. This paper therefore investigates a frequent itemset mining algorithm based on a cross-linker. Compared with the classical algorithm, the improved algorithm has obvious advantages.
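The repeated-scan cost attributed to Apriori is visible in a minimal sketch of the classic algorithm, which makes one full pass over the database per candidate level. This is toy data for illustration; the paper's cross-linker variant is not reproduced here:

```python
def apriori(transactions, min_count):
    """Classic Apriori; returns frequent itemsets and the number of DB scans."""
    scans = 0
    items = sorted(set().union(*transactions))
    level = [frozenset([i]) for i in items]
    frequent = []
    while level:
        scans += 1  # each candidate level costs one pass over the whole database
        counts = {c: sum(1 for t in transactions if c <= t) for c in level}
        freq = [c for c, n in counts.items() if n >= min_count]
        frequent.extend(freq)
        # candidate generation: join frequent k-itemsets into (k+1)-itemsets
        level = list({a | b for a in freq for b in freq if len(a | b) == len(a) + 1})
    return frequent, scans

transactions = [
    {"a", "b", "c"}, {"a", "b", "d"}, {"a", "c", "d"},
    {"b", "c", "d"}, {"a", "b", "c", "d"},
]
frequent, scans = apriori(transactions, min_count=3)
print(scans)  # 3 scans: one per candidate level explored
```

The number of scans grows with the length of the longest candidate itemset, which is exactly the inefficiency the improved algorithm targets.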



2011 ◽  
Vol 26 (7) ◽  
pp. 33-39 ◽  
Author(s):  
Kalli Srinivasa Nageswara Prasad ◽  
S. Ramakrishna


Author(s):  
Y. Fakir ◽  
R. Elayachi

Frequent pattern mining has been an important subject in data mining for many years. Remarkable progress has been made in this field, and many efficient algorithms have been designed to search for frequent patterns in a transactional database. One of the most important techniques in data mining is rule extraction from large databases, in which the time required for generating frequent itemsets plays an important role. This paper provides a comparative study of the Eclat, Apriori and FP-Growth algorithms. Their performance is compared in terms of running time and memory usage. The study also examines each algorithm's strengths and weaknesses for finding patterns among large itemsets in database systems.
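As a rough illustration of one of the compared algorithms (a minimal sketch, not the study's benchmark code): Eclat uses a vertical layout in which each item maps to its tid-list, so the support of an itemset is the size of the intersection of its items' tid-lists, with no repeated database scans:

```python
# Toy transactions, illustrative only.
transactions = [{"a", "b", "c"}, {"a", "b"}, {"a", "c"}, {"b", "c"}]

def eclat(tidlists, min_count, prefix=()):
    """Depth-first Eclat over vertical tid-lists."""
    frequent = []
    items = sorted(tidlists)
    for i, item in enumerate(items):
        tids = tidlists[item]
        if len(tids) >= min_count:
            itemset = prefix + (item,)
            frequent.append(itemset)
            # extend the prefix by intersecting tid-lists with later items
            suffix = {j: tids & tidlists[j] for j in items[i + 1:]}
            frequent.extend(eclat(suffix, min_count, itemset))
    return frequent

# Build the vertical representation: item -> set of transaction ids.
tidlists = {}
for tid, t in enumerate(transactions):
    for item in t:
        tidlists.setdefault(item, set()).add(tid)

result = eclat(tidlists, min_count=2)
print(result)  # six frequent itemsets at support count >= 2
```

The tid-list intersections replace the level-wise database passes of Apriori, which is one source of the time/memory trade-offs the study measures.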



Information sharing among organizations is a common practice in several areas such as business development and marketing. Some of the sensitive rules that should be kept private may be revealed, and such disclosure of sensitive patterns may affect the interests of the organization that owns the data. Therefore, the rules that are sensitive must be protected before the data is shared. In this paper, to provide secure information sharing, the sensitive rules, discovered with a frequent pattern tree, are perturbed first. Here the sensitive set of rules is perturbed by substitution. This kind of substitution reduces the risk and increases the utility of the dataset compared with other techniques. Experiments are conducted on a real-world dataset. Results show that the proposed work performs better than various previous techniques on the basis of the evaluation parameters.
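A minimal sketch of the substitution idea (illustrative names and data; the FP-tree discovery step is assumed to have already identified the sensitive pattern): occurrences of the pattern are perturbed, one item replaced by a placeholder, until its support falls below the mining threshold.

```python
def hide_pattern(transactions, sensitive, min_count, placeholder="*"):
    """Perturb transactions by substitution until `sensitive` is infrequent."""
    sensitive = set(sensitive)
    support = sum(1 for t in transactions if sensitive <= t)
    for t in transactions:
        if support < min_count:
            break  # pattern can no longer be mined at this threshold
        if sensitive <= t:
            victim = sorted(sensitive)[0]  # drop one item of the pattern
            t.discard(victim)
            t.add(placeholder)             # substitution keeps transaction size
            support -= 1
    return transactions

db = [{"a", "b", "c"}, {"a", "b"}, {"a", "b", "d"}, {"c", "d"}]
hide_pattern(db, {"a", "b"}, min_count=2)
support = sum(1 for t in db if {"a", "b"} <= t)
print(support)  # 1, below the mining threshold of 2
```

Substituting rather than deleting items preserves transaction sizes, which is one way perturbation can keep dataset utility higher than outright removal.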



2011 ◽  
Vol 22 (8) ◽  
pp. 1749-1760
Author(s):  
Yu-Hong GUO ◽  
Yun-Hai TONG ◽  
Shi-Wei TANG ◽  
Leng-Dong WU






2021 ◽  
pp. 1-16
Author(s):  
Ibtissem Gasmi ◽  
Mohamed Walid Azizi ◽  
Hassina Seridi-Bouchelaghem ◽  
Nabiha Azizi ◽  
Samir Brahim Belhaouari

Context-Aware Recommender Systems (CARS) suggest more relevant services by adapting them to the user's specific context. Nevertheless, using many contextual factors can increase data sparsity, while too few context parameters fail to introduce contextual effects into recommendations. Moreover, several CARSs are based on similarity measures, such as the cosine and Pearson correlation coefficients, which are not very effective on sparse datasets. This paper presents a context-aware model that integrates contextual factors into the prediction process when there are insufficient co-rated items. The proposed algorithm uses Latent Dirichlet Allocation (LDA) to learn the latent interests of users from the textual descriptions of items. It then integrates both the explicit contextual factors and their degree of importance into the prediction process by introducing a weighting function, whose weights are learned and optimized with the Particle Swarm Optimization (PSO) algorithm. Results on the MovieLens 1M dataset show that the proposed model achieves an F-measure of 45.51% with a precision of 68.64%. Furthermore, the improvements in MAE and RMSE reach 41.63% and 39.69% respectively compared with state-of-the-art techniques.
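The weighting idea can be sketched with a weighted cosine similarity. The weights are fixed by hand here, whereas in the paper they are learned with PSO, and the feature layout below is entirely hypothetical:

```python
import math

def weighted_cosine(u, v, weights):
    """Cosine similarity where each feature is scaled by a learned weight."""
    num = sum(w * a * b for w, a, b in zip(weights, u, v))
    nu = math.sqrt(sum(w * a * a for w, a in zip(weights, u)))
    nv = math.sqrt(sum(w * b * b for w, b in zip(weights, v)))
    return num / (nu * nv) if nu and nv else 0.0

# Hypothetical profiles: [LDA latent-interest score, time-of-day match, location match]
u = [0.9, 1.0, 0.0]
v = [0.8, 1.0, 1.0]
weights = [0.6, 0.3, 0.1]  # per-factor importance (PSO-optimized in the paper)
sim = weighted_cosine(u, v, weights)
```

Down-weighting the location factor keeps the mismatch in the third feature from dominating the similarity, which is the role the weighting function plays in the prediction.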


