A New Approach Using Hidden Markov Model and Bayesian Method for Estimate of Word Types in Text Mining

Determining the structure of words in the text for the operations such as automated information extraction and text summarization of the text is essential. In computers, textual analysis to define the type of the word is considered as a vital advantage. Defining the types of words provides an estimate of the sequence of words in the sentence. In this article, estimating types of Turkish words is provided by developing a Hidden Markov Model and a Bayesian-based new model. In this model, an algorithm is developed which separates the suffixes of the words and grouping the words by counts of characters that suffixes of the words receive. A text composed of 584 Turkish words is used for the testing the dependability of the model. The model has achieved a high success rate in predicting the types of Turkish words.

Download Full-text

Application Study of Hidden Markov Model and Maximum Entropy in Text Information Extraction

Artificial Intelligence and Computational Intelligence - Lecture Notes in Computer Science ◽

10.1007/978-3-642-05253-8_44 ◽

2009 ◽

pp. 399-407

Author(s):

Rong Li ◽

Li-ying Liu ◽

He-fang Fu ◽

Jia-heng Zheng

Keyword(s):

Markov Model ◽

Hidden Markov Model ◽

Information Extraction ◽

Maximum Entropy ◽

Hidden Markov ◽

Application Study ◽

Text Information

Download Full-text

Information Extraction System Based on Hidden Markov Model

Advances in Neural Networks – ISNN 2009 - Lecture Notes in Computer Science ◽

10.1007/978-3-642-01507-6_7 ◽

2009 ◽

pp. 52-59 ◽

Cited By ~ 1

Author(s):

Dong-Chul Park ◽

Vu Thi Lan Huong ◽

Dong-Min Woo ◽

Duong Ngoc Hieu ◽

Sai Thi Hien Ninh

Keyword(s):

Markov Model ◽

Hidden Markov Model ◽

Information Extraction ◽

Hidden Markov ◽

Extraction System ◽

Information Extraction System

Download Full-text

Web information extraction based on hidden Markov model

The 2010 14th International Conference on Computer Supported Cooperative Work in Design ◽

10.1109/cscwd.2010.5471969 ◽

2010 ◽

Cited By ~ 5

Author(s):

Jianbing Lai ◽

Qiang Liu ◽

Yi Liu

Keyword(s):

Markov Model ◽

Hidden Markov Model ◽

Information Extraction ◽

Hidden Markov ◽

Web Information Extraction ◽

Web Information

Download Full-text

Stock market forecasting using hidden Markov model: a new approach

5th International Conference on Intelligent Systems Design and Applications (ISDA'05) ◽

10.1109/isda.2005.85 ◽

2005 ◽

Cited By ~ 102

Author(s):

M.R. Hassan ◽

B. Nath

Keyword(s):

Stock Market ◽

Markov Model ◽

Hidden Markov Model ◽

Hidden Markov ◽

New Approach ◽

Stock Market Forecasting

Download Full-text

Learning Hidden Markov Model Topology Based on KL Divergence for Information Extraction

Advances in Knowledge Discovery and Data Mining - Lecture Notes in Computer Science ◽

10.1007/978-3-540-24775-3_70 ◽

2004 ◽

pp. 590-594 ◽

Cited By ~ 2

Author(s):

Kwok-Chung Au ◽

Kwok-Wai Cheung

Keyword(s):

Markov Model ◽

Hidden Markov Model ◽

Information Extraction ◽

Hidden Markov ◽

Kl Divergence ◽

Model Topology

Download Full-text

Research of Information Extraction Algorithm based on Hidden Markov Model

The 2nd International Conference on Information Science and Engineering ◽

10.1109/icise.2010.5690348 ◽

2010 ◽

Cited By ~ 2

Author(s):

Cailan Zhou ◽

Shasha Li

Keyword(s):

Markov Model ◽

Hidden Markov Model ◽

Information Extraction ◽

Hidden Markov ◽

Extraction Algorithm

Download Full-text

Web object information extraction based on generalized hidden Markov model

2007 International Symposium on Communications and Information Technologies ◽

10.1109/iscit.2007.4392257 ◽

2007 ◽

Cited By ~ 1

Author(s):

Jing Wang ◽

Yong Yao ◽

Zhi Jing Liu

Keyword(s):

Markov Model ◽

Hidden Markov Model ◽

Information Extraction ◽

Hidden Markov

Download Full-text

A Generalized Hidden Markov Model Approach for Web Information Extraction

2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings)(WI'06) ◽

10.1109/wi.2006.13 ◽

2006 ◽

Cited By ~ 6

Author(s):

Ping Zhong ◽

Jinlin Chen

Keyword(s):

Markov Model ◽

Hidden Markov Model ◽

Information Extraction ◽

Hidden Markov ◽

Web Information Extraction ◽

Web Information ◽

Model Approach

Download Full-text

Credit Card Fraud Detection Performance Improvement using Advanced Super Gradient Boosting Algorithm

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.f3457.049620 ◽

2020 ◽

Vol 9 (6) ◽

pp. 179-184

Keyword(s):

Text Mining ◽

Markov Model ◽

Hidden Markov Model ◽

Credit Card ◽

Hidden Markov ◽

Training Data ◽

Gradient Boosting ◽

Data Set ◽

Credit Card Fraud ◽

Mining Algorithm

Credit card fraud introduces to the physical loss of a credit card or the destruction of sensitive credit card data. Several text mining procedures can be used for disclosure. This investigation reveals several algorithms that can be used to analyze transactions as a fraud or as a real background. This paper represents the possibility of fraudulent transactions in the prevalence and meaning of credit card usage also, Credit card fraud data collection was used in the investigation. Since the dataset was largely unbalanced, SMOTE (Synthetic Minority oversampling Technique) is applying for an overdose. In addition, jobs selected, and the data set divided into two parts, training data and test data. In this paper, The Advanced Super Gradient Boostingbased Text mining Algorithm (ASGB) suggested to detect the fraud transaction in Credit card transactions. ASGB is a Decision-Tree-Based Ensemble Text mining algorithm that utilizes a gradient boosting framework. In forecast difficulties, including unstructured data (Images, Text, etc.), artificial neural networks tend to exceed all other algorithms or structures. The proposed algorithms used in the experiment were the Hidden Markov Model, Random Forest, Gradient Boosting, and Enhanced Hidden Markov Model. The Experimental Results show that proposed algorithms, a welltuned ASGB classifier outperforms all of them. And it presents better Precision is 99.1%, and Recall is 99.8%, F-measure is 99.5%.

Download Full-text