Learning to Generate Optimized Term Weighting for Web Documents Classification - A Parallel Mimetic Approach Based on Support Vector Machines

2016 ◽  
Vol 11 (12) ◽  
pp. 1147
Author(s):  
Abderrahmane Bendahmane ◽  
Abdelkader Benyettou
2017 ◽  
Vol 1 (1) ◽  
pp. 19-25
Author(s):  
Fithri Selva Jumeilah

Research output at every college continues to grow, and it is stored in both softcopy and hardcopy form. This research should be organized into categories to make searching easier for people who need references. Categorizing the research requires a text mining method, one of which is Support Vector Machines (SVM). To learn the characteristics of each category, the study uses secondary data consisting of a collection of research abstracts. The data are preprocessed in several stages: case folding, which converts all letters to lowercase; stop word removal, which removes very common words; tokenizing, which discards punctuation; and stemming, which finds root words by removing prefixes and suffixes. The preprocessed data are then converted into numerical form in the term weighting stage, which assigns a weight to the contribution of each word. The term weighting results provide the data used for training and testing. Training is done by supplying, as input, text data whose class or category is known. The Support Vector Machines algorithm then transforms the input data into a rule, function, or knowledge model that can be used for prediction. The results of this study show that SVM categorizes the research very well, with the tests yielding an accuracy of 90%.
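
The preprocessing and term weighting stages described in this abstract can be sketched in a few lines of Python. This is only an illustrative sketch, not the author's implementation: the stop word list, the suffix list, and the function names `stem`, `preprocess`, and `tf_idf` are hypothetical. The abstract does not say which weighting scheme was used, so TF-IDF is assumed here. The sketch also tokenizes before removing stop words, since filtering needs individual tokens, and its toy stemmer strips only a few English suffixes, whereas the abstract describes removing both prefixes and suffixes.

```python
import math
import re
from collections import Counter

# Hypothetical, deliberately tiny stop word list (illustration only).
STOP_WORDS = {"the", "a", "an", "of", "and", "is", "in", "to", "for", "with", "on"}

# Hypothetical suffixes for a naive stemmer; a real system would use a
# proper stemmer for the language of the abstracts.
SUFFIXES = ("ing", "es", "s")


def stem(word):
    """Strip the first matching suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def preprocess(text):
    """Case folding, tokenizing, stop word removal, then stemming."""
    lowered = text.lower()                    # case folding
    tokens = re.findall(r"[a-z]+", lowered)   # tokenizing; drops punctuation and digits
    tokens = [t for t in tokens if t not in STOP_WORDS]
    return [stem(t) for t in tokens]


def tf_idf(documents):
    """Return (vocabulary, vectors) with one TF-IDF vector per document.

    Assumed weighting: tf = count / document length, idf = ln(N / df).
    """
    tokenized = [preprocess(d) for d in documents]
    n_docs = len(tokenized)

    # Document frequency: in how many documents each term appears.
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))

    vocabulary = sorted(doc_freq)
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens) or 1  # avoid dividing by zero for empty documents
        vectors.append(
            [(counts[t] / total) * math.log(n_docs / doc_freq[t]) for t in vocabulary]
        )
    return vocabulary, vectors
```

The resulting vectors, paired with the known category of each abstract, would form the training and test data for an SVM classifier. The classifier itself is not shown here.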


2018 ◽  
Author(s):  
Nelson Marcelo Romero Aquino ◽  
Matheus Gutoski ◽  
Leandro Takeshi Hattori ◽  
Heitor Silvério Lopes
