Knowledge based dimensionality reduction for technical text mining

Author(s):  
Walid Shalaby ◽  
Wlodek Zadrozny ◽  
Sean Gallagher



2006 ◽  
Vol 05 (03) ◽  
pp. 211-222
Author(s):  
Imad Rahal ◽  
Hassan Najadat ◽  
William Perrizo

The importance of text mining stems from the availability of huge volumes of text databases holding a wealth of valuable information that needs to be mined. Text mining is a coarse area encompassing many finer branches one of which is text categorisation or text classification. Text categorisation is the process of assigning class labels to documents based entirely on their textual contents where we are given a document d, and asked to find its subject matter or class label, Ci. In this paper, an optimised k-Nearest Neighbours classifier that uses discretisation, the P-tree technology, and dimensionality reduction to achieve a high degree of accuracy, space utilisation and time efficiency is proposed. One of the fundamental contributions of this work is that as new samples arrive, the proposed classifier can find the k nearest neighbours to the new sample from the training space without a single database scan.





Author(s):  
Johannes Zenkert ◽  
Christian Weber ◽  
Andre Klahold ◽  
Madjid Fathi ◽  
Kai Hahn
Keyword(s):  


2021 ◽  
Vol 11 (1) ◽  
pp. 6656-6661
Author(s):  
A. Alqahtani ◽  
H. Alhakami ◽  
T. Alsubait ◽  
A. Baz

Text matching is the process of identifying and locating particular text matches in raw data. Text matching is a vital component in practical applications and an essential process in several fields. Furthermore, several dynamic techniques have been introduced in this context in order to create ease in pattern generation from words. The process involves matching of text files, text mining, text clustering, association rule extraction, world cloud, natural language processing, and text similarity measures (knowledge-based, corpus-based, string-based, and hybrid similarities). The string-based approach forms the most conspicuous form of text mining applied in different cases. The survey attempted in the present study covers a new research premise that uses text-matching to solve problems. The study also summarizes different approaches that are being used in this domain.



2014 ◽  
Vol 71 ◽  
pp. 376-388 ◽  
Author(s):  
Juan I. Guerrero ◽  
Carlos León ◽  
Iñigo Monedero ◽  
Félix Biscarri ◽  
Jesús Biscarri


2017 ◽  
Vol 139 ◽  
pp. 00015
Author(s):  
Jihong Liu ◽  
Jiaji Wang ◽  
Kejian Wang


Sign in / Sign up

Export Citation Format

Share Document