D-Pattern Evolving and Inner Pattern Evolving for High Performance Text Mining

As a typical unsupervised learning method, the TextRank algorithm performs well for large-scale text mining, especially for automatic summarization or keyword extraction. However, TextRank only considers the similarities between sentences in the processes of automatic summarization and neglects information about text structure and context. To overcome these shortcomings, the authors propose an improved highly-scalable method, called iTextRank. When building a TextRank graph in their new method, the authors compute sentence similarities and adjust the weights of nodes by considering statistical and linguistic features, such as similarities in titles, paragraph structures, special sentences, sentence positions and lengths. Their analysis shows that the time complexity of iTextRank is comparable with TextRank. More importantly, two experiments show that iTextRank has a higher accuracy and lower recall rate than TextRank, and it is as effective as several popular online automatic summarization systems.

Download Full-text

Hierarchical pattern classification for high performance text-independent speaker verification systems

Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing ◽

10.1109/icassp.1994.389331 ◽

2002 ◽

Cited By ~ 4

Author(s):

J. Sorensen ◽

M. Savic

Keyword(s):

Pattern Classification ◽

High Performance ◽

Speaker Verification ◽

Verification Systems ◽

Hierarchical Pattern ◽

Text Independent Speaker Verification ◽

Performance Text

Download Full-text

Grid-line watermarking: A novel method for creating a high-performance text-image watermark

ScienceAsia ◽

10.2306/scienceasia1513-1874.2013.39.423 ◽

2013 ◽

Vol 39 (4) ◽

pp. 423

Author(s):

Wiyada Yawai ◽

Nualsawat Hiransakolwong

Keyword(s):

High Performance ◽

Image Watermark ◽

Grid Line ◽

Novel Method ◽

Performance Text

Download Full-text

Workload characterization and optimization of high-performance text indexing on the Cell Broadband Engine™ (Cell/B.E.)

2009 IEEE International Symposium on Workload Characterization (IISWC) ◽

10.1109/iiswc.2009.5306798 ◽

2009 ◽

Cited By ~ 3

Author(s):

Daniele Paolo Scarpazza ◽

Gordon W. Braudaway

Keyword(s):

High Performance ◽

Workload Characterization ◽

Text Indexing ◽

Cell Broadband Engine ◽

Performance Text

Download Full-text

A High Performance Text Vector Similarity Search Method Based on Overlapping Degree

2019 International Conference on Data Mining Workshops (ICDMW) ◽

10.1109/icdmw.2019.00071 ◽

2019 ◽

Author(s):

Peng Zhao ◽

Fan Yang ◽

Zhibin Zhang ◽

Jiafeng Guo ◽

Xueqi Cheng

Keyword(s):

Similarity Search ◽

High Performance ◽

Search Method ◽

Performance Text

Download Full-text

High Performance Text Categorization System Based on a Novel Neural Network Algorithm

The Sixth IEEE International Conference on Computer and Information Technology (CIT'06) ◽

10.1109/cit.2006.98 ◽

2006 ◽

Cited By ~ 1

Author(s):

Cheng Li ◽

Soon Park

Keyword(s):

Neural Network ◽

High Performance ◽

Text Categorization ◽

Network Algorithm ◽

Neural Network Algorithm ◽

Categorization System ◽

Performance Text

Download Full-text