The Big Data Mining Approach for Finding top rated URL

Abstract Finding out the widely used URL’s from online shopping sites for any particular category is a difficult task as there are many heterogeneous and multi-dimensional data set which depends on various factors. Traditional data mining methods are limited to homogenous data source, so they fail to sufficiently consider the characteristics of heterogeneous data. This paper presents a consistent Big Data mining search which performs analytics on text data to find the top rated URL’s. Though many heuristic search methods are available, our proposed method solves the problem of searching compared with traditional methods in data mining. The sample results are obtained in optimal time and are compared with other methods which is effective and efficient.

Download Full-text

Analyze the energy consumption characteristics and affecting factors of Taiwan's convenience stores-using the big data mining approach

Energy and Buildings ◽

10.1016/j.enbuild.2018.03.021 ◽

2018 ◽

Vol 168 ◽

pp. 120-136 ◽

Cited By ~ 8

Author(s):

Chung-Feng Jeffrey Kuo ◽

Chieh-Hung Lin ◽

Ming-Hao Lee

Keyword(s):

Data Mining ◽

Big Data ◽

Energy Consumption ◽

Affecting Factors ◽

Big Data Mining ◽

Data Mining Approach ◽

Convenience Stores

Download Full-text

On the power of big data: Mining structures from massive, unstructured text data

2016 IEEE International Conference on Big Data (Big Data) ◽

10.1109/bigdata.2016.7840582 ◽

2016 ◽

Cited By ~ 2

Author(s):

Jiawei Han

Keyword(s):

Data Mining ◽

Big Data ◽

Text Data ◽

Big Data Mining ◽

Unstructured Text

Download Full-text

Parameterization of LSB in Self-Recovery Speech Watermarking Framework in Big Data Mining

Security and Communication Networks ◽

10.1155/2017/3847092 ◽

2017 ◽

Vol 2017 ◽

pp. 1-12 ◽

Cited By ~ 2

Author(s):

Shuo Li ◽

Zhanjie Song ◽

Wenhuan Lu ◽

Daniel Sun ◽

Jianguo Wei

Keyword(s):

Data Mining ◽

Big Data ◽

Least Significant Bit ◽

Speech Watermarking ◽

Big Data Mining ◽

Data Mining Approach ◽

Watermark Embedding ◽

Data Infrastructures ◽

Original Speech

The privacy is a major concern in big data mining approach. In this paper, we propose a novel self-recovery speech watermarking framework with consideration of trustable communication in big data mining. In the framework, the watermark is the compressed version of the original speech. The watermark is embedded into the least significant bit (LSB) layers. At the receiver end, the watermark is used to detect the tampered area and recover the tampered speech. To fit the complexity of the scenes in big data infrastructures, the LSB is treated as a parameter. This work discusses the relationship between LSB and other parameters in terms of explicit mathematical formulations. Once the LSB layer has been chosen, the best choices of other parameters are then deduced using the exclusive method. Additionally, we observed that six LSB layers are the limit for watermark embedding when the total bit layers equaled sixteen. Experimental results indicated that when the LSB layers changed from six to three, the imperceptibility of watermark increased, while the quality of the recovered signal decreased accordingly. This result was a trade-off and different LSB layers should be chosen according to different application conditions in big data infrastructures.

Download Full-text

A big data mining approach for environmental emissions prediction of die casting process

The International Journal of Advanced Manufacturing Technology ◽

10.1007/s00170-021-07125-z ◽

2021 ◽

Author(s):

Erheng Chen ◽

Huajun Cao ◽

Hongcheng Li ◽

Hao Yi ◽

Yanni Li

Keyword(s):

Data Mining ◽

Big Data ◽

Die Casting ◽

Casting Process ◽

Big Data Mining ◽

Data Mining Approach ◽

Environmental Emissions ◽

Die Casting Process

Download Full-text

An Empirical Perusal of Distance Measures for Clustering with Big Data Mining

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.f8078.088619 ◽

2019 ◽

Vol 8 (6) ◽

pp. 606-616 ◽

Cited By ~ 1

Keyword(s):

Data Mining ◽

Big Data ◽

Clustering Algorithm ◽

Distance Measure ◽

Confusion Matrix ◽

Heterogeneous Data ◽

Distance Measures ◽

Research Perspective ◽

Big Data Mining ◽

Data Criterion

The distance measure is the core idea of data mining techniques such as classification, clustering, and statistical analysis and so on. All clustering taxonomies such as partition, hierarchical, density, grid, model, fuzzy and graphs used to distance measures for the data point’s categorization under difference cluster, cluster construction and validation. Big data mining is the advanced concept of data mining respect to the big data dimensions. When traditional clustering algorithm is used under the big data mining the distance measure is needed for scalable under big data mining and support to a huge size dataset, heterogeneous data and sources, and velocity characteristics of the big data. From a theoretically, practically and the existing research perspective, the paper focuses on volume, variety, and velocity big data criterion for identifying a distance measure for the big data mining and recognize how to distance measure works under clustering taxonomy. This study also analyzed all distance measures accuracy with the help of a confusion matrix through clustering.

Download Full-text