Smoothed analysis for tensor methods in unsupervised learning

Unsupervised learning in machine learning divides data into several groups. The observations in the same group have similar characteristics and the observations in the different groups have the different characteristics. In the paper, we classify data by partitioning around medoids which have some advantages over the k-means clustering. We apply it to baseball players in Korea Baseball League. We also apply the principal component analysis to data and draw the graph using two components for axis. We interpret the meaning of the clustering graphically through the procedure. The combination of the partitioning around medoids and the principal component analysis can be used to any other data and the approach makes us to figure out the characteristics easily.

Download Full-text

Preface to the special issue, CMAM 2011, no. 3.

Computational Methods in Applied Mathematics ◽

10.2478/cmam-2011-0014 ◽

2011 ◽

Vol 11 (3) ◽

pp. 272

Author(s):

Ivan Gavrilyuk ◽

Boris Khoromskij ◽

Eugene Tyrtyshnikov

Keyword(s):

Separation Of Variables ◽

Numerical Algorithms ◽

Multilevel Methods ◽

High Dimensional ◽

Special Issue ◽

High Dimensions ◽

Guiding Principle ◽

Tensor Methods ◽

Closed World ◽

The One

Abstract In the recent years, multidimensional numerical simulations with tensor-structured data formats have been recognized as the basic concept for breaking the "curse of dimensionality". Modern applications of tensor methods include the challenging high-dimensional problems of material sciences, bio-science, stochastic modeling, signal processing, machine learning, and data mining, financial mathematics, etc. The guiding principle of the tensor methods is an approximation of multivariate functions and operators with some separation of variables to keep the computational process in a low parametric tensor-structured manifold. Tensors structures had been wildly used as models of data and discussed in the contexts of differential geometry, mechanics, algebraic geometry, data analysis etc. before tensor methods recently have penetrated into numerical computations. On the one hand, the existing tensor representation formats remained to be of a limited use in many high-dimensional problems because of lack of sufficiently reliable and fast software. On the other hand, for moderate dimensional problems (e.g. in "ab-initio" quantum chemistry) as well as for selected model problems of very high dimensions, the application of traditional canonical and Tucker formats in combination with the ideas of multilevel methods has led to the new efficient algorithms. The recent progress in tensor numerical methods is achieved with new representation formats now known as "tensor-train representations" and "hierarchical Tucker representations". Note that the formats themselves could have been picked up earlier in the literature on the modeling of quantum systems. Until 2009 they lived in a closed world of those quantum theory publications and never trespassed the territory of numerical analysis. The tremendous progress during the very recent years shows the new tensor tools in various applications and in the development of these tools and study of their approximation and algebraic properties. This special issue treats tensors as a base for efficient numerical algorithms in various modern applications and with special emphases on the new representation formats.

Download Full-text

An Unsupervised Learning Algorithm to Compute Fluid Volumes From NMR T1-T2 Logs in Unconventional Reservoirs

Petrophysics – The SPWLA Journal of Formation Evaluation and Reservoir Description ◽

10.30632/pjv59n5-2018a4 ◽

2018 ◽

Vol 59 ◽

pp. 617-632 ◽

Cited By ~ 3

Author(s):

Lalitha Venkataramanan ◽

◽

Noyan Evirgen ◽

David F. Allen ◽

Albina Mutina ◽

...

Keyword(s):

Unsupervised Learning ◽

Learning Algorithm ◽

Unconventional Reservoirs

Download Full-text

Klasifikasi pada Tempat Tinggal Menurut Provinsi dan Jenis Kepemilikan Berdasarkan Algoritma K-Means

STRING (Satuan Tulisan Riset dan Inovasi Teknologi) ◽

10.30998/string.v4i3.5932 ◽

2020 ◽

Vol 4 (3) ◽

pp. 247

Author(s):

Dwi Swasono Rachmad

Keyword(s):

Data Mining ◽

Unsupervised Learning ◽

Residential Buildings ◽

Government Agency ◽

Role Of Government ◽

The Republic ◽

Household Processing ◽

Central Statistics

Housing is derived from the word house which means a place that has a place to live which will stay or stop in a certain time. Housing is a residence that has been grouped into a place that has facilities and infrastructure. The problem in this study focuses on the type of residential ownership in the form of SHM ART, SHM Non ART, NON SHM and others. These four types can be used to know the percentage of ownership in all provinces in Indonesia. Due to the fact that there is still a lot of information about the type of certificate ownership, there is still not much ownership. Therefore, the use of the k-Means algorithm as a data mining concept in the form of clusters, where the data already has parameters or values that fall into the category of unsupervised learning. That data produced the best. The data was obtained from published sources of the Republic of Indonesia government agency, namely the Central Statistics Agency data with the category of household processing with self-owned residential buildings purchased from developers or non-developers by province and type of ownership in 2016 throughout Indonesia. In conducting the dataset, researchers used the RapidMiner application as a clustering process application. This research shows that there are more types of ownership in the SHM ART, but for other values it is still smaller than the value in other types of ownership which is the second largest value. So, in this case, the role of government in providing assistance in the process of ownership in order to become SHM ART is very important.

Download Full-text