ANALYSIS OF THE MEDICAL DATA STRUCTURE USING SELF-ORGANISING MAPS

International Journal of Computing ◽

10.47839/ijc.6.3.461 ◽

2014 ◽

pp. 124-136

Author(s):

O. V. Hodych ◽

Yu. V. Nikolsky ◽

V. V. Pasichnyk ◽

Yu. M. Scherbyna

Keyword(s):

Data Structure ◽

Structure Analysis ◽

High Dimensional Data ◽

Research Work ◽

Medical Data ◽

High Dimensional ◽

Research Results ◽

The Real ◽

Work Related ◽

Data Structure Analysis

In this article the authors discuss several approaches to high-dimensional data structure analysis using Self-Organising Maps. The described approaches utilise graphical images for the purpose of data structure interpretation. The discussed techniques have been evaluated using real medical data from cardiology. The research, the results of which are outlined in this paper, is a continuation of earlier work related to the analysis of the same medical data. It is envisaged that the results obtained in this and earlier research work will form a foundation for the creation of a robust technology to be used for the automation of diagnostic tasks in medicine.
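
As an illustration of the general idea, the following is a minimal sketch of a Self-Organising Map together with a U-matrix, one common graphical image used to read data structure off a trained map. It is not the authors' implementation: the function names (`train_som`, `u_matrix`, `quantization_error`), the rectangular grid, the Gaussian neighbourhood and the linear decay schedules are all assumptions chosen for brevity.

```python
import numpy as np

def train_som(data, rows=6, cols=6, epochs=20, lr0=0.5, sigma0=3.0, seed=0):
    """Train a small rectangular SOM; returns weights of shape (rows, cols, dim)."""
    rng = np.random.default_rng(seed)
    n, dim = data.shape
    # Initialise unit weights from randomly chosen data samples (assumed scheme).
    weights = data[rng.integers(0, n, rows * cols)].reshape(rows, cols, dim).astype(float)
    # Grid coordinates of every unit, used by the neighbourhood function.
    grid = np.stack(np.meshgrid(np.arange(rows), np.arange(cols), indexing="ij"), axis=-1)
    total_steps = epochs * n
    step = 0
    for _ in range(epochs):
        for x in data[rng.permutation(n)]:
            frac = step / total_steps
            lr = lr0 * (1.0 - frac)                # learning rate decays linearly
            sigma = sigma0 * (1.0 - frac) + 0.5    # neighbourhood radius shrinks
            # Best-matching unit: the unit whose weight vector is closest to x.
            dists = np.linalg.norm(weights - x, axis=2)
            bmu = np.unravel_index(np.argmin(dists), dists.shape)
            # Gaussian neighbourhood centred on the BMU, measured on the grid.
            grid_d2 = np.sum((grid - np.array(bmu)) ** 2, axis=2)
            h = np.exp(-grid_d2 / (2.0 * sigma ** 2))
            # Pull every unit towards x, weighted by its neighbourhood value.
            weights += lr * h[..., None] * (x - weights)
            step += 1
    return weights

def u_matrix(weights):
    """Mean distance from each unit to its 4-connected grid neighbours."""
    rows, cols, _ = weights.shape
    um = np.zeros((rows, cols))
    for i in range(rows):
        for j in range(cols):
            nbrs = [(i + di, j + dj)
                    for di, dj in ((1, 0), (-1, 0), (0, 1), (0, -1))
                    if 0 <= i + di < rows and 0 <= j + dj < cols]
            um[i, j] = np.mean([np.linalg.norm(weights[i, j] - weights[a, b])
                                for a, b in nbrs])
    return um

def quantization_error(data, weights):
    """Average distance from each sample to its best-matching unit."""
    flat = weights.reshape(-1, weights.shape[-1])
    d = np.linalg.norm(data[:, None, :] - flat[None, :, :], axis=2)
    return d.min(axis=1).mean()
```

In such a sketch, high values in the U-matrix mark borders between groups of similar units, so rendering it as a greyscale image gives a rough picture of cluster structure in the input data.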


High-dimensional data structure analysis using Self-Organising Maps

10.1109/cadsm.2007.4297529 ◽

2007 ◽

Cited By ~ 2

Author(s):

Oles Hodych ◽

Iouri Nikolski ◽

Volodymyr Pasichnyk ◽

Yuri Shcherbyna

Keyword(s):

Data Structure ◽

Structure Analysis ◽

High Dimensional Data ◽

High Dimensional ◽

Data Structure Analysis


Opportunities and Challenges of Feature Selection Methods for High Dimensional Data: A Review

Ingénierie des systèmes d'information ◽

10.18280/isi.260107 ◽

2021 ◽

Vol 26 (1) ◽

pp. 67-77

Author(s):

Siva Sankari Subbiah ◽

Jayakumar Chinnappan

Keyword(s):

Feature Selection ◽

Big Data ◽

Large Scale ◽

High Dimensional Data ◽

Research Work ◽

Basic Feature ◽

High Dimensional ◽

Selection Methods ◽

Fast Development ◽

Improved Accuracy

Nowadays, organizations collect huge volumes of data without knowing its usefulness. The fast development of the Internet helps organizations capture data in many different formats through the Internet of Things (IoT), social media, and other disparate sources. The dimension of datasets increases day by day at an extraordinary rate, resulting in large-scale datasets with high dimensionality. The present paper reviews the opportunities and challenges of feature selection for processing high-dimensional data with reduced complexity and improved accuracy. In the modern big data world, feature selection plays a significant role in reducing dimensionality and overfitting in the learning process. Researchers have proposed many feature selection methods for obtaining more relevant features, especially from big datasets, which help provide accurate learning results without degradation in performance. This paper discusses the importance of feature selection, basic feature selection approaches, centralized and distributed big data processing using Hadoop and Spark, and the challenges of feature selection, and it summarizes the related research work of various researchers. As a result, big data analysis with feature selection improves the accuracy of learning.
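
To make the notion of a basic feature selection approach concrete, here is a minimal sketch of a filter method: each feature is scored independently by its absolute Pearson correlation with the target, and the top-scoring features are kept. The function name `select_top_k_features` and the choice of correlation as the score are assumptions for illustration; the review itself covers a broader family of methods.

```python
import numpy as np

def select_top_k_features(X, y, k):
    """Rank features by |Pearson correlation| with y; return (top-k indices, scores)."""
    Xc = X - X.mean(axis=0)   # centre each feature column
    yc = y - y.mean()         # centre the target
    denom = np.sqrt((Xc ** 2).sum(axis=0) * (yc ** 2).sum())
    # Constant features have zero variance; score them 0 instead of dividing by zero.
    scores = np.zeros(X.shape[1])
    nonzero = denom > 0
    scores[nonzero] = np.abs(Xc[:, nonzero].T @ yc) / denom[nonzero]
    # Indices of the k highest-scoring features, best first.
    order = np.argsort(scores)[::-1][:k]
    return order, scores
```

A filter of this kind scores every feature separately, so it is cheap on wide datasets but cannot detect features that are only informative in combination.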


A Novel Density-based Technique for Outlier Detection of High Dimensional Data Utilizing Full Feature Space

Information Technology And Control ◽

10.5755/j01.itc.50.1.25588 ◽

2021 ◽

Vol 50 (1) ◽

pp. 138-152

Author(s):

Mujeeb Ur Rehman ◽

Dost Muhammad Khan

Keyword(s):

Data Mining ◽

Outlier Detection ◽

High Dimensional Data ◽

Research Work ◽

Feature Space ◽

High Dimensional ◽

Data Set ◽

Data Points ◽

Low Dimensional ◽

Intrinsic Feature

Recently, anomaly detection has drawn considerable attention from data mining researchers as its use has grown steadily in practical domains such as product marketing, fraud detection, medical diagnosis, and fault detection. Outlier detection in high-dimensional data poses exceptional challenges for data mining experts because of the curse of dimensionality and the resemblance of distant and adjoining points. Traditional outlier detection algorithms and techniques have been evaluated on the full feature space. Customary methodologies concentrate largely on low-dimensional data and hence are ineffective at discovering anomalies in data sets with a high number of dimensions. Finding anomalies in a high-dimensional data set becomes very difficult and tedious when all subsets of projections need to be explored. All data points in high-dimensional data behave like similar observations because of an intrinsic feature of such data: the contrast between the distances to near and far observations approaches zero as the number of dimensions tends to infinity. This research work proposes a novel technique that explores deviation among all data points and embeds its findings inside well-established density-based techniques. This is a state-of-the-art technique, as it opens a new line of research towards resolving inherent problems of high-dimensional data where outliers reside within clusters of different densities. A high-dimensional dataset from the UCI Machine Learning Repository is chosen to test the proposed technique, and its results are compared with those of density-based techniques to evaluate its efficiency.


Data Structure Analysis

Applied Statistics Using SPSS, STATISTICA, MATLAB and R ◽

10.1007/978-3-540-71972-4_8 ◽

2007 ◽

pp. 329-352 ◽

Cited By ~ 1

Author(s):

Joaquim P. Marques de Sá

Keyword(s):

Data Structure ◽

Structure Analysis ◽

Data Structure Analysis


An intelligent agent for ERP's data structure analysis based on ANSI/ISA-95 standard

Computers in Industry ◽

10.1016/j.compind.2015.07.011 ◽

2015 ◽

Vol 73 ◽

pp. 39-50 ◽

Cited By ~ 5

Author(s):

Melina C. Vidoni ◽

Aldo R. Vecchietti

Keyword(s):

Data Structure ◽

Structure Analysis ◽

Intelligent Agent ◽

Data Structure Analysis


Comment on "A Nonlinear Mapping for Data Structure Analysis"

IEEE Transactions on Computers ◽

10.1109/tc.1972.5008933 ◽

1972 ◽

Vol C-21 (2) ◽

pp. 220-221 ◽

Cited By ~ 8

Author(s):

I. White

Keyword(s):

Data Structure ◽

Structure Analysis ◽

Nonlinear Mapping ◽

Data Structure Analysis


Improving data structure operations for high dimensional data

2016 1st India International Conference on Information Processing (IICIP) ◽

10.1109/iicip.2016.7975327 ◽

2016 ◽

Author(s):

Bhoomika Panchal ◽

Vinit Kumar

Keyword(s):

Data Structure ◽

High Dimensional Data ◽

High Dimensional


M-Denclue for Effective Data Clustering in High Dimensional Non-Linear Data

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.a9109.119119 ◽

2019 ◽

Vol 9 (1) ◽

pp. 2925-2927

Keyword(s):

Clustering Algorithms ◽

High Dimensional Data ◽

Research Work ◽

Curse Of Dimensionality ◽

Distance Measures ◽

High Dimensional ◽

Clustering Methods ◽

Non Linear ◽

Low Dimensional ◽

Automatic Grouping

Clustering is a data mining task devoted to the automatic grouping of data based on mutual similarity. Clustering in high-dimensional spaces is a recurrent problem in many domains. High dimensionality affects the time complexity, space complexity, scalability, and accuracy of clustering methods. High-dimensional non-linear data usually live in different low-dimensional subspaces hidden in the original space. As high-dimensional objects appear almost alike, new approaches to clustering are required. This research has focused on developing mathematical models, techniques, and clustering algorithms specifically for high-dimensional data. With the rapid growth in the fields of communication and technology, there is tremendous growth in high-dimensional data spaces. As the number of dimensions of high-dimensional non-linear data increases, many clustering techniques begin to suffer from the curse of dimensionality, degrading the quality of the results. In high-dimensional non-linear data, the data become very sparse and distance measures become increasingly meaningless. The principal challenge for clustering high-dimensional data is to overcome the "curse of dimensionality". This research work concentrates on devising an enhanced algorithm for clustering high-dimensional non-linear data.
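
The claim that distance measures lose meaning in high dimensions can be checked numerically. The sketch below is an illustration under stated assumptions, not part of the cited work: it draws uniform random points in the unit hypercube and computes the relative contrast (d_max - d_min) / d_min of Euclidean distances from a random query point. The function name `relative_contrast` and the uniform data model are hypothetical choices.

```python
import numpy as np

def relative_contrast(n_points, dim, seed=0):
    """(d_max - d_min) / d_min for distances from one query to n_points uniform points."""
    rng = np.random.default_rng(seed)
    points = rng.random((n_points, dim))   # samples from the unit hypercube
    query = rng.random(dim)
    d = np.linalg.norm(points - query, axis=1)
    return (d.max() - d.min()) / d.min()

if __name__ == "__main__":
    # Print the contrast for a range of dimensionalities.
    for dim in (2, 10, 100, 1000):
        print(dim, relative_contrast(500, dim))
```

Under this data model the contrast shrinks as the dimension grows, which means the nearest and farthest neighbours of a point become hard to tell apart; this is the effect that degrades distance-based clustering.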


Canonical correlation analysis in high dimensions with structured regularization

Statistical Modelling ◽

10.1177/1471082x211041033 ◽

2021 ◽

pp. 1471082X2110410

Author(s):

Elena Tuzhilina ◽

Leonardo Tozzi ◽

Trevor Hastie

Keyword(s):

Data Structure ◽

Correlation Analysis ◽

Canonical Correlation Analysis ◽

Canonical Correlation ◽

Multivariate Data ◽

High Dimensional Data ◽

High Dimensional ◽

High Dimensions ◽

Structured Regularization ◽

Data Matrices

Canonical correlation analysis (CCA) is a technique for measuring the association between two multivariate data matrices. A regularized modification of canonical correlation analysis (RCCA) which imposes an [Formula: see text] penalty on the CCA coefficients is widely used in applications with high-dimensional data. One limitation of such regularization is that it ignores any data structure, treating all the features equally, which can be ill-suited for some applications. In this article we introduce several approaches to regularizing CCA that take the underlying data structure into account. In particular, the proposed group regularized canonical correlation analysis (GRCCA) is useful when the variables are correlated in groups. We illustrate some computational strategies to avoid excessive computations with regularized CCA in high dimensions. We demonstrate the application of these methods in our motivating application from neuroscience, as well as in a small simulation example.


Exploration of Pyramids with Equal Edges to Overcome Misconception

Педагогически форум ◽

10.15547/pf.2021.015 ◽

2021 ◽

Vol 9 (3) ◽

pp. 13-19

Author(s):

Toni Chehlarova

Keyword(s):

Research Work ◽

Computer Models ◽

Research Results ◽

Work Related ◽

3D Printed

A misconception related to the use of a non-existent figure in a task is presented, along with an opportunity for research work related to this problem. Computer models have been created to provide conditions for self-formulation of hypotheses related to the existence of an n-gonal pyramid with equal edges. The computer models are developed with the dynamic software GeoGebra. Variants of manipulatives are indicated, and an illustration is given with magnetic manipulatives. Ideas for using the research results to create 3D printed models are described.
