High-Dimensional Brain in a High-Dimensional World: Blessing of Dimensionality

High-dimensional data and high-dimensional representations of reality are inherent features of modern Artificial Intelligence systems and applications of machine learning. The well-known phenomenon of the “curse of dimensionality” states: many problems become exponentially difficult in high dimensions. Recently, the other side of the coin, the “blessing of dimensionality”, has attracted much attention. It turns out that generic high-dimensional datasets exhibit fairly simple geometric properties. Thus, there is a fundamental tradeoff between complexity and simplicity in high dimensional spaces. Here we present a brief explanatory review of recent ideas, results and hypotheses about the blessing of dimensionality and related simplifying effects relevant to machine learning and neuroscience.

Download Full-text

Outlier Detection in High Dimensional Data

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.e2675.0610521 ◽

2021 ◽

Vol 10 (5) ◽

pp. 128-130

Author(s):

Anusha L. ◽

Nagaraja G S

Keyword(s):

Artificial Intelligence ◽

Machine Learning ◽

Neural Networks ◽

Decision Making ◽

Outlier Detection ◽

Text Processing ◽

High Dimensional Data ◽

High Dimensional ◽

Human Intelligence ◽

Over Time

Artificial intelligence (AI) is the science that allows computers to replicate human intelligence in areas such as decision-making, text processing, visual perception. Artificial Intelligence is the broader field that contains several subfields such as machine learning, robotics, and computer vision. Machine Learning is a branch of Artificial Intelligence that allows a machine to learn and improve at a task over time. Deep Learning is a subset of machine learning that makes use of deep artificial neural networks for training. The paper proposed on outlier detection for multivariate high dimensional data for Autoencoder unsupervised model.

Download Full-text

Data Quality Measures and Efficient Evaluation Algorithms for Large-Scale High-Dimensional Data

Applied Sciences ◽

10.3390/app11020472 ◽

2021 ◽

Vol 11 (2) ◽

pp. 472

Author(s):

Hyeongmin Cho ◽

Sangkyun Lee

Keyword(s):

Machine Learning ◽

Data Quality ◽

Large Scale ◽

High Dimensional Data ◽

Quality Measures ◽

Training Data ◽

Measure Data ◽

High Dimensional ◽

Small Scale ◽

Class Separability

Machine learning has been proven to be effective in various application areas, such as object and speech recognition on mobile systems. Since a critical key to machine learning success is the availability of large training data, many datasets are being disclosed and published online. From a data consumer or manager point of view, measuring data quality is an important first step in the learning process. We need to determine which datasets to use, update, and maintain. However, not many practical ways to measure data quality are available today, especially when it comes to large-scale high-dimensional data, such as images and videos. This paper proposes two data quality measures that can compute class separability and in-class variability, the two important aspects of data quality, for a given dataset. Classical data quality measures tend to focus only on class separability; however, we suggest that in-class variability is another important data quality factor. We provide efficient algorithms to compute our quality measures based on random projections and bootstrapping with statistical benefits on large-scale high-dimensional data. In experiments, we show that our measures are compatible with classical measures on small-scale data and can be computed much more efficiently on large-scale high-dimensional datasets.

Download Full-text

Study on a prediction of P2P network loan default based on the machine learning LightGBM and XGboost algorithms according to different high dimensional data cleaning

Electronic Commerce Research and Applications ◽

10.1016/j.elerap.2018.08.002 ◽

2018 ◽

Vol 31 ◽

pp. 24-39 ◽

Cited By ~ 48

Author(s):

Xiaojun Ma ◽

Jinglan Sha ◽

Dehua Wang ◽

Yuanbo Yu ◽

Qian Yang ◽

...

Keyword(s):

Machine Learning ◽

Data Cleaning ◽

High Dimensional Data ◽

P2p Network ◽

High Dimensional ◽

Loan Default

Download Full-text

Scalable hierarchical clustering by composition rank vector encoding and tree structure

10.1101/2020.04.12.038026 ◽

2020 ◽

Author(s):

Xiao Lai ◽

Pu Tian

Keyword(s):

Machine Learning ◽

Hierarchical Clustering ◽

Clustering Algorithm ◽

High Dimensional Data ◽

Machine Learning Algorithms ◽

Tree Structure ◽

Supervised Machine Learning ◽

High Dimensional ◽

Rank Vector ◽

Nonlinear Correlations

AbstractSupervised machine learning, especially deep learning based on a wide variety of neural network architectures, have contributed tremendously to fields such as marketing, computer vision and natural language processing. However, development of un-supervised machine learning algorithms has been a bottleneck of artificial intelligence. Clustering is a fundamental unsupervised task in many different subjects. Unfortunately, no present algorithm is satisfactory for clustering of high dimensional data with strong nonlinear correlations. In this work, we propose a simple and highly efficient hierarchical clustering algorithm based on encoding by composition rank vectors and tree structure, and demonstrate its utility with clustering of protein structural domains. No record comparison, which is an expensive and essential common step to all present clustering algorithms, is involved. Consequently, it achieves linear time and space computational complexity hierarchical clustering, thus applicable to arbitrarily large datasets. The key factor in this algorithm is definition of composition, which is dependent upon physical nature of target data and therefore need to be constructed case by case. Nonetheless, the algorithm is general and applicable to any high dimensional data with strong nonlinear correlations. We hope this algorithm to inspire a rich research field of encoding based clustering well beyond composition rank vector trees.

Download Full-text

Overview of Smart Aquaculture System: Focusing on Applications of Machine Learning and Computer Vision

Electronics ◽

10.3390/electronics10222882 ◽

2021 ◽

Vol 10 (22) ◽

pp. 2882

Author(s):

Thi Thu Em Vo ◽

Hyeyoung Ko ◽

Jun-Ho Huh ◽

Yonghoon Kim

Keyword(s):

Artificial Intelligence ◽

Machine Learning ◽

Sustainable Development ◽

Species Classification ◽

The Sustainable Development ◽

Aquaculture System ◽

Aquaculture Industry ◽

Intelligent Technology ◽

Applications Of Machine Learning ◽

Aquaculture Production

Smart aquaculture is nowadays one of the sustainable development trends for the aquaculture industry in intelligence and automation. Modern intelligent technologies have brought huge benefits to many fields including aquaculture to reduce labor, enhance aquaculture production, and be friendly to the environment. Machine learning is a subdivision of artificial intelligence (AI) by using trained algorithm models to recognize and learn traits from the data it watches. To date, there are several studies about applications of machine learning for smart aquaculture including measuring size, weight, grading, disease detection, and species classification. This review provides and overview of the development of smart aquaculture and intelligent technology. We summarized and collected 100 articles about machine learning in smart aquaculture from nearly 10 years about the methodology, results as well as the recent technology that should be used for development of smart aquaculture. We hope that this review will give readers interested in this field useful information.

Download Full-text

Modelling of Traffic Flows and Supply Chains Based on Geospatial Knowledge

Journal of Physics Conference Series ◽

10.1088/1742-6596/2068/1/012042 ◽

2021 ◽

Vol 2068 (1) ◽

pp. 012042

Author(s):

A Kolesnikov ◽

P Kikin ◽

E Panidi

Keyword(s):

Artificial Intelligence ◽

Machine Learning ◽

Supply Chains ◽

Inventory Management ◽

Traffic Flows ◽

Machine Learning Methods ◽

Customer Interaction ◽

Long Term Impact ◽

Artificial Intelligence Systems

Abstract The field of logistics and transport operates with large amounts of data. The transformation of such arrays into knowledge and processing using machine learning methods will help to find additional reserves for optimizing transport and logistics processes and supply chains. This article analyses the possibilities and prospects for the application of machine learning and geospatial knowledge in the field of logistics and transport using specific examples. The long-term impact of geospatial-based artificial intelligence systems on such processes as procurement, delivery, inventory management, maintenance, customer interaction is considered.

Download Full-text

Machine Learning and High-Dimensional Data Analysis

Principles of Clinical Cancer Research ◽

10.1891/9781617052392.0017 ◽

2018 ◽

Author(s):

Sanjay Aneja ◽

James B. Yu

Keyword(s):

Machine Learning ◽

Data Analysis ◽

High Dimensional Data ◽

High Dimensional ◽

High Dimensional Data Analysis

Download Full-text

Introduction to Machine Learning and Its Application

Applications of Artificial Intelligence in Electrical Engineering - Advances in Computational Intelligence and Robotics ◽

10.4018/978-1-7998-2718-4.ch014 ◽

2020 ◽

pp. 262-290

Author(s):

Ladly Patel ◽

Kumar Abhishek Gaurav

Keyword(s):

Artificial Intelligence ◽

Machine Learning ◽

Learning Algorithm ◽

Machine Learning Algorithms ◽

Human Interaction ◽

Machine Learning Algorithm ◽

Past Data ◽

Applications Of Machine Learning ◽

Learning Frameworks ◽

Learning Data

In today's world, a huge amount of data is available. So, all the available data are analyzed to get information, and later this data is used to train the machine learning algorithm. Machine learning is a subpart of artificial intelligence where machines are given training with data and the machine predicts the results. Machine learning is being used in healthcare, image processing, marketing, etc. The aim of machine learning is to reduce the work of the programmer by doing complex coding and decreasing human interaction with systems. The machine learns itself from past data and then predict the desired output. This chapter describes machine learning in brief with different machine learning algorithms with examples and about machine learning frameworks such as tensor flow and Keras. The limitations of machine learning and various applications of machine learning are discussed. This chapter also describes how to identify features in machine learning data.

Download Full-text

Role of Artificial Intelligence in Modeling Psychometrics to Detect Neurodevelopmental Disorders

Interdisciplinary Approaches to Altering Neurodevelopmental Disorders - Advances in Medical Diagnosis, Treatment, and Care ◽

10.4018/978-1-7998-3069-6.ch013 ◽

2020 ◽

pp. 213-227

Author(s):

Navjot Singh ◽

Amarjot Kaur

Keyword(s):

Artificial Intelligence ◽

Machine Learning ◽

Neurodevelopmental Disorders ◽

Machine Learning Techniques ◽

Diagnosis And Prognosis ◽

Processing Power ◽

Learning Techniques ◽

Numerous Data ◽

Applications Of Machine Learning ◽

Micro Patterns

The objective of the present chapter is to highlight applications of machine learning and artificial intelligence (AI) in clinical diagnosis of neurodevelopmental disorders. The proposed approach aims at recognizing behavioral traits and other cognitive aspects. The availability of numerous data and high processing power, such as graphic processing units (GPUs) or cloud computing, enabled the study of micro-patterns hundreds of times faster compared to manual analysis. AI, being a new technological breakthrough, enables study of human behavior patterns, which are hidden in millions of micro-patterns originating from human actions, reactions, and gestures. The chapter will also focus on the challenges in existing machine learning techniques and the best possible solution addressing those problems. In the future, more AI-based expert systems can enhance the accuracy of the diagnosis and prognosis process.

Download Full-text