Machine Learning Applications in Computer Vision

2012 ◽

pp. 99-132 ◽

Cited By ~ 1

Author(s):

Mehrtash Harandi ◽

Javid Taheri ◽

Brian C. Lovell

Keyword(s):

Machine Learning ◽

Computer Vision ◽

Object Recognition ◽

Visual Recognition ◽

Sequential Decision ◽

Biologically Inspired ◽

Transductive Learning ◽

Machine Learning Applications ◽

Appearance Models ◽

Learning Frameworks

Recognizing objects based on their appearance (visual recognition) is one of the most significant abilities of many living creatures. In this study, recent advances in the area of automated object recognition are reviewed; the authors specifically look into several learning frameworks to discuss how they can be utilized in solving object recognition problems. These include reinforcement learning, a biologically inspired machine learning technique for solving sequential decision problems, and transductive learning, a framework in which the learner observes query data and potentially exploits its structure for classification. The authors also discuss local and global appearance models for object recognition, as well as how similarities between objects can be learnt and evaluated.

Visual Behavior Based Bio-Inspired Polarization Techniques in Computer Vision and Robotics

Developing and Applying Biologically-Inspired Vision Systems ◽

10.4018/978-1-4666-2539-6.ch011 ◽

2012 ◽

pp. 243-272 ◽

Cited By ~ 2

Author(s):

Abd El Rahman Shabayek ◽

Olivier Morel ◽

David Fofi

Keyword(s):

Computer Vision ◽

Object Recognition ◽

Visual Perception ◽

Polarization Vision ◽

Visual Behavior ◽

Biologically Inspired ◽

Long Time ◽

Comprehensive Survey ◽

High Level ◽

And Robotics

For a long time, it was thought that the sensing of polarization by animals is invariably related to their behavior, such as navigation and orientation. Recently, it was found that polarization can be part of high-level visual perception, permitting a wide range of vision applications. Polarization vision can be used for most tasks of color vision, including object recognition, contrast enhancement, camouflage breaking, and signal detection and discrimination. The polarization-based visual behavior found in the animal kingdom is briefly covered. Then, the authors go into depth on the bio-inspired applications based on polarization in computer vision and robotics. The aim is to provide a comprehensive survey highlighting the key principles of polarization-based techniques and how they are biologically inspired.

VGM-Bench: FPU Benchmark Suite for Computer Vision, Computer Graphics and Machine Learning Applications

Lecture Notes in Computer Science - Embedded Computer Systems: Architectures, Modeling, and Simulation ◽

10.1007/978-3-030-60939-9_23 ◽

2020 ◽

pp. 323-335

Author(s):

Luca Cremona ◽

William Fornaciari ◽

Andrea Galimberti ◽

Andrea Romanoni ◽

Davide Zoni

Keyword(s):

Machine Learning ◽

Computer Vision ◽

Computer Graphics ◽

Machine Learning Applications ◽

Benchmark Suite

Benchmark and Survey of Automated Machine Learning Frameworks

Journal of Artificial Intelligence Research ◽

10.1613/jair.1.11854 ◽

2021 ◽

Vol 70 ◽

pp. 409-472

Author(s):

Marc-André Zöller ◽

Marco F. Huber

Keyword(s):

Machine Learning ◽

Daily Life ◽

Real Data ◽

Data Sets ◽

Domain Experts ◽

Vital Part ◽

Machine Learning Applications ◽

Automated Machine Learning ◽

Learning Frameworks

Machine learning (ML) has become a vital part of many aspects of our daily life. However, building well-performing machine learning applications requires highly specialized data scientists and domain experts. Automated machine learning (AutoML) aims to reduce the demand for data scientists by enabling domain experts to build machine learning applications automatically without extensive knowledge of statistics and machine learning. This paper combines a survey of current AutoML methods with a benchmark of popular AutoML frameworks on real data sets. Driven by the frameworks selected for evaluation, we summarize and review important AutoML techniques and methods concerning every step in building an ML pipeline. The selected AutoML frameworks are evaluated on 137 data sets from established AutoML benchmark suites.

Optimization Techniques for Mining Power Quality Data and Processing Unbalanced Datasets in Machine Learning Applications

Energies ◽

10.3390/en14020463 ◽

2021 ◽

Vol 14 (2) ◽

pp. 463

Author(s):

Alvaro Furlani Bastos ◽

Surya Santoso

Keyword(s):

Machine Learning ◽

Power Systems ◽

Power Quality ◽

Optimization Techniques ◽

Machine Learning Algorithms ◽

Quality Data ◽

Successful Performance ◽

Data Mining Approach ◽

Machine Learning Applications ◽

Learning Frameworks

In recent years, machine learning applications have received increasing interest from power system researchers. The successful performance of these applications depends on the availability of extensive and diverse datasets for the training and validation of machine learning frameworks. However, power systems operate at quasi-steady-state conditions most of the time, and the measurements corresponding to these states provide limited novel knowledge for the development of machine learning applications. In this paper, a data mining approach based on optimization techniques is proposed for filtering root-mean-square (RMS) voltage profiles and identifying unusual measurements within triggerless power quality datasets. Then, datasets with equal representation of event and non-event observations are created so that machine learning algorithms can extract useful insights from the rare but important event observations. The proposed framework is demonstrated and validated with both synthetic signals and field data measurements.

Machine Learning Computer Vision Applications for Spatial AI Object Recognition in Orange County, California

10.36227/techrxiv.15157689.v1 ◽

2021 ◽

Author(s):

Kostas Alexandridis

Keyword(s):

Machine Learning ◽

Computer Vision ◽

Object Recognition ◽

Spatial Data ◽

Field Data ◽

Orange County ◽

Data Systems ◽

Neural Network Learning ◽

Stop Sign ◽

Efficiency And Effectiveness

We provide an integrated and systematic automation approach to spatial object recognition and positional detection using AI machine learning and computer vision algorithms for Orange County, California. We describe a comprehensive methodology for multi-sensor, high-resolution field data acquisition, along with post-field processing and pre-analysis processing tasks. We developed a series of algorithmic formulations and workflows that integrate convolutional deep neural network learning with detected object positioning estimation in 360° equirectangular photosphere imagery. We provide examples of application processing more than 800 thousand cardinal directions in photosphere images across two areas in Orange County, and present detection results for stop-sign and fire hydrant object recognition. We discuss the efficiency and effectiveness of our approach, along with broader inferences related to the performance and implications of this approach for future technological innovations, including automation of spatial data and public asset inventories, and near real-time AI field data systems.

Machine Learning Computer Vision Applications for Spatial AI Object Recognition in Orange County, California

10.36227/techrxiv.15157689 ◽

2021 ◽

Author(s):

Kostas Alexandridis

Keyword(s):

Machine Learning ◽

Computer Vision ◽

Object Recognition ◽

Spatial Data ◽

Field Data ◽

Orange County ◽

Data Systems ◽

Neural Network Learning ◽

Stop Sign ◽

Efficiency And Effectiveness

We provide an integrated and systematic automation approach to spatial object recognition and positional detection using AI machine learning and computer vision algorithms for Orange County, California. We describe a comprehensive methodology for multi-sensor, high-resolution field data acquisition, along with post-field processing and pre-analysis processing tasks. We developed a series of algorithmic formulations and workflows that integrate convolutional deep neural network learning with detected object positioning estimation in 360° equirectangular photosphere imagery. We provide examples of application processing more than 800 thousand cardinal directions in photosphere images across two areas in Orange County, and present detection results for stop-sign and fire hydrant object recognition. We discuss the efficiency and effectiveness of our approach, along with broader inferences related to the performance and implications of this approach for future technological innovations, including automation of spatial data and public asset inventories, and near real-time AI field data systems.

Identification of Baikal phytoplankton inferred from computer vision methods and machine learning

Limnology and Freshwater Biology ◽

10.31951/2658-3518-2021-a-3-1143 ◽

2021 ◽

pp. 1143-1146

Author(s):

A.V. Lysenko ◽

M.S. Oznobikhin ◽

E.A. Kireev ◽

...

Keyword(s):

Neural Network ◽

Machine Learning ◽

Neural Networks ◽

Computer Vision ◽

Object Recognition ◽

Light Microscope ◽

Optimal Size ◽

Optimal Parameters ◽

The Neural Network

This study discusses the problem of phytoplankton classification using computer vision methods and convolutional neural networks. We created a system for automatic object recognition consisting of two parts: analysis and primary processing of phytoplankton images, and development of the neural network based on the information obtained about the images. We developed software that can detect particular objects in images from a light microscope. We trained a convolutional neural network using transfer learning and determined the optimal parameters of this network and the optimal size of the dataset used. To increase accuracy for these groups of classes, we created three neural networks with the same structure. The accuracy obtained in the classification of Baikal phytoplankton by these neural networks was up to 80%.

Linguistic Indexing of Images with Database Mediation

Encyclopedia of Information Science and Technology, Second Edition ◽

10.4018/978-1-60566-026-4.ch385 ◽

2011 ◽

pp. 2420-2425

Author(s):

Emmanuel Udoh

Keyword(s):

Machine Learning ◽

Computer Vision ◽

Object Recognition ◽

Image Retrieval ◽

Statistical Modeling ◽

Digital Images ◽

Statistical Error ◽

Large Databases ◽

Visual Aid ◽

Active Research

Computer vision or object recognition complements human or biological vision using techniques from machine learning, statistics, scene reconstruction, indexing and event analysis. Object recognition is an active research area that implements artificial vision in software and hardware. Some application examples are autonomous robots, surveillance, indexing databases of pictures and human-computer interaction. This visual aid is beneficial to users, because humans remember information with greater accuracy when it is presented visually than when it originates in writing, speech or in kinesthetic form. Linguistic indexing adds another dimension to computer vision by automatically assigning words or textual descriptions to images. This augments content-based image retrieval (CBIR), which extracts or searches for digital images in large databases. According to Li and Wang (2003), most of the existing CBIR projects are general-purpose image retrieval systems that search for images visually similar to a query sketch. Current CBIR systems are incapable of assigning words automatically to images due to the inherent difficulty of recognizing numerous objects at once. This situation is stimulating several research endeavors that seek to assign text to images, thereby improving image retrieval in large databases. To enhance information processing using object recognition techniques, current research has focused on automatic linguistic indexing of digital images (ALIDI). ALIDI requires a combination of mathematical, statistical, computational, and graphical backgrounds. Many researchers have focused on various aspects of linguistic processing such as CBIR (Ghosal, Ircing, & Khudanpur, 2005; Iqbal & Aggarwal, 2002; Wang, 2001), machine learning techniques (Iqbal & Aggarwal, 2002), digital libraries (Witen & Bainbridge, 2003) and statistical modeling (Li, Gray, & Olsen, 20004; Li & Wang, 2003).
A growing approach is the use of statistical models, as demonstrated by Li and Wang (2003). It entails building databases of images to be used for supervised learning. A trained system is then used to recognize and identify new images within a statistical error margin. This statistical modeling approach uses a hidden Markov model to extract representative information about any category of images analyzed. However, in using computers to recognize images with textual descriptions, some researchers employ solely text-based approaches. In this article, the focus is on the computational and graphical aspects of ALIDI in a system that uses Web-based access in order to enable wider usage (Ntoulas, Chao, & Cho, 2005). This system uses image composition (primary hue and saturation) in the linguistic indexing of digital images or pictures.

Machine Learning Computer Vision Applications for Spatial AI Object Recognition in Orange County, California

10.36227/techrxiv.15157689.v2 ◽

2021 ◽

Author(s):

Kostas Alexandridis

Keyword(s):

Machine Learning ◽

Computer Vision ◽

Object Recognition ◽

Spatial Data ◽

Field Data ◽

Orange County ◽

Data Systems ◽

Neural Network Learning ◽

Stop Sign ◽

Efficiency And Effectiveness

We provide an integrated and systematic automation approach to spatial object recognition and positional detection using AI machine learning and computer vision algorithms for Orange County, California. We describe a comprehensive methodology for multi-sensor, high-resolution field data acquisition, along with post-field processing and pre-analysis processing tasks. We developed a series of algorithmic formulations and workflows that integrate convolutional deep neural network learning with detected object positioning estimation in 360-degree equirectangular photosphere imagery. We provide examples of application processing more than 800 thousand cardinal directions in photosphere images across two areas in Orange County, and present detection results for stop-sign and fire hydrant object recognition. We discuss the efficiency and effectiveness of our approach, along with broader inferences related to the performance and implications of this approach for future technological innovations, including automation of spatial data and public asset inventories, and near real-time AI field data systems.
