Robotic Vision
Latest Publications

TOTAL DOCUMENTS: 22 (FIVE YEARS: 0)
H-INDEX: 1 (FIVE YEARS: 0)
Published By: IGI Global
ISBN: 9781466626720, 9781466627031

2013, pp. 437-455
Author(s): E. Antúnez, Y. Haxhimusa, R. Marfil, W. G. Kropatsch, A. Bandera

Computer vision systems have to deal with thousands, sometimes millions, of pixel values from each frame, and the computational complexity of many problems related to the interpretation of image data is very high. The task becomes especially difficult if a system has to operate in real time. Within the Combinatorial Pyramid framework, the proposed computational model of attention integrates bottom-up and top-down factors. Neurophysiological studies have shown that, in humans, these two factors are the main drivers of attention. Bottom-up factors emanate from the scene and focus attention on regions whose features are sufficiently discriminative with respect to the features of their surroundings. Top-down factors, on the other hand, are derived from cognitive issues such as knowledge about the current task. Specifically, the model considers only the knowledge of a given target to drive attention to specific regions of the image. Compared with previous approaches, the model takes into consideration not only geometrical properties and appearance information but also the internal topological layout. Once the focus of attention has been fixed on a region of the scene, the model evaluates whether the focus is correctly located over the desired target. This recognition algorithm uses topological features provided by the pre-attentive stage; attention and recognition are thus tied together, sharing the same image descriptors.
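As a rough illustration only (not the authors' Combinatorial Pyramid implementation), the sketch below combines a bottom-up contrast map with a top-down target-similarity map into a single saliency map whose maximum gives the focus of attention; the feature choice, normalisation, and mixing weight are assumptions made for the example.

```python
import numpy as np

def attention_map(features, target_descriptor, top_down_weight=0.5):
    """Combine bottom-up contrast with top-down target similarity.

    features: (H, W, D) array of per-pixel feature vectors (e.g. colour).
    target_descriptor: (D,) feature vector describing the sought target.
    Returns an (H, W) saliency map in [0, 1]; its maximum is the focus.
    """
    # Bottom-up: distance of each pixel's features to the global mean,
    # i.e. how much the pixel stands out from its surroundings.
    mean_feat = features.mean(axis=(0, 1))
    bottom_up = np.linalg.norm(features - mean_feat, axis=2)

    # Top-down: similarity to the target descriptor (inverse distance).
    dist_to_target = np.linalg.norm(features - target_descriptor, axis=2)
    top_down = 1.0 / (1.0 + dist_to_target)

    def norm(m):
        # Normalise a map to [0, 1] before mixing.
        return (m - m.min()) / (m.max() - m.min() + 1e-9)

    return (1 - top_down_weight) * norm(bottom_up) + top_down_weight * norm(top_down)

# Focus of attention = location of the saliency maximum:
# fy, fx = np.unravel_index(attention_map(feats, target).argmax(), feats.shape[:2])
```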


2013, pp. 333-351
Author(s): P. Cavestany Olivares, D. Herrero-Pérez, J. J. Alcaraz Jiménez, H. Martínez Barberá

In this chapter, the authors describe the vision system they use in the Standard Platform League (SPL), one of the official leagues in the RoboCup competition. The characteristics of the SPL are very demanding: all processing must be done on board, and the changeable environment requires powerful methods for extracting information as well as robust filters. The purpose is to show a vision system that meets these goals. The chapter describes the architecture of the authors' system and the flowchart of the image-processing pipeline, which is designed to allow rapid and reliable calibration. The authors detect field features by finding intersections between field lines at frame rate and feed them to a fuzzy-Markov localisation technique. The methods implemented to recognise the ball and the goals are also explained.
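As a minimal sketch of the field-feature idea (not the authors' SPL code), two detected field lines can be intersected cheaply using homogeneous coordinates; the example points and image size below are illustrative.

```python
import numpy as np

def line_through(p1, p2):
    """Homogeneous line through two image points (x, y)."""
    return np.cross([p1[0], p1[1], 1.0], [p2[0], p2[1], 1.0])

def intersect(l1, l2):
    """Intersection of two homogeneous lines; None if (nearly) parallel."""
    p = np.cross(l1, l2)
    if abs(p[2]) < 1e-9:          # parallel lines meet at infinity
        return None
    return p[:2] / p[2]           # back to Euclidean image coordinates

# Example: a touch line and the halfway line crossing at (320, 240).
side = line_through((0, 240), (640, 240))
half = line_through((320, 0), (320, 480))
print(intersect(side, half))      # -> [320. 240.]
```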


2013, pp. 231-256
Author(s): Renato Ramos da Silva, Roseli Aparecida Francelin Romero

Computer vision is essential for developing a social robotic system capable of interacting with humans: it is responsible for extracting and representing the information around the robot. In addition, a learning mechanism to correctly select an action to execute in the environment, a pro-active mechanism to engage in an interaction, and a voice mechanism are indispensable for a social robot. Together, these mechanisms allow a robot to emulate some human behaviours, such as shared attention. This chapter presents a robotic architecture composed of such mechanisms that makes interaction between a robotic head and a caregiver possible, through shared-attention learning and the identification of objects.
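Purely as a hypothetical composition sketch, the mechanisms named above can be pictured as modules driven by one perception-action loop; the module names and interfaces below are invented for illustration and do not reflect the chapter's actual architecture.

```python
class SocialRobotHead:
    """Minimal composition of the mechanisms named in the abstract (hypothetical)."""

    def __init__(self, vision, learner, proactive, voice):
        self.vision = vision        # extracts and represents the surroundings
        self.learner = learner      # selects an action from the current state
        self.proactive = proactive  # decides when to initiate interaction
        self.voice = voice          # verbal feedback to the caregiver

    def step(self):
        state = self.vision.perceive()          # e.g. caregiver gaze, visible objects
        if self.proactive.should_engage(state):
            self.voice.say("Look!")             # try to initiate shared attention
        return self.learner.select_action(state)  # e.g. turn head toward an object
```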


2013, pp. 129-138
Author(s): José García-Rodríguez, Juan Manuel García-Chamizo, Sergio Orts-Escolano, Vicente Morell-Gimenez, José Antonio Serra-Pérez, et al.

This chapter addresses the ability of self-organizing neural network models to manage video and image processing in real time. The Growing Neural Gas (GNG) network, with its attributes of growth, flexibility, rapid adaptation, and excellent quality of representation of the input space, is a suitable model for real-time applications. A number of applications are presented, including image compression, representation of hand and medical image contours, surveillance systems, hand gesture recognition systems, and 3D data reconstruction.
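For orientation, a minimal sketch of the standard per-sample GNG adaptation step is given below (learning rates and the maximum edge age are illustrative defaults); error accumulation, node insertion, and node removal, i.e. the "growing" part, are omitted for brevity and this is not the chapter's optimised real-time implementation.

```python
import numpy as np

def gng_adapt(nodes, edges, sample, eps_w=0.05, eps_n=0.006, max_age=50):
    """One Growing Neural Gas adaptation step.

    nodes: dict node_id -> position (np.ndarray).
    edges: dict frozenset({a, b}) -> age.
    """
    # 1. Find the two nodes nearest to the input sample.
    ids = list(nodes)
    order = np.argsort([np.linalg.norm(nodes[i] - sample) for i in ids])
    s1, s2 = ids[order[0]], ids[order[1]]

    # 2. Move the winner and its topological neighbours toward the sample,
    #    ageing every edge emanating from the winner.
    nodes[s1] += eps_w * (sample - nodes[s1])
    for e in list(edges):
        if s1 in e:
            neighbour = next(iter(e - {s1}))
            nodes[neighbour] += eps_n * (sample - nodes[neighbour])
            edges[e] += 1

    # 3. Refresh the winner/runner-up edge and drop edges that got too old.
    edges[frozenset((s1, s2))] = 0
    for e in [e for e, age in edges.items() if age > max_age]:
        del edges[e]
```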


2013, pp. 257-280
Author(s): Wenjie Yan, Elena Torta, David van der Pol, Nils Meins, Cornelius Weber, et al.

This chapter presents an overview of a typical Ambient Assisted Living (AAL) scenario in which a robot navigates to a person to convey information. Indoor robot navigation is a challenging task due to the complexity of real home environments and the need for online learning abilities to adjust to dynamic conditions. A comparison between systems with different sensor typologies shows that vision-based systems promise good performance and a wide scope of usage at reasonable cost. Moreover, vision-based systems can perform different tasks simultaneously by applying different algorithms to the input data stream, thus enhancing the flexibility of the system. The authors introduce the state of the art of several computer vision methods for indoor robotic navigation to a person and for human-robot interaction. A case study has been conducted in which a robot, as part of an AAL system, navigates to a person and interacts with her. The authors evaluate this test case and give an outlook on the potential of learning robot vision in ambient homes.


2013, pp. 140-153
Author(s): Vicente Morell-Gimenez, Sergio Orts-Escolano, José García-Rodríguez, Miguel Cazorla, Diego Viejo

The task of registering three-dimensional data sets related by rigid motions is a fundamental problem in many areas, such as computer vision, medical imaging, and mobile robotics, arising whenever two or more 3D data sets must be aligned in a common coordinate system. Focusing on the mobile robotics area, this chapter reviews the main registration methods in the literature. A possible classification distinguishes distance-based and feature-based methods. Distance-based methods, of which the classical Iterative Closest Point (ICP) algorithm is the most representative, have many variations that obtain better results under particular noise, time, or accuracy conditions. Feature-based methods try to reduce the great number of points delivered by current sensors, using a combination of feature detector and descriptor from which the final transformation can be computed with a method such as RANSAC or genetic algorithms.
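To make the distance-based family concrete, here is a minimal point-to-point ICP sketch (nearest-neighbour correspondences plus an SVD/Kabsch rigid fit); the iteration count is arbitrary and no convergence test, outlier rejection, or acceleration from the surveyed variants is included.

```python
import numpy as np
from scipy.spatial import cKDTree

def icp(source, target, iterations=30):
    """Minimal point-to-point ICP: aligns source (Nx3) onto target (Mx3).

    Returns R, t such that R @ source.T + t approximately matches target.
    """
    R, t = np.eye(3), np.zeros(3)
    src = source.copy()
    tree = cKDTree(target)
    for _ in range(iterations):
        # 1. Closest-point correspondences.
        _, idx = tree.query(src)
        tgt = target[idx]
        # 2. Best rigid motion for these pairs (Kabsch via SVD).
        mu_s, mu_t = src.mean(axis=0), tgt.mean(axis=0)
        U, _, Vt = np.linalg.svd((src - mu_s).T @ (tgt - mu_t))
        R_step = Vt.T @ U.T
        if np.linalg.det(R_step) < 0:      # guard against reflections
            Vt[-1] *= -1
            R_step = Vt.T @ U.T
        t_step = mu_t - R_step @ mu_s
        # 3. Apply the step and accumulate the total transform.
        src = (R_step @ src.T).T + t_step
        R, t = R_step @ R, R_step @ t + t_step
    return R, t
```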


2013, pp. 173-191
Author(s): Ashwin P. Dani, Zhen Kan, Nic Fischer, Warren E. Dixon

In this chapter, an online method is developed for estimating the 3D structure (with proper scale) of moving objects seen by a moving camera. In contrast to traditional batch solutions for this problem, a nonlinear unknown input observer strategy is used in which the object's velocity is considered an unknown input to the perspective dynamical system. The estimator is exponentially stable and hence provides robustness against modeling uncertainties and measurement noise from the camera. The developed method provides the first causal, observer-based structure estimation algorithm for a moving camera viewing a moving object with unknown time-varying object velocities.
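For context only, the sketch below simulates the standard perspective dynamical system for the simpler static-point case: the relative position of a fixed point in a moving camera frame obeys m_dot = -w x m - v, and the camera only measures the projection (X/Z, Y/Z). The observer contributed by the chapter, which also handles unknown object motion, is not reproduced here; the velocities and time step are illustrative.

```python
import numpy as np

def simulate_measurements(m0, v_cam, w_cam, dt=0.01, steps=200):
    """Euler-integrate m = [X, Y, Z] of a static point seen from a camera
    moving with linear velocity v_cam and angular velocity w_cam (camera frame):
        m_dot = -w_cam x m - v_cam
    Returns the image measurements y = [X/Z, Y/Z]; the unmeasured quantity an
    observer would recover is the inverse depth 1/Z.
    """
    m = np.asarray(m0, dtype=float)
    ys = []
    for _ in range(steps):
        m_dot = -np.cross(w_cam, m) - np.asarray(v_cam, dtype=float)
        m = m + dt * m_dot
        ys.append(m[:2] / m[2])
    return np.array(ys)

# Example: a point 5 m ahead, camera translating sideways and yawing slowly.
y = simulate_measurements([0.5, 0.2, 5.0], v_cam=[0.3, 0.0, 0.1],
                          w_cam=[0.0, 0.05, 0.0])
```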


2013, pp. 112-128
Author(s): Ramón Moreno, Manuel Graña, Kurosh Madani

Representing the points of the RGB color space in spherical coordinates makes it possible to retain the chromatic components of image pixel colors while easily separating the intensity component. This representation allows the definition of a chromatic distance and a hybrid gradient with good perceptual color-constancy properties. In this chapter, the authors present a watershed-based image segmentation method using this hybrid gradient. Oversegmentation is resolved by applying a region-merging strategy based on the chromatic distance defined on the spherical coordinate representation. The chapter shows the robustness and performance of the approach on well-known test images, on the Berkeley benchmark image database, and on images taken with a NAO robot.
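A minimal sketch of the underlying idea follows: the radius of an RGB vector carries intensity, while the angles carry chromaticity, so the angle between two RGB vectors serves as a chromatic distance that is insensitive to brightness. The exact angle conventions and the hybrid gradient of the chapter may differ.

```python
import numpy as np

def rgb_to_spherical(img):
    """RGB image (H, W, 3) -> radius (intensity-like) plus two chromatic angles.

    The angles are invariant to scaling of the RGB vector, which is what gives
    the representation its colour-constancy flavour.
    """
    r, g, b = img[..., 0], img[..., 1], img[..., 2]
    radius = np.sqrt(r**2 + g**2 + b**2)
    theta = np.arctan2(g, r)                                  # azimuth in the R-G plane
    phi = np.arccos(np.clip(b / (radius + 1e-9), -1.0, 1.0))  # zenith angle
    return radius, theta, phi

def chromatic_distance(c1, c2):
    """Angle between two RGB vectors: ~0 for equal chromaticity regardless of
    intensity, larger as the hues diverge."""
    c1, c2 = np.asarray(c1, float), np.asarray(c2, float)
    cosang = c1 @ c2 / (np.linalg.norm(c1) * np.linalg.norm(c2) + 1e-9)
    return np.arccos(np.clip(cosang, -1.0, 1.0))

print(chromatic_distance([60, 10, 10], [240, 40, 40]))   # dark vs bright red: ~0
print(chromatic_distance([200, 20, 20], [20, 200, 20]))  # red vs green: large
```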


2013, pp. 43-58
Author(s): Marcelo Saval-Calvo, Jorge Azorín-López, Andrés Fuster-Guilló

In this chapter, a comparative analysis of basic segmentation methods for video sequences, and of their combinations, is carried out. The analysis of the different algorithms is based on their efficiency (true positive and false positive rates) and on the temporal cost of providing regions in the scene; these are two of the most important design requirements for supplying segmented regions to a tracker in an efficient and timely manner, constrained by the application. Specifically, methods using temporal information, namely Background Subtraction, Temporal Differencing, and Optical Flow, together with the four combinations of them, have been analyzed. Experimentation has been carried out using image sequences from the CAVIAR project database. The efficiency results show that Background Subtraction achieves the best individual result, whereas the combination of the three basic methods gives the best result overall. However, combinations involving Optical Flow should be considered depending on the application, because its temporal cost is high with respect to the efficiency it adds to the combination.
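As a minimal sketch of the two cheapest methods compared above, the snippet below thresholds per-pixel differences against a background model and against the previous frame, and fuses the masks conjunctively; the threshold values, the static background model, and the fusion rule are illustrative choices, and Optical Flow is omitted here because it is typically delegated to a dedicated library.

```python
import numpy as np

def background_subtraction(frame, background, thresh=25):
    """Foreground mask: pixels that differ from a reference background model."""
    return np.abs(frame.astype(int) - background.astype(int)) > thresh

def temporal_differencing(frame, prev_frame, thresh=25):
    """Motion mask: pixels that changed between consecutive frames."""
    return np.abs(frame.astype(int) - prev_frame.astype(int)) > thresh

def combine(masks):
    """Simple conjunctive fusion of segmentation masks (one possible combination)."""
    out = masks[0]
    for m in masks[1:]:
        out = out & m
    return out

# Typical loop over grayscale frames (2D uint8 arrays):
# bg = first_frame.copy()                    # naive static background model
# fg = combine([background_subtraction(frame, bg),
#               temporal_differencing(frame, prev_frame)])
```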


2013, pp. 23-42
Author(s): Xavier Perez-Sala, Laura Igual, Sergio Escalera, Cecilio Angulo

Different methodologies for uniform sampling over the rotation group SO(3), aimed at building unbiased 2D shape models from 3D objects, are introduced and reviewed in this chapter. State-of-the-art non-uniform sampling approaches are discussed, and uniform sampling methods using Euler angles and quaternions are introduced. Moreover, since the presented work is oriented to model-building applications, it is not limited to general discrete methods for obtaining uniform 3D rotations; uniform rotations are also treated from a continuous point of view in the case of Procrustes analysis.
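One well-known quaternion route to uniform rotations is sketched below: normalising 4D Gaussian samples yields unit quaternions uniform on the 3-sphere, which map to rotations uniform with respect to the Haar measure. This is the random-sampling variant only; the deterministic grids, Euler-angle constructions, and continuous Procrustes treatment discussed in the chapter are not reproduced, and the projection step at the end is an illustrative use, not the chapter's model-building pipeline.

```python
import numpy as np

def random_uniform_rotations(n, seed=0):
    """n rotation matrices drawn uniformly (Haar measure) from SO(3).

    Unit quaternions uniform on S^3 (q and -q give the same rotation) are
    obtained by normalising 4D Gaussian samples, then converted to matrices.
    """
    rng = np.random.default_rng(seed)
    q = rng.normal(size=(n, 4))
    q /= np.linalg.norm(q, axis=1, keepdims=True)
    w, x, y, z = q.T
    R = np.empty((n, 3, 3))
    R[:, 0, 0] = 1 - 2*(y*y + z*z); R[:, 0, 1] = 2*(x*y - w*z); R[:, 0, 2] = 2*(x*z + w*y)
    R[:, 1, 0] = 2*(x*y + w*z); R[:, 1, 1] = 1 - 2*(x*x + z*z); R[:, 1, 2] = 2*(y*z - w*x)
    R[:, 2, 0] = 2*(x*z - w*y); R[:, 2, 1] = 2*(y*z + w*x); R[:, 2, 2] = 1 - 2*(x*x + y*y)
    return R

# Project a 3D shape (Nx3 points) under each sampled rotation to collect 2D views:
# views = [(Ri @ points_3d.T).T[:, :2] for Ri in random_uniform_rotations(100)]
```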

