Terahertz optical machine learning for object recognition

Recognizing objects based on their appearance (visual recognition) is one of the most significant abilities of many living creatures. In this study, recent advances in the area of automated object recognition are reviewed; the authors specifically look into several learning frameworks to discuss how they can be utilized in solving object recognition tasks. These include reinforcement learning, a biologically inspired machine learning technique for solving sequential decision problems, and transductive learning, a framework in which the learner observes query data and potentially exploits its structure for classification. The authors also discuss local and global appearance models for object recognition, as well as how similarities between objects can be learnt and evaluated.

Object Recognition with Machine Learning: Case Study of Demand-Responsive Service

2019 IEEE International Conference on Internet of Things and Intelligence System (IoTaIS) ◽ 10.1109/iotais47347.2019.8980440 ◽ 2019

Author(s): Pei-Jung Lin ◽ Stephen Hung ◽ Shing Fai Steven Lam ◽ Bo Ching Chen

Keyword(s): Machine Learning ◽ Object Recognition

Non-Rectangular RoI Extraction and Machine Learning Based Multiple Object Recognition Used for Time-Series Areal Images Obtained Using MAV

Procedia Computer Science ◽ 10.1016/j.procs.2018.07.280 ◽ 2018 ◽ Vol 126 ◽ pp. 462-471 ◽ Cited By ~ 1

Author(s): Hirokazu Madokoro ◽ Asahi Kainuma ◽ Kazuhito Sato

Keyword(s): Machine Learning ◽ Time Series ◽ Object Recognition ◽ Multiple Object ◽ Roi Extraction

A Comparative Analysis for 2D Object Recognition: A Case Study with Tactode Puzzle-Like Tiles

Journal of Imaging ◽ 10.3390/jimaging7040065 ◽ 2021 ◽ Vol 7 (4) ◽ pp. 65

Author(s): Daniel Silva ◽ Armando Sousa ◽ Valter Costa

Keyword(s): Machine Learning ◽ Deep Learning ◽ Comparative Analysis ◽ Object Recognition ◽ Template Matching ◽ Recognition Performance ◽ Regions Of Interest ◽ The Other ◽ Classification Methods

Object recognition represents the ability of a system to identify objects, humans or animals in images. Within this domain, this work presents a comparative analysis among different classification methods aiming at Tactode tile recognition. The covered methods include: (i) machine learning with HOG and SVM; (ii) deep learning with CNNs such as VGG16, VGG19, ResNet152, MobileNetV2, SSD and YOLOv4; (iii) matching of handcrafted features with SIFT, SURF, BRISK and ORB; and (iv) template matching. A dataset was created to train learning-based methods (i and ii), and with respect to the other methods (iii and iv), a template dataset was used. To evaluate the performance of the recognition methods, two test datasets were built: tactode_small and tactode_big, which consisted of 288 and 12,000 images, holding 2784 and 96,000 regions of interest for classification, respectively. SSD and YOLOv4 were the worst methods for their domain, whereas ResNet152 and MobileNetV2 showed that they were strong recognition methods. SURF, ORB and BRISK demonstrated great recognition performance, while SIFT was the worst of this type of method. The methods based on template matching attained reasonable recognition results, falling behind most other methods. The top three methods of this study were: VGG16 with an accuracy of 99.96% and 99.95% for tactode_small and tactode_big, respectively; VGG19 with an accuracy of 99.96% and 99.68% for the same datasets; and HOG and SVM, which reached an accuracy of 99.93% for tactode_small and 99.86% for tactode_big, while at the same time presenting average execution times of 0.323 s and 0.232 s on the respective datasets, being the fastest method overall. This work demonstrated that VGG16 was the best choice for this case study, since it minimised the misclassifications for both test datasets.
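
The template-matching family (iv) named in the abstract above can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes grayscale regions of interest stored as nested Python lists, and it uses normalized cross-correlation as the similarity score, which is one common choice among several. The function names `ncc`, `match_template`, and `classify` are hypothetical.

```python
import math

def ncc(patch, template):
    """Normalized cross-correlation of two equally sized 2D lists (range -1..1)."""
    a = [v for row in patch for v in row]
    b = [v for row in template for v in row]
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    num = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    den = math.sqrt(sum((x - mean_a) ** 2 for x in a)
                    * sum((y - mean_b) ** 2 for y in b))
    # A constant patch or template has zero variance; treat it as "no match".
    return num / den if den > 0 else 0.0

def match_template(image, template):
    """Slide the template over the image; return (best_score, (row, col))."""
    th, tw = len(template), len(template[0])
    ih, iw = len(image), len(image[0])
    best = (-2.0, (0, 0))  # below the minimum possible score of -1
    for r in range(ih - th + 1):
        for c in range(iw - tw + 1):
            patch = [row[c:c + tw] for row in image[r:r + th]]
            score = ncc(patch, template)
            if score > best[0]:
                best = (score, (r, c))
    return best

def classify(roi, templates):
    """Return the label whose template gives the highest best-match score."""
    return max(templates, key=lambda label: match_template(roi, templates[label])[0])

# Toy data (assumed, not from the paper): a 2x2 pattern embedded in a 4x5 image.
image = [
    [0, 0, 0, 0, 0],
    [0, 9, 1, 0, 0],
    [0, 1, 9, 0, 0],
    [0, 0, 0, 0, 0],
]
templates = {"diag": [[9, 1], [1, 9]], "anti": [[1, 9], [9, 1]]}
score, position = match_template(image, templates["diag"])
label = classify(image, templates)
```

In this toy case the best match for the `diag` template sits at row 1, column 1, and `classify` picks `diag`. The exhaustive sliding window costs time proportional to image area times template area, so a practical system would likely rely on an optimized library routine instead.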

Machine Learning Approach for Object Recognition

International Journal of Modeling and Optimization ◽ 10.7763/ijmo.2012.v2.196 ◽ 2012 ◽ pp. 622-628 ◽ Cited By ~ 1

Author(s): V. N. Pawar ◽ S. N. Talbar

Keyword(s): Machine Learning ◽ Object Recognition ◽ Learning Approach ◽ Machine Learning Approach

Confidence Analysis for Multi-Class Object Recognition using the Intermediate Values from Machine Learning Algorithms

The Journal of The Institute of Image Information and Television Engineers ◽ 10.3169/itej.69.j257 ◽ 2015 ◽ Vol 69 (8) ◽ pp. J257-J260

Author(s): Toshihiko Yamasaki ◽ Shinnosuke Ohshima ◽ Kiyoharu Aizawa

Keyword(s): Machine Learning ◽ Object Recognition ◽ Learning Algorithms ◽ Machine Learning Algorithms

Information Mandala: Statistical Distance Matrix with Clustering

10.36227/techrxiv.14271545.v1 ◽ 2021

Author(s): Xin Lu

Keyword(s): Machine Learning ◽ Object Recognition ◽ Hierarchical Clustering ◽ Distance Function ◽ Metric Space ◽ Probability Distributions ◽ Distance Matrix ◽ Scalar Output ◽ Statistical Distance ◽ Image Pixels

In machine learning, observation features are measured in a metric space to obtain their distance function for optimization. Given similar features that are statistically sufficient as a population, a statistical distance between two probability distributions can be calculated for more precise learning. Provided the observed features are multi-valued, the statistical distance function is still efficient. However, due to its scalar output, it cannot be applied to represent detailed distances between feature elements. To resolve this problem, this paper extends the traditional statistical distance to a matrix form, called a statistical distance matrix. The proposed approach performs well in object recognition tasks and clearly and intuitively represents the dissimilarities between cat and dog images in the CIFAR dataset, even when directly calculated using the image pixels. By using the hierarchical clustering of the statistical distance matrix, the image pixels can be separated into several clusters that are geometrically arranged around a center like a Mandala pattern. The statistical distance matrix with clustering is called the Information Mandala. (This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible)
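
The step from a scalar statistical distance to a matrix of distances can be sketched as follows. The abstract does not state which statistical distance the paper uses or how the matrix is indexed, so everything here is an assumption: the sketch uses the one-dimensional Wasserstein-1 distance between empirical distributions, and it defines entry `D[i][j]` as the distance between feature element `i` in one population and feature element `j` in the other. The names `wasserstein_1d` and `statistical_distance_matrix` are hypothetical.

```python
def wasserstein_1d(xs, ys):
    """Wasserstein-1 distance between two empirical 1D distributions.

    For equal sample counts this is the mean absolute difference of the
    sorted samples. Unequal counts are outside the scope of this sketch.
    """
    if len(xs) != len(ys):
        raise ValueError("this sketch assumes equal sample counts")
    return sum(abs(x - y) for x, y in zip(sorted(xs), sorted(ys))) / len(xs)

def statistical_distance_matrix(pop_a, pop_b):
    """Assumed layout: each population is a list of feature vectors.

    D[i][j] compares the distribution of feature i across pop_a with the
    distribution of feature j across pop_b.
    """
    cols_a = list(zip(*pop_a))  # one tuple of samples per feature element
    cols_b = list(zip(*pop_b))
    return [[wasserstein_1d(ca, cb) for cb in cols_b] for ca in cols_a]

# Toy populations (assumed): two observations, two feature elements each.
pop_a = [[0, 10], [2, 12]]
pop_b = [[1, 10], [3, 12]]
D = statistical_distance_matrix(pop_a, pop_b)
```

Here `D` is `[[1.0, 10.0], [9.0, 0.0]]`, so the matrix keeps a separate distance for every pair of feature elements instead of collapsing them into one number. Hierarchical clustering of the rows of such a matrix, as the abstract describes, would be a further step that this sketch does not cover.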
