Structuring Visual Words in 3D for Arbitrary-View Object Localization

The technology in current research scenario is marching towards automation forhigher productivity with accurate and precise product development. Vision andRobotics are domains which work to create autonomous systems and are the keytechnology in quest for mass productivity. The automation in an industry canbe achieved by detecting interactive objects and estimating the pose to manipulatethem. Therefore the object localization ( i.e., pose) includes position andorientation of object, has profound ?significance. The application of object poseestimation varies from industry automation to entertainment industry and fromhealth care to surveillance. The objective of pose estimation of objects is verysigni?cant in many cases, like in order for the robots to manipulate the objects,for accurate rendering of Augmented Reality (AR) among others.This thesis tries to solve the issue of object pose estimation using 3D dataof scene acquired from 3D sensors (e.g. Kinect, Orbec Astra Pro among others).The 3D data has an advantage of independence from object texture and invarianceto illumination. The proposal is divided into two phases : An o?ine phasewhere the 3D model template of the object ( for estimation of pose) is built usingIterative Closest Point (ICP) algorithm. And an online phase where the pose ofthe object is estimated by aligning the scene to the model using ICP, providedwith an initial alignment using 3D descriptors (like Fast Point Feature Transform(FPFH)).The approach we develop is to be integrated on two di?erent platforms :1)Humanoid robot `Pyrene' which has Orbec Astra Pro 3D sensor for data acquisition,and 2)Unmanned Aerial Vehicle (UAV) which has Intel Realsense Euclidon it. The datasets of objects (like electric drill, brick, a small cylinder, cake box)are acquired using Microsoft Kinect, Orbec Astra Pro and Intel RealSense Euclidsensors to test the performance of this technique. The objects which are used totest this approach are the ones which are used by robot. This technique is testedin two scenarios, fi?rstly, when the object is on the table and secondly when theobject is held in hand by a person. The range of objects from the sensor is 0.6to 1.6m. This technique could handle occlusions of the object by hand (when wehold the object), as ICP can work even if partial object is visible in the scene.

Download Full-text

A computer vision approach to identifying the manufacturer and model of anterior cervical spinal hardware

Journal of Neurosurgery Spine ◽

10.3171/2019.6.spine19463 ◽

2019 ◽

Vol 31 (6) ◽

pp. 844-850 ◽

Cited By ~ 1

Author(s):

Kevin T. Huang ◽

Michael A. Silva ◽

Alfred P. See ◽

Kyle C. Wu ◽

Troy Gallerani ◽

...

Keyword(s):

Computer Vision ◽

Feature Detection ◽

High Accuracy ◽

Detection Accuracy ◽

Data Sets ◽

Visual Words ◽

Fusion Systems ◽

Kaze Feature ◽

Applications Of Machine Learning ◽

Cervical Plating

OBJECTIVERecent advances in computer vision have revolutionized many aspects of society but have yet to find significant penetrance in neurosurgery. One proposed use for this technology is to aid in the identification of implanted spinal hardware. In revision operations, knowing the manufacturer and model of previously implanted fusion systems upfront can facilitate a faster and safer procedure, but this information is frequently unavailable or incomplete. The authors present one approach for the automated, high-accuracy classification of anterior cervical hardware fusion systems using computer vision.METHODSPatient records were searched for those who underwent anterior-posterior (AP) cervical radiography following anterior cervical discectomy and fusion (ACDF) at the authors’ institution over a 10-year period (2008–2018). These images were then cropped and windowed to include just the cervical plating system. Images were then labeled with the appropriate manufacturer and system according to the operative record. A computer vision classifier was then constructed using the bag-of-visual-words technique and KAZE feature detection. Accuracy and validity were tested using an 80%/20% training/testing pseudorandom split over 100 iterations.RESULTSA total of 321 total images were isolated containing 9 different ACDF systems from 5 different companies. The correct system was identified as the top choice in 91.5% ± 3.8% of the cases and one of the top 2 or 3 choices in 97.1% ± 2.0% and 98.4 ± 13% of the cases, respectively. Performance persisted despite the inclusion of variable sizes of hardware (i.e., 1-level, 2-level, and 3-level plates). Stratification by the size of hardware did not improve performance.CONCLUSIONSA computer vision algorithm was trained to classify at least 9 different types of anterior cervical fusion systems using relatively sparse data sets and was demonstrated to perform with high accuracy. This represents one of many potential clinical applications of machine learning and computer vision in neurosurgical practice.

Download Full-text

Superquadrics Model-based 3D Object Localization Algorithm

ROBOT ◽

10.3724/sp.j.1218.2013.00439 ◽

2013 ◽

Vol 35 (4) ◽

pp. 439 ◽

Cited By ~ 2

Author(s):

Lin WANG ◽

Jianfu CAO ◽

Chongzhao HAN

Keyword(s):

Object Localization ◽

Localization Algorithm ◽

3D Object ◽

Model Based

Download Full-text

Robust Acoustic Event Classification Using Bag-of-Visual-Words

10.21437/interspeech.2018-1905 ◽

2018 ◽

Cited By ~ 1

Author(s):

Manjunath Mulimani ◽

Shashidhar G Koolagudi

Keyword(s):

Bag Of Visual Words ◽

Event Classification ◽

Visual Words ◽

Acoustic Event

Download Full-text

Natural Scene Image Annotation Using Local Semantic Concepts and Spatial Bag of Visual Words

International Journal of Sensors Wireless Communications and Control ◽

10.2174/2210327906666160606152043 ◽

2016 ◽

Vol 6 (3) ◽

pp. 153-173

Author(s):

Yousef Alqasrawi

Keyword(s):

Image Annotation ◽

Bag Of Visual Words ◽

Natural Scene ◽

Visual Words ◽

Scene Image ◽

Semantic Concepts

Download Full-text

Efficient object localization with gaussianized vector representation

Proceedings of the 1st international workshop on Interactive multimedia for consumer electronics - IMCE '09 ◽

10.1145/1631040.1631055 ◽

2009 ◽

Cited By ~ 1

Author(s):

Xiaodan Zhuang ◽

Xi Zhou ◽

Mark A. Hasegawa-Johnson ◽

Thomas S. Huang

Keyword(s):

Object Localization ◽

Vector Representation

Download Full-text

Explainable Fully Connected Visual Words for the Classification of Skin Cancer Confocal Images

11th Hellenic Conference on Artificial Intelligence ◽

10.1145/3411408.3411435 ◽

2020 ◽

Author(s):

Athanasios Kallipolitis ◽

Alexandros Stratigos ◽

Alexios Zarras ◽

Ilias Maglogiannis

Keyword(s):

Skin Cancer ◽

Visual Words ◽

Confocal Images ◽

Fully Connected

Download Full-text

Multi-camera Object Localization in Intelligent Transportation Systems

2020 28th Telecommunications Forum (TELFOR) ◽

10.1109/telfor51502.2020.9306541 ◽

2020 ◽

Author(s):

Dejan Bordoski ◽

Srdan Usorac ◽

Dragan Samardzija ◽

Zeljko Lukac

Keyword(s):

Intelligent Transportation Systems ◽

Intelligent Transportation ◽

Transportation Systems ◽

Object Localization

Download Full-text

Contrastive consistent feature learning for weakly supervised object localization semantic segmentation

Neurocomputing ◽

10.1016/j.neucom.2021.03.023 ◽

2021 ◽

Author(s):

Minsong Ki ◽

Youngjung Uh ◽

Wonyoung Lee ◽

Hyeran Byun

Keyword(s):

Feature Learning ◽

Semantic Segmentation ◽

Object Localization ◽

Consistent Feature ◽

Weakly Supervised

Download Full-text