Stiffness Estimation from Vision and Touch Using Object Detection and Probabilistic Model: An Application to Object Identification

Author(s):  
Masahiro Kamigaki ◽  
Seiichiro Katsura
2017 ◽  
Vol 2 (1) ◽  
pp. 80-87
Author(s):  
Puyda V. ◽  
◽  
Stoian. A.

Detecting objects in a video stream is a typical problem in modern computer vision systems that are used in multiple areas. Object detection can be done on both static images and on frames of a video stream. Essentially, object detection means finding color and intensity non-uniformities which can be treated as physical objects. Beside that, the operations of finding coordinates, size and other characteristics of these non-uniformities that can be used to solve other computer vision related problems like object identification can be executed. In this paper, we study three algorithms which can be used to detect objects of different nature and are based on different approaches: detection of color non-uniformities, frame difference and feature detection. As the input data, we use a video stream which is obtained from a video camera or from an mp4 video file. Simulations and testing of the algoritms were done on a universal computer based on an open-source hardware, built on the Broadcom BCM2711, quad-core Cortex-A72 (ARM v8) 64-bit SoC processor with frequency 1,5GHz. The software was created in Visual Studio 2019 using OpenCV 4 on Windows 10 and on a universal computer operated under Linux (Raspbian Buster OS) for an open-source hardware. In the paper, the methods under consideration are compared. The results of the paper can be used in research and development of modern computer vision systems used for different purposes. Keywords: object detection, feature points, keypoints, ORB detector, computer vision, motion detection, HSV model color


2020 ◽  
Vol 8 (2) ◽  
pp. 112 ◽  
Author(s):  
Peng Chen ◽  
Ying Li ◽  
Hui Zhou ◽  
Bingxin Liu ◽  
Peng Liu

The synthetic aperture radar (SAR) has a special ability to detect objects in any climate and weather conditions. Consequently, SAR images are widely used in maritime transportation safety and fishery law enforcement for maritime object detection. Currently, deep-learning models are being extensively used for the detection of objects from images. Among them, the feature pyramid network (FPN) uses pyramids for representing semantic information regardless of the scale and has an improved accuracy of object detection. It is also suitable for the detection of multiple small ship objects in SAR images. This study aims to resolve the problems associated with small-object and multi-object ship detection in complex scenarios e.g., when a ship nears the port, by proposing a detection method based on an optimized FPN model. The feature pyramid model is first embedded in a traditional region proposal network (RPN) and mapped into a new feature space for object identification. Subsequently, the k-means clustering algorithm based on the shape similar distance (SSD) measure is used to optimize the FPN. Initial anchor boxes and tests are created using the SAR ship dataset. Experimental results show that the proposed algorithm for object detection shows an accuracy of 98.62%. Compared with Yolo, the RPN based on VGG/ResNet, FPN based on VGG/ResNet, and other models in complex scenarios, the proposed model shows a higher accuracy rate and better overall performance.


Author(s):  
S Gopi Naik

Abstract: The plan is to establish an integrated system that can manage high-quality visual information and also detect weapons quickly and efficiently. It is obtained by integrating ARM-based computer vision and optimization algorithms with deep neural networks able to detect the presence of a threat. The whole system is connected to a Raspberry Pi module, which will capture live broadcasting and evaluate it using a deep convolutional neural network. Due to the intimate interaction between object identification and video and image analysis in real-time objects, By generating sophisticated ensembles that incorporate various low-level picture features with high-level information from object detection and scenario classifiers, their performance can quickly plateau. Deep learning models, which can learn semantic, high-level, deeper features, have been developed to overcome the issues that are present in optimization algorithms. It presents a review of deep learning based object detection frameworks that use Convolutional Neural Network layers for better understanding of object detection. The Mobile-Net SSD model behaves differently in network design, training methods, and optimization functions, among other things. The crime rate in suspicious areas has been reduced as a consequence of weapon detection. However, security is always a major concern in human life. The Raspberry Pi module, or computer vision, has been extensively used in the detection and monitoring of weapons. Due to the growing rate of human safety protection, privacy and the integration of live broadcasting systems which can detect and analyse images, suspicious areas are becoming indispensable in intelligence. This process uses a Mobile-Net SSD algorithm to achieve automatic weapons and object detection. Keywords: Computer Vision, Weapon and Object Detection, Raspberry Pi Camera, RTSP, SMTP, Mobile-Net SSD, CNN, Artificial Intelligence.


Author(s):  
D.P. Tripathy ◽  
K. Guru Raghavendra Reddy

Moving object detection is an important task in many computer vision classifications applications. The goal of this study is to identify a moving object detection method that provides a reliable and accurate identification of objects on the conveyor belt. In this paper, a study of the moving object detection methods is presented. Firstly, moving object detection pixel by pixel was performed using background subtraction, frame difference method. The threshold value in both background subtraction and frame difference is a fixed value, which determines the accuracy of object identification. The adaptive threshold values were calculated for both the methods to improve the accuracy. The performance of these methods was compared with the ground truth image.


2020 ◽  
Vol 17 (1) ◽  
pp. 439-444 ◽  
Author(s):  
Katamneni Vinaya Sree ◽  
G. Jeyakumar

In the given image identifying the existence of a required object is the concern of the object detection process. This is quite natural for Human without any effort, however making a machine to detect an object in image is tedious. To make machines to recognize the objects, the feature descriptor algorithms are to be implemented. The general object detection approaches use collection of local and global descriptors to represent an image. Difficulties arise during this process when there is variation in lightening, positioning, rotation, mirroring, occlusion, scaling etc., of the same object in different image scenes. To overcome these difficulties, we need combination of features that detects the object in the image scene. But there exist lot of descriptors that can be used. Hence, finding the required number of feature descriptors for object detection is a crucial task. The question that comes out here is how to select the optimum number of descriptors to achieve optimum accuracy? The answer for the question is an optimization algorithm, which can be employed to select the best combination of the descriptors with maximum detection accuracy. This paper proposing an Evolutionary Computation (EC) based approach with the Differential Evolution (DE) algorithm to find the optimal combination of feature descriptors to achieve optimal object detection accuracy. The proposed approach is implemented and its superiority is verified with four different images and results obtained are presented in this paper.


2021 ◽  
Vol 13 (12) ◽  
pp. 307
Author(s):  
Vijayakumar Varadarajan ◽  
Dweepna Garg ◽  
Ketan Kotecha

Deep learning is a relatively new branch of machine learning in which computers are taught to recognize patterns in massive volumes of data. It primarily describes learning at various levels of representation, which aids in understanding data that includes text, voice, and visuals. Convolutional neural networks have been used to solve challenges in computer vision, including object identification, image classification, semantic segmentation and a lot more. Object detection in videos involves confirming the presence of the object in the image or video and then locating it accurately for recognition. In the video, modelling techniques suffer from high computation and memory costs, which may decrease performance measures such as accuracy and efficiency to identify the object accurately in real-time. The current object detection technique based on a deep convolution neural network requires executing multilevel convolution and pooling operations on the entire image to extract deep semantic properties from it. For large objects, detection models can provide superior results; however, those models fail to detect the varying size of the objects that have low resolution and are greatly influenced by noise because the features after the repeated convolution operations of existing models do not fully represent the essential characteristics of the objects in real-time. With the help of a multi-scale anchor box, the proposed approach reported in this paper enhances the detection accuracy by extracting features at multiple convolution levels of the object. The major contribution of this paper is to design a model to understand better the parameters and the hyper-parameters which affect the detection and the recognition of objects of varying sizes and shapes, and to achieve real-time object detection and recognition speeds by improving accuracy. The proposed model has achieved 84.49 mAP on the test set of the Pascal VOC-2007 dataset at 11 FPS, which is comparatively better than other real-time object detection models.


Object identification and multi-object picture separation are two firmly related processes and it can be enhanced when understood jointly by supporting data from one assignment to the next. Be that as it may, current best in object models are different portrayal for each space creation joint objects and leaving the categorization of numerous part of the scene uncertain. Picture element appearance highlights enable us to do well on classifying formless foundation classes, while the express portrayal of districts encourage the calculation of increasingly complex highlights essential for object detection. Vitally, our model gives a solitary bound together portrayal of the scene we clarify each picture elements of image and authorize it contains in the web between every random variable in our model.


2021 ◽  
Vol 50 (1) ◽  
pp. 13-27
Author(s):  
Binbin Shi ◽  
Xun Li ◽  
Tingting Nie ◽  
Kaibin Zhang ◽  
Wenjie Wang

A method of vehicle multi-object identification and classification based on the YOLOv2 algorithm is proposed to solve the problems of low detection rate, poor robustness, and unsatisfactory classification effect for the classical multi-object detection and vehicle type classification on real road environment. Based on the YOLOv2 algorithm, the network structure of YOLOv2-voc is improved according to the actual road conditions. The classification training model was obtained based on the ImageNet data and fine-tuning technology, according to the analysis of training results and vehicle object characteristics. This paper proposed the improved vehicle identification classification network structure, namely called YOLOv2-voc_mul. In order to verify the validity of the detection method, experiments are performed using samples from simple backgrounds and complex backgrounds and compared with the existing YOLOv2, YOLOv2-voc, and YOLOv3 models after 70000 iterations, respectively. The results show that the proposed YOLOv2-voc_mul model has an accuracy of 98.6% under the simple background, and the mAP (mean Average Precision) of different models reaches 87.81%. Under the complex background, the improved YOLOv2-voc_mul model has an average accuracy of 92.09% and 89.64% for single and multi-object detection of four different models. In summary, our proposed method has better accuracy, a low false detection rate, and good robustness.


Sign in / Sign up

Export Citation Format

Share Document