Spherical object recognition based on the number of contour edges extracted by fitting and convex hull processing

The detection of spherical targets in workpiece shape clustering and fruit classification tasks is challenging. Spherical targets produce low detection accuracy in complex fields, and single-feature processing cannot accurately recognize spheres. Therefore, a novel spherical descriptor (SD) for contour fitting and convex hull processing is proposed. The SD achieves image de-noising by combining flooding processing and morphological operations. The number of polygon-fitted edges is obtained by convex hull processing based on contour extraction and fitting, and two RGB images of the same group of objects are obtained from different directions. The two fitted edges of the same target object obtained at two RGB images are extracted to form a two-dimensional array. The target object is defined as a sphere if the two values of the array are greater than a custom threshold. The first classification result is obtained by an improved K-NN algorithm. Circle detection is then performed on the results using improved Hough circle detection. We abbreviate it as a new Hough transform sphere descriptor (HSD). Experiments demonstrate that recognition of spherical objects is obtained with 98.8% accuracy. Therefore, experimental results show that our method is compared with other latest methods, HSD has higher identification accuracy than other methods.

Download Full-text

The Image Positioning and Segmentation Techniques about Gauge Visual Detection System between High Signals and Contact Net Based on Target Feature

Key Engineering Materials ◽

10.4028/www.scientific.net/kem.522.351 ◽

2012 ◽

Vol 522 ◽

pp. 351-354

Author(s):

Xiao Mei Li ◽

Xi Lin Zhu ◽

Yong Yu ◽

Xiang Zou ◽

Chen Jun Huang

Keyword(s):

Design Process ◽

Contact Line ◽

Visual Detection ◽

Detection System ◽

Structural Characteristics ◽

Target Object ◽

Morphological Operations ◽

Circle Detection ◽

Target Feature ◽

Sobel Operator

In the design process of gauge visual detection system between high signals and contact net, it is need to do target positioning and segmentation between high signals and contact line. The paper first analyzes the structural characteristics of high signals and contact line, and then uses the Sobel operator for edge detection, and uses first close and then open of image morphological operations for edge treatment, finally uses Hough transform for line and circle detection to extract the object's edge. This positioning and segmentation of target object would be achieved.

Download Full-text

Visual attention strategies for target object detection

10.26686/wgtn.17067635 ◽

2021 ◽

Author(s):

◽

Ibrahim Mohammad Hussain Rahman

Keyword(s):

Visual Attention ◽

Object Detection ◽

Target Object ◽

Detection Accuracy ◽

Estimation Model ◽

Top Down ◽

Bottom Up ◽

Feature Map ◽

Low Level ◽

Visual Tasks

<p>The human visual attention system (HVA) encompasses a set of interconnected neurological modules that are responsible for analyzing visual stimuli by attending to those regions that are salient. Two contrasting biological mechanisms exist in the HVA systems; bottom-up, data-driven attention and top-down, task-driven attention. The former is mostly responsible for low-level instinctive behaviors, while the latter is responsible for performing complex visual tasks such as target object detection. Very few computational models have been proposed to model top-down attention, mainly due to three reasons. The first is that the functionality of top-down process involves many influential factors. The second reason is that there is a diversity in top-down responses from task to task. Finally, many biological aspects of the top-down process are not well understood yet. For the above reasons, it is difficult to come up with a generalized top-down model that could be applied to all high level visual tasks. Instead, this thesis addresses some outstanding issues in modelling top-down attention for one particular task, target object detection. Target object detection is an essential step for analyzing images to further perform complex visual tasks. Target object detection has not been investigated thoroughly when modelling top-down saliency and hence, constitutes the may domain application for this thesis. The thesis will investigate methods to model top-down attention through various high-level data acquired from images. Furthermore, the thesis will investigate different strategies to dynamically combine bottom-up and top-down processes to improve the detection accuracy, as well as the computational efficiency of the existing and new visual attention models. The following techniques and approaches are proposed to address the outstanding issues in modelling top-down saliency: 1. A top-down saliency model that weights low-level attentional features through contextual knowledge of a scene. The proposed model assigns weights to features of a novel image by extracting a contextual descriptor of the image. The contextual descriptor plays the role of tuning the weighting of low-level features to maximize detection accuracy. By incorporating context into the feature weighting mechanism we improve the quality of the assigned weights to these features. 2. Two modules of target features combined with contextual weighting to improve detection accuracy of the target object. In this proposed model, two sets of attentional feature weights are learned, one through context and the other through target features. When both sources of knowledge are used to model top-down attention, a drastic increase in detection accuracy is achieved in images with complex backgrounds and a variety of target objects. 3. A top-down and bottom-up attention combination model based on feature interaction. This model provides a dynamic way for combining both processes by formulating the problem as feature selection. The feature selection exploits the interaction between these features, yielding a robust set of features that would maximize both the detection accuracy and the overall efficiency of the system. 4. A feature map quality score estimation model that is able to accurately predict the detection accuracy score of any previously novel feature map without the need of groundtruth data. The model extracts various local, global, geometrical and statistical characteristic features from a feature map. These characteristics guide a regression model to estimate the quality of a novel map. 5. A dynamic feature integration framework for combining bottom-up and top-down saliencies at runtime. If the estimation model is able to predict the quality score of any novel feature map accurately, then it is possible to perform dynamic feature map integration based on the estimated value. We propose two frameworks for feature map integration using the estimation model. The proposed integration framework achieves higher human fixation prediction accuracy with minimum number of feature maps than that achieved by combining all feature maps. The proposed works in this thesis provide new directions in modelling top-down saliency for target object detection. In addition, dynamic approaches for top-down and bottom-up combination show considerable improvements over existing approaches in both efficiency and accuracy.</p>

Download Full-text

RICNN: A ResNet&Inception convolutional neural network for intrusion detection of abnormal traffic

Computer Science and Information Systems ◽

10.2298/csis210617055x ◽

2021 ◽

pp. 55-55

Author(s):

Benhui Xia ◽

Dezhi Han ◽

Ximing Yin ◽

Gao Na

Keyword(s):

Neural Network ◽

Intrusion Detection ◽

Convolutional Neural Network ◽

Detection Rate ◽

Recognition Rate ◽

Identification Accuracy ◽

Detection Accuracy ◽

Traffic Classification ◽

High Detection Rate ◽

Network Intrusion

To secure cloud computing and outsourced data while meeting the requirements of automation, many intrusion detection schemes based on deep learn ing are proposed. Though the detection rate of many network intrusion detection solutions can be quite high nowadays, their identification accuracy on imbalanced abnormal network traffic still remains low. Therefore, this paper proposes a ResNet &Inception-based convolutional neural network (RICNN) model to abnormal traffic classification. RICNN can learn more traffic features through the Inception unit, and the degradation problem of the network is eliminated through the direct map ping unit of ResNet, thus the improvement of the model?s generalization ability can be achievable. In addition, to simplify the network, an improved version of RICNN, which makes it possible to reduce the number of parameters that need to be learnt without degrading identification accuracy, is also proposed in this paper. The experimental results on the dataset CICIDS2017 show that RICNN not only achieves an overall accuracy of 99.386% but also has a high detection rate across different categories, especially for small samples. The comparison experiments show that the recognition rate of RICNN outperforms a variety of CNN models and RNN models, and the best detection accuracy can be achieved.

Download Full-text

Foreign Circular Element Detection in Chest X-Rays for Effective Automated Pulmonary Abnormality Screening

International Journal of Computer Vision and Image Processing ◽

10.4018/ijcvip.2017040103 ◽

2017 ◽

Vol 7 (2) ◽

pp. 36-49 ◽

Cited By ~ 7

Author(s):

Fatema Tuz Zohora ◽

K.C. Santosh

Keyword(s):

Computational Time ◽

Detection Accuracy ◽

Morphological Operations ◽

Potential Candidate ◽

X Rays ◽

Detection Algorithms ◽

Digital Chest ◽

Chest X Ray ◽

Pulmonary Abnormality ◽

A New Technique

In automated chest X-ray screening (to detect pulmonary abnormality: Tuberculosis (TB), for instance), the presence of foreign element such as buttons and medical devices hinders its performance. In this paper, using digital chest radiographs, the authors present a new technique to detect circular foreign element, within the lung regions. They first compute edge map by using several different edge detection algorithms, which is followed by morphological operations for potential candidate selection. These candidates are then confirmed by using circular Hough transform (CHT). In their test, the authors have achieved precision, recall, and F1 score of 96%, 90%, and 92%, respectively with lung segmentation. Compared to state-of-the-art work, their technique excels performance in terms of both detection accuracy and computational time.

Download Full-text

SHIP DETECTION BASED ON MULTIPLE FEATURES IN RANDOM FOREST MODEL FOR HYPERSPECTRAL IMAGES

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xlii-3-891-2018 ◽

2018 ◽

Vol XLII-3 ◽

pp. 891-895

Author(s):

N. Li ◽

L. Ding ◽

H. Zhao ◽

J. Shi ◽

D. Wang ◽

...

Keyword(s):

Random Forest ◽

Near Infrared ◽

Texture Features ◽

Short Wave ◽

Hyperspectral Images ◽

Support Vector ◽

High Signal ◽

Detection Accuracy ◽

Multiple Features ◽

Single Feature

A novel method for detecting ships which aim to make full use of both the spatial and spectral information from hyperspectral images is proposed. Firstly, the band which is high signal-noise ratio in the range of near infrared or short-wave infrared spectrum, is used to segment land and sea on Otsu threshold segmentation method. Secondly, multiple features that include spectral and texture features are extracted from hyperspectral images. Principal components analysis (PCA) is used to extract spectral features, the Grey Level Co-occurrence Matrix (GLCM) is used to extract texture features. Finally, Random Forest (RF) model is introduced to detect ships based on the extracted features. To illustrate the effectiveness of the method, we carry out experiments over the EO-1 data by comparing single feature and different multiple features. Compared with the traditional single feature method and Support Vector Machine (SVM) model, the proposed method can stably achieve the target detection of ships under complex background and can effectively improve the detection accuracy of ships.

Download Full-text

Graph-Based Visual Manipulation Relationship Reasoning Network for Robotic Grasping

Frontiers in Neurorobotics ◽

10.3389/fnbot.2021.719731 ◽

2021 ◽

Vol 15 ◽

Author(s):

Guoyu Zuo ◽

Jiayuan Tong ◽

Hongxing Liu ◽

Wenbai Chen ◽

Jianfeng Li

Keyword(s):

Contextual Information ◽

Target Object ◽

Robotic Grasping ◽

Convolutional Network ◽

Real Environment ◽

Rgb Images ◽

Robot Grasping ◽

Advanced Interaction ◽

Object Relationships

To grasp the target object stably and orderly in the object-stacking scenes, it is important for the robot to reason the relationships between objects and obtain intelligent manipulation order for more advanced interaction between the robot and the environment. This paper proposes a novel graph-based visual manipulation relationship reasoning network (GVMRN) that directly outputs object relationships and manipulation order. The GVMRN model first extracts features and detects objects from RGB images, and then adopts graph convolutional network (GCN) to collect contextual information between objects. To improve the efficiency of relation reasoning, a relationship filtering network is built to reduce object pairs before reasoning. The experiments on the Visual Manipulation Relationship Dataset (VMRD) show that our model significantly outperforms previous methods on reasoning object relationships in object-stacking scenes. The GVMRN model is also tested on the images we collected and applied on the robot grasping platform. The results demonstrated the generalization and applicability of our method in real environment.

Download Full-text

Storm-Drain and Manhole Detection Using the RetinaNet Method

Sensors ◽

10.3390/s20164450 ◽

2020 ◽

Vol 20 (16) ◽

pp. 4450 ◽

Cited By ~ 3

Author(s):

Anderson Santos ◽

José Marcato Junior ◽

Jonathan de Andrade Silva ◽

Rodrigo Pereira ◽

Daniel Matos ◽

...

Keyword(s):

Urban Areas ◽

Hydrological Modeling ◽

Drainage System ◽

Future Research ◽

Detection Accuracy ◽

Mobile Mapping ◽

Experimental Assessment ◽

Mato Grosso ◽

Rgb Images ◽

Storm Drain

As key-components of the urban-drainage system, storm-drains and manholes are essential to the hydrological modeling of urban basins. Accurately mapping of these objects can help to improve the storm-drain systems for the prevention and mitigation of urban floods. Novel Deep Learning (DL) methods have been proposed to aid the mapping of these urban features. The main aim of this paper is to evaluate the state-of-the-art object detection method RetinaNet to identify storm-drain and manhole in urban areas in street-level RGB images. The experimental assessment was performed using 297 mobile mapping images captured in 2019 in the streets in six regions in Campo Grande city, located in Mato Grosso do Sul state, Brazil. Two configurations of training, validation, and test images were considered. ResNet-50 and ResNet-101 were adopted in the experimental assessment as the two distinct feature extractor networks (i.e., backbones) for the RetinaNet method. The results were compared with the Faster R-CNN method. The results showed a higher detection accuracy when using RetinaNet with ResNet-50. In conclusion, the assessed DL method is adequate to detect storm-drain and manhole from mobile mapping RGB images, outperforming the Faster R-CNN method. The labeled dataset used in this study is available for future research.

Download Full-text

Wavelet Ideology and Morphological Operations for Analyzing RGB Images of Cytological Preparations

2019 7th International Conference on Future Internet of Things and Cloud Workshops (FiCloudW) ◽

10.1109/ficloudw.2019.00034 ◽

2019 ◽

Author(s):

Vyacheslav Lyashenko ◽

Oleg Kobylin ◽

Oleksandr Ryazantsev ◽

Ivan Ryazantsev ◽

Viktor Barbaruk ◽

...

Keyword(s):

Morphological Operations ◽

Rgb Images

Download Full-text

Is Coarse-to-Fine Tuning in Object Recognition One of Size or Scale?

Perception ◽

10.1068/v96l1104 ◽

1996 ◽

Vol 25 (1_suppl) ◽

pp. 48-48

Author(s):

J Fiser ◽

S Subramaniam ◽

I Biederman

Keyword(s):

Visual Presentation ◽

Target Object ◽

Fine Tuning ◽

Detection Accuracy ◽

Gray Level ◽

Octave Band ◽

Large Size ◽

Scale Size ◽

Coarse To Fine ◽

Strong Asymmetry

Subjects attempted to detect a target object specified by name (eg “chair”) in rapid serial visual presentation sequences, with each sequence consisting of 40 gray level, 72 ms images of common objects. On the 50% of the trials in which a target was present it was never in the first or last eight positions. In homogeneous sequences, the images were all of the same size (all large or all small differing by a factor of five) or scale (spatial frequency, SF, either all low-passed [2 cycles deg−1] or all high passed [10 cycles deg−1], in a 1.5 octave band). There was no difference in detection accuracy between the different sizes or the different scales. In the switched sequences, the target could differ in size or scale from all the other images in the sequence. A strong asymmetry now emerged in that small-size or high-passed targets were much more difficult to detect than when in homogeneous sequences, whereas the large-size or low-passed images were much easier! Is this coarse-to-fine tuning based on size or scale? When size was pitted against scale, in that targets could either be of small size and low SF or large size and high SF, the large-size objects were much easier to detect, with only a slight modulating effect of scale. Size — scale tuning thus operates on space (={size}) rather than SF, and requires some time for its implementation, though less than that required for the first eight images. Once implemented, the tuning can be completely efficient, as evidenced by the equivalent levels of performance in the homogeneous conditions.

Download Full-text

An Enhanced Approach for Number Plate Detection and Recognition

Recent Advances in Computer Science and Communications ◽

10.2174/2666255813999200904161500 ◽

2020 ◽

Vol 13 ◽

Author(s):

Diksha Kurchaniya ◽

Mohd. Aquib Ansari ◽

Durga Patel

Keyword(s):

Character Recognition ◽

High Speed ◽

Detection System ◽

Wiener Filter ◽

Noise Removal ◽

Motor Vehicles ◽

Detection Accuracy ◽

Morphological Operations ◽

Wide Range ◽

High Speed Vehicle

Introduction: The number of vehicles is increasing day by day in our life. The vehicle may violate traffic rules and cause accidents. The automatic number plate detection system (ANPR) plays a significant role to identify these vehicles. Number plate detection is very difficult sometimes because each country has its own format for representing the number plate and font types and sizes may also vary for different vehicles. The number of ANPR systems is available nowadays but still, it is a big problem to detect the number plate correctly in various scenarios like high-speed vehicle, number plate language, etc. Methods: In the development of this method, we mainly used wiener filter for noise removal, morphological operations for number plate localization, connected component algorithm for character segmentation, and template based matching for character recognition. Results: Our proposed methodology is providing promising results in terms of detection accuracy. Discussion: The automatic number plate detection system (ANPR) has wide range of applications because the license number is the crucial, commonly putative and essential identifier of motor vehicles. These applications include ticketless parking fee management, parking access automation, car theft prevention, security guide assistance, Motorway Road Tolling, Border Control, Journey Time Measurement, Law Enforcement and many more. Conclusion: In this paper, an enhanced approach of automatic number plate detection system is proposed using some different techniques which not only detect the number plate of the vehicle but also recognize each character present in the detected number plate image.

Download Full-text