Multi-modal Conditional Bounding Box Regression for Music Score Following

2019 ◽

pp. 295-299

Author(s):

Кonstantin А. Elshin ◽

Еlena I. Molchanova ◽

Мarina V. Usoltseva ◽

Yelena V. Likhoshway

Keyword(s):

Object Detection ◽

Loss Function ◽

Classification Accuracy ◽

Diatom Species ◽

Bounding Box ◽

Synedra Acus ◽

And Training

Using the TensorFlow Object Detection API, an approach to identifying and registering Baikal diatom species Synedra acus subsp. radians has been tested. As a result, a set of images was formed and training was conducted. It is shown that аfter 15000 training iterations, the total value of the loss function was obtained equal to 0,04. At the same time, the classification accuracy is equal to 95%, and the accuracy of construction of the bounding box is also equal to 95%.

Download Full-text

Improved WOMAC Score Following Treatment with Nanoparticle Phyllanthus Amarus Phonophoresis Gel for Knee Osteoarthritis

Indian Journal of Public Health Research & Development ◽

10.37506/v10/i12/2019/ijphrd/192093 ◽

2019 ◽

Vol 10 (12) ◽

pp. 1623

Author(s):

Decha Pinkaew ◽

Kanokwan Kiattisin ◽

Khanittha Wonglangka ◽

Pisittawoot Awoot

Keyword(s):

Knee Osteoarthritis ◽

Womac Score ◽

Phyllanthus Amarus ◽

Score Following

Download Full-text

Face detection in single and multiple images using different skin color models

Recent Advances in Computer Science and Communications ◽

10.2174/2666255813999200818202346 ◽

2020 ◽

Vol 13 ◽

Author(s):

Manpreet Kaur ◽

Jasdev Bhatti ◽

Mohit Kumar Kakkar ◽

Arun Upmanyu

Keyword(s):

Face Detection ◽

Skin Color ◽

Input Image ◽

Morphological Operations ◽

Reliability Model ◽

Multiple Images ◽

Face Region ◽

Bounding Box ◽

The Face ◽

Precision And Accuracy

Introduction: Face Detection is used in many different steams like video conferencing, human-computer interface, in face detection, and in the database management of image. Therefore, the aim of our paper is to apply Red Green Blue ( Methods: The morphological operations are performed in the face region to a number of pixels as the proposed parameter to check either an input image contains face region or not. Canny edge detection is also used to show the boundaries of a candidate face region, in the end, the face can be shown detected by using bounding box around the face. Results: The reliability model has also been proposed for detecting the faces in single and multiple images. The results of the experiments reflect that the algorithm been proposed performs very well in each model for detecting the faces in single and multiple images and the reliability model provides the best fit by analyzing the precision and accuracy. Moreover Discussion: The calculated results show that HSV model works best for single faced images whereas YCbCr and TSL models work best for multiple faced images. Also, the evaluated results by this paper provides the better testing strategies that helps to develop new techniques which leads to an increase in research effectiveness. Conclusion: The calculated value of all parameters is helpful for proving that the proposed algorithm has been performed very well in each model for detecting the face by using a bounding box around the face in single as well as multiple images. The precision and accuracy of all three models are analyzed through the reliability model. The comparison calculated in this paper reflects that HSV model works best for single faced images whereas YCbCr and TSL models work best for multiple faced images.

Download Full-text

3D-like Bounding Box for Vehicle Detection

2019 34rd Youth Academic Annual Conference of Chinese Association of Automation (YAC) ◽

10.1109/yac.2019.8787586 ◽

2019 ◽

Author(s):

Chao Wang ◽

Lukuan Zhou ◽

Jun Li ◽

Wankou Yang

Keyword(s):

Vehicle Detection ◽

Bounding Box

Download Full-text

Rethink the IoU-based loss functions for bounding box regression

2020 IEEE 9th Joint International Information Technology and Artificial Intelligence Conference (ITAIC) ◽

10.1109/itaic49862.2020.9339070 ◽

2020 ◽

Author(s):

Hongyu Zhai ◽

Jian Cheng ◽

Mengyong Wang

Keyword(s):

Loss Functions ◽

Bounding Box

Download Full-text

Automatic bounding-box-labeling method of occluded objects in virtual image data

Proceedings of the 5th International Conference on Multimedia and Image Processing ◽

10.1145/3381271.3381292 ◽

2020 ◽

Author(s):

Xinyue Wang ◽

Lingzhong Meng ◽

Yunzhi Xue

Keyword(s):

Image Data ◽

Virtual Image ◽

Bounding Box ◽

Occluded Objects ◽

Labeling Method

Download Full-text

Iterative Bounding Box Annotation for Object Detection

2020 25th International Conference on Pattern Recognition (ICPR) ◽

10.1109/icpr48806.2021.9412956 ◽

2021 ◽

Author(s):

Bishwo Adhikari ◽

Heikki Huttunen

Keyword(s):

Object Detection ◽

Bounding Box

Download Full-text

The Importance of Bounding Box in Motion Detection

2020 Fifth International Conference on Informatics and Computing (ICIC) ◽

10.1109/icic50835.2020.9288604 ◽

2020 ◽

Author(s):

Ahmad Fauzi ◽

Sarifuddin Madenda ◽

Ernastuti ◽

Eri Prasetyo Wibowo ◽

Anis Fitri Nur Masruriyah

Keyword(s):

Motion Detection ◽

Bounding Box

Download Full-text

Estimation of 6D Object Pose Using a 2D Bounding Box

Sensors ◽

10.3390/s21092939 ◽

2021 ◽

Vol 21 (9) ◽

pp. 2939

Author(s):

Yong Hong ◽

Jin Liu ◽

Zahid Jahangir ◽

Sheng He ◽

Qing Zhang

Keyword(s):

Neural Network ◽

Loss Function ◽

Three Dimensional ◽

Unit Vector ◽

Prediction Algorithm ◽

Computational Time ◽

Bounding Box ◽

Dimensional Unit ◽

Bounding Boxes ◽

Rgb Image

This paper provides an efficient way of addressing the problem of detecting or estimating the 6-Dimensional (6D) pose of objects from an RGB image. A quaternion is used to define an object′s three-dimensional pose, but the pose represented by q and the pose represented by -q are equivalent, and the L2 loss between them is very large. Therefore, we define a new quaternion pose loss function to solve this problem. Based on this, we designed a new convolutional neural network named Q-Net to estimate an object’s pose. Considering that the quaternion′s output is a unit vector, a normalization layer is added in Q-Net to hold the output of pose on a four-dimensional unit sphere. We propose a new algorithm, called the Bounding Box Equation, to obtain 3D translation quickly and effectively from 2D bounding boxes. The algorithm uses an entirely new way of assessing the 3D rotation (R) and 3D translation rotation (t) in only one RGB image. This method can upgrade any traditional 2D-box prediction algorithm to a 3D prediction model. We evaluated our model using the LineMod dataset, and experiments have shown that our methodology is more acceptable and efficient in terms of L2 loss and computational time.

Download Full-text