G2MF-WA: Geometric multi-model fitting with weakly annotated data

Chao Zhang; Xuequan Lu; Katsuya Hotta; Xi Yang

doi:10.1007/s41095-020-0166-8

G2MF-WA: Geometric multi-model fitting with weakly annotated data

Computational Visual Media ◽

10.1007/s41095-020-0166-8 ◽

2020 ◽

Vol 6 (2) ◽

pp. 135-145

Author(s):

Chao Zhang ◽

Xuequan Lu ◽

Katsuya Hotta ◽

Xi Yang

Keyword(s):

Prior Knowledge ◽

High Probability ◽

State Of The Art ◽

Model Fitting ◽

Homography Estimation ◽

Data Points ◽

Novel Method ◽

Art Techniques

Abstract In this paper we address the problem of geometric multi-model fitting using a few weakly annotated data points, which has been little studied so far. In weak annotating (WA), most manual annotations are supposed to be correct yet inevitably mixed with incorrect ones. SuchWA data can naturally arise through interaction in various tasks. For example, in the case of homography estimation, one can easily annotate points on the same plane or object with a single label by observing the image. Motivated by this, we propose a novel method to make full use of WA data to boost multi-model fitting performance. Specifically, a graph for model proposal sampling is first constructed using the WA data, given the prior that WA data annotated with the same weak label has a high probability of belonging to the same model. By incorporating this prior knowledge into the calculation of edge probabilities, vertices (i.e., data points) lying on or near the latent model are likely to be associated and further form a subset or cluster for effective proposal generation. Having generated proposals, a-expansion is used for labeling, and our method in return updates the proposals. This procedure works in an iterative way. Extensive experiments validate our method and show that it produces noticeably better results than state-of-the-art techniques in most cases.

Download Full-text

Hypergraph Optimization for Multi-Structural Geometric Model Fitting

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33018730 ◽

2019 ◽

Vol 33 ◽

pp. 8730-8737

Author(s):

Shuyuan Lin ◽

Guobao Xiao ◽

Yan Yan ◽

David Suter ◽

Hanzi Wang

Keyword(s):

Spectral Clustering ◽

Input Data ◽

State Of The Art ◽

Geometric Model ◽

Model Fitting ◽

Synthetic Data ◽

Estimation Algorithm ◽

Sampling Efficiency ◽

Data Points ◽

Fitting In

Recently, some hypergraph-based methods have been proposed to deal with the problem of model fitting in computer vision, mainly due to the superior capability of hypergraph to represent the complex relationship between data points. However, a hypergraph becomes extremely complicated when the input data include a large number of data points (usually contaminated with noises and outliers), which will significantly increase the computational burden. In order to overcome the above problem, we propose a novel hypergraph optimization based model fitting (HOMF) method to construct a simple but effective hypergraph. Specifically, HOMF includes two main parts: an adaptive inlier estimation algorithm for vertex optimization and an iterative hyperedge optimization algorithm for hyperedge optimization. The proposed method is highly efficient, and it can obtain accurate model fitting results within a few iterations. Moreover, HOMF can then directly apply spectral clustering, to achieve good fitting performance. Extensive experimental results show that HOMF outperforms several state-of-the-art model fitting methods on both synthetic data and real images, especially in sampling efficiency and in handling data with severe outliers.

Download Full-text

Collaboration Based Multi-Label Learning

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33013550 ◽

2019 ◽

Vol 33 ◽

pp. 3550-3557 ◽

Cited By ~ 8

Author(s):

Lei Feng ◽

Bo An ◽

Shuo He

Keyword(s):

Prior Knowledge ◽

State Of The Art ◽

Experimental Results ◽

Sparse Reconstruction ◽

Learning Approach ◽

Hypothesis Space ◽

Explicitly Correlated ◽

Label Correlations ◽

Novel Method ◽

Model Training

It is well-known that exploiting label correlations is crucially important to multi-label learning. Most of the existing approaches take label correlations as prior knowledge, which may not correctly characterize the real relationships among labels. Besides, label correlations are normally used to regularize the hypothesis space, while the final predictions are not explicitly correlated. In this paper, we suggest that for each individual label, the final prediction involves the collaboration between its own prediction and the predictions of other labels. Based on this assumption, we first propose a novel method to learn the label correlations via sparse reconstruction in the label space. Then, by seamlessly integrating the learned label correlations into model training, we propose a novel multi-label learning approach that aims to explicitly account for the correlated predictions of labels while training the desired model simultaneously. Extensive experimental results show that our approach outperforms the state-of-the-art counterparts.

Download Full-text

Quantized Residual Preference Based Linkage Clustering for Model Selection and Inlier Segmentation in Geometric Multi-Model Fitting

Sensors ◽

10.3390/s20133806 ◽

2020 ◽

Vol 20 (13) ◽

pp. 3806 ◽

Cited By ~ 2

Author(s):

Qing Zhao ◽

Yun Zhang ◽

Qianqing Qin ◽

Bin Luo

Keyword(s):

Model Selection ◽

State Of The Art ◽

Geometric Model ◽

Model Fitting ◽

Real Data ◽

Similarity Measurement ◽

Iterative Sampling ◽

Art Methods ◽

Data Points

In this paper, quantized residual preference is proposed to represent the hypotheses and the points for model selection and inlier segmentation in multi-structure geometric model fitting. First, a quantized residual preference is proposed to represent the hypotheses. Through a weighted similarity measurement and linkage clustering, similar hypotheses are put into one cluster, and hypotheses with good quality are selected from the clusters as the model selection results. After this, the quantized residual preference is also used to present the data points, and through the linkage clustering, the inliers belonging to the same model can be separated from the outliers. To exclude outliers as many as possible, an iterative sampling and clustering process is performed within the clustering process until the clusters are stable. The experiments undertake indicate that the proposed method performs even better on real data than the some state-of-the-art methods.

Download Full-text

Statistical Model Fitting: A State-of-the-Art Review

Contemporary Psychology ◽

10.1037/016731 ◽

1978 ◽

Vol 23 (11) ◽

pp. 937-938

Author(s):

JAMES R. KLUEGEL

Keyword(s):

Statistical Model ◽

State Of The Art ◽

Model Fitting

Download Full-text

COVID-19 infection map generation and detection from chest X-ray images

Health Information Science and Systems ◽

10.1007/s13755-021-00146-8 ◽

2021 ◽

Vol 9 (1) ◽

Author(s):

Aysen Degerli ◽

Mete Ahishali ◽

Mehmet Yamac ◽

Serkan Kiranyaz ◽

Muhammad E. H. Chowdhury ◽

...

Keyword(s):

State Of The Art ◽

Ground Truth ◽

Clinical Use ◽

X Ray ◽

Learning Techniques ◽

Map Generation ◽

Severity Grading ◽

Chest X Ray ◽

Novel Method ◽

Aided Diagnosis

AbstractComputer-aided diagnosis has become a necessity for accurate and immediate coronavirus disease 2019 (COVID-19) detection to aid treatment and prevent the spread of the virus. Numerous studies have proposed to use Deep Learning techniques for COVID-19 diagnosis. However, they have used very limited chest X-ray (CXR) image repositories for evaluation with a small number, a few hundreds, of COVID-19 samples. Moreover, these methods can neither localize nor grade the severity of COVID-19 infection. For this purpose, recent studies proposed to explore the activation maps of deep networks. However, they remain inaccurate for localizing the actual infestation making them unreliable for clinical use. This study proposes a novel method for the joint localization, severity grading, and detection of COVID-19 from CXR images by generating the so-called infection maps. To accomplish this, we have compiled the largest dataset with 119,316 CXR images including 2951 COVID-19 samples, where the annotation of the ground-truth segmentation masks is performed on CXRs by a novel collaborative human–machine approach. Furthermore, we publicly release the first CXR dataset with the ground-truth segmentation masks of the COVID-19 infected regions. A detailed set of experiments show that state-of-the-art segmentation networks can learn to localize COVID-19 infection with an F1-score of 83.20%, which is significantly superior to the activation maps created by the previous methods. Finally, the proposed approach achieved a COVID-19 detection performance with 94.96% sensitivity and 99.88% specificity.

Download Full-text

Gamma-ray Spectrometry in Geothermal Exploration: State of the Art Techniques

Energies ◽

10.3390/en7084757 ◽

2014 ◽

Vol 7 (8) ◽

pp. 4757-4780 ◽

Cited By ~ 17

Author(s):

Alistair McCay ◽

Thomas Harley ◽

Paul Younger ◽

David Sanderson ◽

Alan Cresswell

Keyword(s):

Gamma Ray ◽

State Of The Art ◽

Gamma Ray Spectrometry ◽

Geothermal Exploration ◽

Art Techniques

Download Full-text

Utterance Level Feature Aggregation with Deep Metric Learning for Speech Emotion Recognition

Sensors ◽

10.3390/s21124233 ◽

2021 ◽

Vol 21 (12) ◽

pp. 4233

Author(s):

Bogdan Mocanu ◽

Ruxandra Tapu ◽

Titus Zaharia

Keyword(s):

Emotion Recognition ◽

Loss Function ◽

State Of The Art ◽

Disease Diagnosis ◽

Data Representation ◽

Speech Emotion Recognition ◽

Audio Features ◽

Global Accuracy ◽

Space Data ◽

Art Techniques

Emotion is a form of high-level paralinguistic information that is intrinsically conveyed by human speech. Automatic speech emotion recognition is an essential challenge for various applications; including mental disease diagnosis; audio surveillance; human behavior understanding; e-learning and human–machine/robot interaction. In this paper, we introduce a novel speech emotion recognition method, based on the Squeeze and Excitation ResNet (SE-ResNet) model and fed with spectrogram inputs. In order to overcome the limitations of the state-of-the-art techniques, which fail in providing a robust feature representation at the utterance level, the CNN architecture is extended with a trainable discriminative GhostVLAD clustering layer that aggregates the audio features into compact, single-utterance vector representation. In addition, an end-to-end neural embedding approach is introduced, based on an emotionally constrained triplet loss function. The loss function integrates the relations between the various emotional patterns and thus improves the latent space data representation. The proposed methodology achieves 83.35% and 64.92% global accuracy rates on the RAVDESS and CREMA-D publicly available datasets, respectively. When compared with the results provided by human observers, the gains in global accuracy scores are superior to 24%. Finally, the objective comparative evaluation with state-of-the-art techniques demonstrates accuracy gains of more than 3%.

Download Full-text

ART-UP: A Novel Method for Generating Scanning-Robust Aesthetic QR Codes

ACM Transactions on Multimedia Computing Communications and Applications ◽

10.1145/3418214 ◽

2021 ◽

Vol 17 (1) ◽

pp. 1-23

Author(s):

Mingliang Xu ◽

Qingfeng Li ◽

Jianwei Niu ◽

Hao Su ◽

Xiting Liu ◽

...

Keyword(s):

State Of The Art ◽

Visual Quality ◽

Qr Code ◽

Quick Response ◽

Estimation Model ◽

Qr Codes ◽

Excellent Performance ◽

Novel Method ◽

Coarse To Fine

Quick response (QR) codes are usually scanned in different environments, so they must be robust to variations in illumination, scale, coverage, and camera angles. Aesthetic QR codes improve the visual quality, but subtle changes in their appearance may cause scanning failure. In this article, a new method to generate scanning-robust aesthetic QR codes is proposed, which is based on a module-based scanning probability estimation model that can effectively balance the tradeoff between visual quality and scanning robustness. Our method locally adjusts the luminance of each module by estimating the probability of successful sampling. The approach adopts the hierarchical, coarse-to-fine strategy to enhance the visual quality of aesthetic QR codes, which sequentially generate the following three codes: a binary aesthetic QR code, a grayscale aesthetic QR code, and the final color aesthetic QR code. Our approach also can be used to create QR codes with different visual styles by adjusting some initialization parameters. User surveys and decoding experiments were adopted for evaluating our method compared with state-of-the-art algorithms, which indicates that the proposed approach has excellent performance in terms of both visual quality and scanning robustness.

Download Full-text

Reconfigurable Intelligent Surface Aided Multi-User Communications: State-of-the-Art Techniques and Open Issues

IEEE Access ◽

10.1109/access.2021.3107316 ◽

2021 ◽

Vol 9 ◽

pp. 118584-118605

Author(s):

Munyaradzi Munochiveyi ◽

Arjun Chakravarthi Pogaku ◽

Dinh-Thuan Do ◽

Anh-Tu Le ◽

Miroslav Voznak ◽

...

Keyword(s):

State Of The Art ◽

Open Issues ◽

Art Techniques

Download Full-text

Design of True Random Number Circuit with Controllable Frequency

Electronics ◽

10.3390/electronics10131517 ◽

2021 ◽

Vol 10 (13) ◽

pp. 1517

Author(s):

Xinsheng Wang ◽

Xiyue Wang

Keyword(s):

Random Number ◽

Noise Source ◽

State Of The Art ◽

Building Blocks ◽

Encryption Algorithm ◽

Random Numbers ◽

Random Number Generators ◽

Random Telegraph Noise ◽

Telegraph Noise ◽

Novel Method

True random number generators (TRNGs) have been a research hotspot due to secure encryption algorithm requirements. Therefore, such circuits are necessary building blocks in state-of-the-art security controllers. In this paper, a TRNG based on random telegraph noise (RTN) with a controllable rate is proposed. A novel method of noise array circuits is presented, which consists of digital decoder circuits and RTN noise circuits. The frequency of generating random numbers is controlled by the speed of selecting different gating signals. The results of simulation show that the array circuits consist of 64 noise source circuits that can generate random numbers by a frequency from 1 kHz to 16 kHz.

Download Full-text