A Two-Stage Multi-Feature Integration Approach to Unsupervised Speaker Change Detection in Real-Time News Broadcasting

Object detection using remote sensing data is a key task of the perception systems of self-driving vehicles. While many generic deep learning architectures have been proposed for this problem, there is little guidance on their suitability when using them in a particular scenario such as autonomous driving. In this work, we aim to assess the performance of existing 2D detection systems on a multi-class problem (vehicles, pedestrians, and cyclists) with images obtained from the on-board camera sensors of a car. We evaluate several one-stage (RetinaNet, FCOS, and YOLOv3) and two-stage (Faster R-CNN) deep learning meta-architectures under different image resolutions and feature extractors (ResNet, ResNeXt, Res2Net, DarkNet, and MobileNet). These models are trained using transfer learning and compared in terms of both precision and efficiency, with special attention to the real-time requirements of this context. For the experimental study, we use the Waymo Open Dataset, which is the largest existing benchmark. Despite the rising popularity of one-stage detectors, our findings show that two-stage detectors still provide the most robust performance. Faster R-CNN models outperform one-stage detectors in accuracy, being also more reliable in the detection of minority classes. Faster R-CNN Res2Net-101 achieves the best speed/accuracy tradeoff but needs lower resolution images to reach real-time speed. Furthermore, the anchor-free FCOS detector is a slightly faster alternative to RetinaNet, with similar precision and lower memory usage.

Download Full-text

Real-time Two-Stage SPECK (TSSP) design and implementation for scalable video coding on embedded systems

2011 Visual Communications and Image Processing (VCIP) ◽

10.1109/vcip.2011.6115920 ◽

2011 ◽

Author(s):

Marijn J.H. Loomans ◽

Cornelis J. Koeleman ◽

Peter H.N. de With

Keyword(s):

Embedded Systems ◽

Video Coding ◽

Real Time ◽

Scalable Video Coding ◽

Scalable Video ◽

Two Stage ◽

Design And Implementation

Download Full-text

Patch-Based Change Detection Method for SAR Images with Label Updating Strategy

Remote Sensing ◽

10.3390/rs13071236 ◽

2021 ◽

Vol 13 (7) ◽

pp. 1236

Author(s):

Yuanjun Shu ◽

Wei Li ◽

Menglong Yang ◽

Peng Cheng ◽

Songchen Han

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Change Detection ◽

Detection Method ◽

Training Data ◽

Synthetic Aperture ◽

Learning Method ◽

Sar Images ◽

Two Stage ◽

Second Stage

Convolutional neural networks (CNNs) have been widely used in change detection of synthetic aperture radar (SAR) images and have been proven to have better precision than traditional methods. A two-stage patch-based deep learning method with a label updating strategy is proposed in this paper. The initial label and mask are generated at the pre-classification stage. Then a two-stage updating strategy is applied to gradually recover changed areas. At the first stage, diversity of training data is gradually restored. The output of the designed CNN network is further processed to generate a new label and a new mask for the following learning iteration. As the diversity of data is ensured after the first stage, pixels within uncertain areas can be easily classified at the second stage. Experiment results on several representative datasets show the effectiveness of our proposed method compared with several existing competitive methods.

Download Full-text

A two-stage analogue model for real-time urban flood forecasting

10.5194/egusphere-egu21-15645 ◽

2021 ◽

Author(s):

Chris Onof ◽

Yuting Chen ◽

Li-Pen Wang ◽

Amy Jones ◽

Susana Ochoa Rodriguez

Keyword(s):

Real Time ◽

Flood Forecasting ◽

Image Feature ◽

Two Stage ◽

Analogue Model ◽

Flood Prediction ◽

Urban Flood ◽

Radar Images ◽

Proposed Model ◽

Flood Maps

In this work a two-stage (rainfall nowcasting + flood prediction) analogue model for real-time urban flood forecasting is presented. The proposed approach accounts for the complexities of urban rainfall nowcasting while avoiding the expensive computational requirements of real-time urban flood forecasting.The model has two consecutive stages:<ul><li>(1) Rainfall nowcasting: 0-6h lead time ensemble rainfall nowcasting is achieved by means of an analogue method, based on the assumption that similar climate condition will define similar patterns of temporal evolution of the rainfall. The framework uses the NORA analogue-based forecasting tool (Panziera et al., 2011), consisting of two layers. In the first layer, the 120 historical atmospheric (forcing) conditions most similar to the current atmospheric conditions are extracted, with the historical database consisting of ERA5 reanalysis data from the ECMWF and the current conditions derived from the US Global Forecasting System (GFS). In the second layer, twelve historical radar images most similar to the current one are extracted from amongst the historical radar images linked to the aforementioned 120 forcing analogues. Lastly, for each of the twelve analogues, the rainfall fields (at resolution of 1km/5min) observed after the present time are taken as one ensemble member. Note that principal component analysis (PCA) and uncorrelated multilinear PCA methods were tested for image feature extraction prior to applying the nearest neighbour technique for analogue selection.</li> <li>(2) Flood prediction: we predict flood extent using the high-resolution rainfall forecast from Stage 1, along with a database of pre-run flood maps at 1x1 km2 solution from 157 catalogued historical flood events. A deterministic flood prediction is obtained by using the averaged response from the twelve flood maps associated to the twelve ensemble rainfall nowcasts, where for each gridded area the median value is adopted (assuming flood maps are equiprobabilistic). A probabilistic flood prediction is obtained by generating a quantile-based flood map. Note that the flood maps were generated through rolling ball-based mapping of the flood volumes predicted at each node of the InfoWorks ICM sewer model of the pilot area.</li> </ul>The Minworth catchment in the UK (~400 km2) was used to demonstrate the proposed model. Cross&#8209;assessment was undertaken for each of 157 flooding events by leaving one event out from training in each iteration and using it for evaluation. With a focus on the spatial replication of flood/non-flood patterns, the predicted flood maps were converted to binary (flood/non-flood) maps. Quantitative assessment was undertaken by means of a contingency table. An average accuracy rate (i.e. proportion of correct predictions, out of all test events) of 71.4% was achieved, with individual accuracy rates ranging from 57.1% to 78.6%). Further testing is needed to confirm initial findings and flood mapping refinement will be pursued.The proposed model is fast, easy and relatively inexpensive to operate, making it suitable for direct use by local authorities who often lack the expertise on and/or capabilities for flood modelling and forecasting.References: Panziera et al. 2011. NORA&#8211;Nowcasting of Orographic Rainfall by means of Analogues. Quarterly Journal of the Royal Meteorological Society. 137, 2106-2123.

Download Full-text

A Two-Stage Multi-Feature Integration Approach to Unsupervised Speaker Change Detection in Real-Time News Broadcasting

Speaker change detection and tracking in real-time news broadcasting analysis

Real-time unsupervised speaker change detection

Speaker Change Detection for Conversational Speech using Synthesized Voice Embedding

A real-time two-stage and dual-check template matching algorithm based on normalized cross-correlation for industrial vision positioning

A Real-time Driver Fatigue Detection Method Based on Two-Stage Convolutional Neural Network

Data Augmentation and D-vector Representation Methods for Speaker Change Detection

On the Performance of One-Stage and Two-Stage Object Detectors in Autonomous Vehicles Using Camera Data

Real-time Two-Stage SPECK (TSSP) design and implementation for scalable video coding on embedded systems

Patch-Based Change Detection Method for SAR Images with Label Updating Strategy

A two-stage analogue model for real-time urban flood forecasting

Export Citation Format