The transformer earthquake alerting model: A new versatile approach to earthquake early warning

Author(s):  
Jannes Münchmeyer ◽  
Dino Bindi ◽  
Ulf Leser ◽  
Frederik Tilmann

Summary Earthquakes are major hazards to humans, buildings and infrastructure. Early warning methods aim to provide advance notice of incoming strong shaking to enable preventive action and mitigate seismic risk. Their usefulness depends on accuracy (the relation between true, missed and false alerts) and timeliness (the time between a warning and the arrival of strong shaking). Current approaches suffer from apparent aleatoric uncertainties due to simplified modelling or short warning times. Here we propose a novel early warning method, the deep-learning-based transformer earthquake alerting model (TEAM), to mitigate these limitations. TEAM analyzes raw strong-motion waveforms from an arbitrary number of stations at arbitrary locations in real time, making it easily adaptable to changing seismic networks and warning targets. We evaluate TEAM on two regions with high seismic hazard, Japan and Italy, that are complementary in their seismicity. On both datasets, TEAM outperforms existing early warning methods considerably, offering accurate and timely warnings. Using domain adaptation, TEAM even provides reliable alerts for events larger than any in the training data, a property of the highest importance, as records from very large events are rare in many regions.
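
To make the variable-station design concrete, below is a minimal sketch of how a TEAM-like model could ingest an arbitrary number of stations (hypothetical names and architecture choices, not the authors' implementation): each station's raw strong-motion waveform is embedded by a small CNN, a transformer encoder combines the station embeddings using a padding mask so the station count can vary, and a pooled representation is mapped to a shaking estimate.

```python
# Sketch of a TEAM-like multi-station model (hypothetical, simplified).
import torch
import torch.nn as nn

class WaveformEncoder(nn.Module):
    def __init__(self, d_model=128):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv1d(3, 32, kernel_size=15, stride=4), nn.ReLU(),
            nn.Conv1d(32, 64, kernel_size=15, stride=4), nn.ReLU(),
            nn.Conv1d(64, d_model, kernel_size=15, stride=4), nn.ReLU(),
            nn.AdaptiveAvgPool1d(1),
        )

    def forward(self, x):                      # x: (batch*stations, 3, samples)
        return self.conv(x).squeeze(-1)        # -> (batch*stations, d_model)

class MultiStationAlertModel(nn.Module):
    def __init__(self, d_model=128, nhead=8, nlayers=4):
        super().__init__()
        self.encoder = WaveformEncoder(d_model)
        layer = nn.TransformerEncoderLayer(d_model, nhead, batch_first=True)
        self.transformer = nn.TransformerEncoder(layer, nlayers)
        self.head = nn.Linear(d_model, 1)      # e.g. predicted shaking level

    def forward(self, waveforms, coords, mask):
        # waveforms: (B, S, 3, T); coords: (B, S, d_model) assumed positional
        # embedding of station locations; mask: (B, S), True for padded slots.
        B, S = waveforms.shape[:2]
        emb = self.encoder(waveforms.flatten(0, 1)).view(B, S, -1) + coords
        out = self.transformer(emb, src_key_padding_mask=mask)
        out = out.masked_fill(mask.unsqueeze(-1), 0.0)
        pooled = out.sum(1) / (~mask).sum(1, keepdim=True).clamp(min=1)
        return self.head(pooled)
```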

Author(s):  
Hervé Goëau ◽  
Pierre Bonnet ◽  
Alexis Joly

Automated plant identification has recently improved significantly due to advances in deep learning and the availability of large amounts of field photos. As an illustration, the classification accuracy on 10K species measured in the LifeCLEF challenge (Goëau et al. 2018) reached 90%, very close to that of human experts. However, the profusion of field images only concerns a few tens of thousands of species, mainly located in North America and Western Europe. Conversely, the regions richest in biodiversity, such as tropical countries, suffer from a shortage of training data (Pitman 2021). Consequently, the identification performance of the most advanced models on the flora of these regions is much lower (Goëau et al. 2019). Nevertheless, for several centuries, botanists have systematically collected, catalogued, and stored plant specimens in herbaria. Considerable recent efforts by the biodiversity informatics community, such as DiSSCo (Addink et al. 2018) and iDigBio (Matsunaga et al. 2013), have made millions of digitized specimens from these collections available online. A key question is therefore whether these digitized specimens could be used to improve identification performance for species for which we have very few (if any) photos. However, this is a very difficult problem from a machine learning point of view. The visual appearance of a herbarium specimen is very different from that of a field photograph because the specimens are dried and pressed on a herbarium sheet before being digitized (Fig. 1). To advance research on this topic, we built a large dataset that we shared as one of the challenges of the LifeCLEF 2020 (Goëau et al. 2020) and 2021 (Goëau et al. 2021) evaluation campaigns. It includes more than 320K herbarium specimens collected mostly from the Guiana Shield and the Northern Amazon Rainforest, focusing on about 1K plant species of the French Guiana flora. A valuable asset of this collection is that some of the specimens are accompanied by a few photos of the same specimen, allowing for more precise machine learning. In addition to this training data, we also built a test set for model evaluation, composed of 3,186 field photos collected by two of the best experts on the Guianese flora. Based on this dataset, about ten research teams have developed deep learning methods to address the challenge (including the authors of this abstract as the organizing team). A detailed description of these methods can be found in the technical notes written by the participating teams (Goëau et al. 2020, Goëau et al. 2021). The methods can be divided into two categories: (i) those based on classical convolutional neural networks (CNNs) trained simply by mixing digitized specimens and photos, and (ii) those based on advanced domain adaptation techniques aiming to learn a joint representation space between field and herbarium images. The domain adaptation methods themselves were of two types: adversarial regularization (Motiian et al. 2017), which forces herbarium specimens and photos to have the same representations, and metric learning, which maximizes inter-species distances and minimizes intra-species distances in the representation space.

In Table 1, we report the results achieved by the different methods evaluated during the 2020 edition of the challenge. The evaluation metric used is the mean reciprocal rank (MRR), i.e., the average of the inverse of the rank of the correct species in the list of predicted species. In addition to this main score, a second MRR score is computed on a subset of the test set composed of the most difficult species, i.e., the ones that are the least frequently photographed in the field. The main outcomes we can derive from these results are the following. (1) Classical deep learning models fail to identify plant photos from digitized herbarium specimens: the best classical CNN trained on the provided data resulted in a very low MRR score (0.011), and even with the use of additional training data (e.g., photos and digitized herbarium sheets from GBIF) the MRR score remains very low (0.039). (2) Domain adaptation methods provide a significant improvement, but the task remains challenging: the best MRR score (0.180) was achieved using adversarial regularization (FSDA, Motiian et al. 2017). This is much better than the classical CNN models, but there is still a lot of progress to be made to reach the performance of a truly functional identification system (the MRR score on classical plant identification tasks can be up to 0.9). (3) No method fits all: as shown in Table 1, the metric learning method has a significantly better MRR score on the most difficult species (0.107), but its performance on the species with more photos is much lower than that of the adversarial technique. In 2021, the challenge was run again with additional information provided to train the models, i.e., species traits (plant life form, woodiness and plant growth form). The use of the species traits allowed a slight performance improvement of the best adversarial adaptation method (MRR equal to 0.198). In conclusion, the results of the experiments conducted are promising and demonstrate the potential of digitized herbarium data for automated plant identification. However, progress is still needed before integrating this type of approach into production applications.
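
To illustrate the adversarial regularization strategy summarized above, here is a minimal DANN-style sketch (hypothetical class names, gradient reversal layer; this is not the code of any participating team): a domain discriminator tries to tell herbarium sheets from field photos, while the reversed gradient pushes the shared backbone towards domain-invariant features.

```python
# Sketch of adversarial regularization between herbarium and photo domains
# (hypothetical, simplified; assumes a shared CNN backbone).
import torch
import torch.nn as nn
from torch.autograd import Function

class GradReverse(Function):
    @staticmethod
    def forward(ctx, x, lam):
        ctx.lam = lam
        return x.view_as(x)

    @staticmethod
    def backward(ctx, grad_output):
        # Reverse the gradient flowing back into the feature extractor.
        return -ctx.lam * grad_output, None

class DomainAdversarialClassifier(nn.Module):
    def __init__(self, backbone, feat_dim, n_species):
        super().__init__()
        self.backbone = backbone                     # shared feature extractor
        self.species_head = nn.Linear(feat_dim, n_species)
        self.domain_head = nn.Sequential(
            nn.Linear(feat_dim, 256), nn.ReLU(), nn.Linear(256, 2))

    def forward(self, x, lam=1.0):
        feats = self.backbone(x)
        species_logits = self.species_head(feats)
        domain_logits = self.domain_head(GradReverse.apply(feats, lam))
        return species_logits, domain_logits

# Training step (sketch): cross-entropy on species labels plus cross-entropy on
# domain labels (herbarium vs photo); the reversed gradient makes the backbone
# learn features that the domain discriminator cannot separate.
# loss = ce(species_logits, species_labels) + ce(domain_logits, domain_labels)
```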


Author(s):  
Guokai Liu ◽  
Liang Gao ◽  
Weiming Shen ◽  
Andrew Kusiak

Abstract Condition monitoring and fault diagnosis are of great interest to the manufacturing industry. Deep learning algorithms have shown promising results in equipment prognostics and health management. However, their success has been hindered by excessive training time. In addition, deep learning algorithms face the domain adaptation dilemma encountered in dynamic application environments. The emerging concept of broad learning addresses both the training time and the domain adaptation issue. In this paper, a broad transfer learning algorithm is proposed for the classification of bearing faults. Data of the same sampling frequency are used to construct one- and two-dimensional training datasets to analyze the performance of the broad transfer and deep learning algorithms. A broad learning algorithm contains two main layers: an augmented feature layer and a classification layer. A broad learning algorithm with a sparse auto-encoder is employed to extract features. The optimal solution of a redefined cost function, obtained with as few as ten samples per class from the target domain, gives the broad learning classifier its domain adaptation capability. The effectiveness of the proposed algorithm has been demonstrated on a benchmark dataset. Computational experiments demonstrated the superior efficiency and accuracy of the proposed algorithm over the deep learning algorithms tested.
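
As a rough illustration of the broad learning structure described above (a simplified sketch with assumed shapes; random projections stand in for the sparse auto-encoder used in the paper), the augmented feature layer is the concatenation of mapped feature nodes and enhancement nodes, and the classification layer is obtained in closed form by ridge regression, which is what keeps training time low.

```python
# Minimal broad learning classifier sketch (hypothetical, simplified).
import numpy as np

def broad_learning_fit(X, Y, n_feat=100, n_enh=200, lam=1e-3, seed=0):
    """X: (n, d) inputs; Y: (n, n_classes) one-hot labels."""
    rng = np.random.default_rng(seed)
    Wf = rng.standard_normal((X.shape[1], n_feat))   # feature-node weights
    Z = np.tanh(X @ Wf)                              # mapped feature nodes
    We = rng.standard_normal((n_feat, n_enh))        # enhancement weights
    H = np.tanh(Z @ We)                              # enhancement nodes
    A = np.hstack([Z, H])                            # augmented feature layer
    # Closed-form (ridge regression) solution for the output weights.
    W_out = np.linalg.solve(A.T @ A + lam * np.eye(A.shape[1]), A.T @ Y)
    return Wf, We, W_out

def broad_learning_predict(X, Wf, We, W_out):
    Z = np.tanh(X @ Wf)
    A = np.hstack([Z, np.tanh(Z @ We)])
    return A @ W_out                                 # class scores
```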


2020 ◽  
Vol 29 (01) ◽  
pp. 129-138 ◽  
Author(s):  
Anirudh Choudhary ◽  
Li Tong ◽  
Yuanda Zhu ◽  
May D. Wang

Introduction: There has been a rapid development of deep learning (DL) models for medical imaging. However, DL requires a large labeled dataset for training the models. Obtaining large-scale labeled data remains a challenge, and multi-center datasets suffer from heterogeneity due to patient diversity and varying imaging protocols. Domain adaptation (DA) has been developed to transfer knowledge from a labeled data domain to a related but unlabeled domain in either image space or feature space. DA is a type of transfer learning (TL) that can improve the performance of models when applied to multiple different datasets. Objective: In this survey, we review the state-of-the-art DL-based DA methods for medical imaging. We aim to summarize recent advances, highlighting the motivation, challenges, and opportunities, and to discuss promising directions for future work in DA for medical imaging. Methods: We surveyed peer-reviewed publications from leading biomedical journals and conferences between 2017 and 2020 that reported the use of DA in medical imaging applications, grouping them by methodology, image modality, and learning scenario. Results: We mainly focused on pathology and radiology as application areas. Among various DA approaches, we discussed domain transformation (DT) and latent feature-space transformation (LFST). We highlighted the role of unsupervised DA in image segmentation and described opportunities for future development. Conclusion: DA has emerged as a promising solution to deal with the lack of annotated training data. Using adversarial techniques, unsupervised DA has achieved good performance, especially for segmentation tasks. Opportunities include domain transferability, multi-modal DA, and applications that benefit from synthetic data.


2020 ◽  
Vol 91 (6) ◽  
pp. 3323-3333
Author(s):  
Stefano Parolai ◽  
Luca Moratto ◽  
Michele Bertoni ◽  
Chiara Scaini ◽  
Alessandro Rebez

Abstract In May 1976, a devastating earthquake of magnitude Ms 6.5 occurred in Friuli, Italy, resulting in 976 deaths, 2000 injured, and 60,000 homeless. It is notable that, at the time of the earthquake, only one station was installed in the affected region. The resulting lack of information, combined with a dearth of mitigation planning for responding to such events, meant that a clear picture of the impact of the disaster became available only after a few days. This region is now covered by nearly 100 seismological and strong-motion stations operating in real time. Furthermore, 30 average-cost strong-motion stations have recently been added, with the goals of improving the density of real-time ground-motion observations and measuring the level of shaking recorded at selected buildings. The final goal is to allow rapid impact estimations to be made to improve the response of civil protection authorities. Today, considering the higher-density seismological network, new efforts in terms of the implementation and testing of earthquake early warning systems as a possible tool for mitigating seismic risk are certainly worthwhile. In this article, we show the results obtained by analyzing, in playback mode and using an algorithm for decentralized on-site earthquake early warning, broadband synthetic strong-motion data calculated at 18 of the stations installed in the region for the magnitude and location of the 1976 Friuli earthquake. The analysis shows that the anisotropy of the lead times is related not only to the finite nature of the source but also to the slip distribution. A reduction of 10% in the number of injured persons appears to be possible if appropriate mitigating actions are employed, such as the development of efficient automatic procedures that improve the safety of strategic industrial facilities.
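
As a back-of-the-envelope illustration of the lead-time concept discussed above (illustrative velocities and delays, not the authors' decentralized algorithm), the warning time available at a site can be approximated as the S-wave travel time minus the P-wave travel time and the processing delay:

```python
# Hypothetical lead-time estimate: warning time at a site is roughly the S-wave
# arrival minus the P-wave detection time and a processing/dissemination delay.
# Wave velocities and the delay below are illustrative assumptions.
def lead_time_s(hypocentral_distance_km: float,
                vp_km_s: float = 6.0,         # assumed P-wave velocity
                vs_km_s: float = 3.5,         # assumed S-wave velocity
                processing_delay_s: float = 2.0) -> float:
    t_p = hypocentral_distance_km / vp_km_s   # P-wave arrival (detection trigger)
    t_s = hypocentral_distance_km / vs_km_s   # S-wave arrival (strong shaking)
    return max(0.0, t_s - t_p - processing_delay_s)

if __name__ == "__main__":
    for d in (10, 30, 60, 100):
        print(f"{d:>4} km -> lead time ~ {lead_time_s(d):.1f} s")
```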


2020 ◽  
pp. 666-679 ◽  
Author(s):  
Xuhong Zhang ◽  
Toby C. Cornish ◽  
Lin Yang ◽  
Tellen D. Bennett ◽  
Debashis Ghosh ◽  
...  

PURPOSE We focus on the problem of scarcity of annotated training data for nucleus recognition in Ki-67 immunohistochemistry (IHC)–stained pancreatic neuroendocrine tumor (NET) images. We hypothesize that deep learning–based domain adaptation is helpful for nucleus recognition when image annotations are unavailable in target data sets. METHODS We considered 2 different institutional pancreatic NET data sets: one (ie, source) containing 38 cases with 114 annotated images and the other (ie, target) containing 72 cases with 20 annotated images. The gold standards were manually annotated by 1 pathologist. We developed a novel deep learning–based domain adaptation framework to count different types of nuclei (ie, immunopositive tumor, immunonegative tumor, and nontumor nuclei). We compared the proposed method with several recent fully supervised deep learning models, such as fully convolutional network-8s (FCN-8s), U-Net, fully convolutional regression networks A and B (FCRN-A, FCRN-B), and the fully residual convolutional network (FRCN). We also evaluated the proposed method by learning with a mixture of converted source images and real target annotations. RESULTS Our method achieved F1 scores of 81.3% and 62.3% for nucleus detection and classification in the target data set, respectively. Our method outperformed FCN-8s (53.6% and 43.6% for nucleus detection and classification, respectively), U-Net (61.1% and 47.6%), FCRN-A (63.4% and 55.8%), and FCRN-B (68.2% and 60.6%) in terms of F1 score and was competitive with FRCN (81.7% and 70.7%). In addition, learning with a mixture of converted source images and only a small set of real target labels could further boost the performance. CONCLUSION This study demonstrates that deep learning–based domain adaptation is helpful for nucleus recognition in Ki-67 IHC–stained images when target data annotations are not available. It would improve the applicability of deep learning models designed for downstream supervised learning tasks on different data sets.
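
For readers unfamiliar with how a detection F1 score of the kind reported above is computed, the sketch below shows one common (simplified, greedy) matching scheme; the matching radius and function names are assumptions, not the evaluation code used in this study.

```python
# Sketch of a detection F1 computation for nucleus counting (hypothetical):
# predicted nucleus centres are greedily matched to ground-truth centres within
# a pixel radius; unmatched predictions count as false positives and unmatched
# ground-truth nuclei as false negatives.
import numpy as np

def detection_f1(pred_xy, gt_xy, radius_px=15.0):
    pred_xy, gt_xy = np.asarray(pred_xy, float), np.asarray(gt_xy, float)
    matched_gt, tp = set(), 0
    for p in pred_xy:
        if gt_xy.size == 0:
            break
        d = np.linalg.norm(gt_xy - p, axis=1)
        for j in np.argsort(d):
            if d[j] > radius_px:
                break
            if j not in matched_gt:
                matched_gt.add(j)
                tp += 1
                break
    fp = len(pred_xy) - tp
    fn = len(gt_xy) - tp
    precision = tp / max(tp + fp, 1)
    recall = tp / max(tp + fn, 1)
    return 2 * precision * recall / max(precision + recall, 1e-9)
```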


2020 ◽  
Vol 110 (2) ◽  
pp. 920-936 ◽  
Author(s):  
Jiawei Li ◽  
Maren Böse ◽  
Max Wyss ◽  
David J. Wald ◽  
Alexandra Hutchison ◽  
...  

ABSTRACT Large earthquakes, such as the 2008 Mw 7.9 Wenchuan, Sichuan, China, earthquake, provide an opportunity for earthquake early warning (EEW), as many heavily shaken areas are far (∼50 km) from the epicenter and warning times could be sufficient (≥5 s) to take preventive action. On the other hand, earthquakes with magnitudes larger than ∼M 6.5 are challenging for EEW because source dimensions need to be defined to adequately estimate shaking. The finite-fault rupture detector (FinDer) is an approach to identify fault rupture extents from real-time seismic records. In this study, we play back local and regional on-scale strong-motion waveforms of the 2008 Mw 7.9 Wenchuan, 2013 Mw 6.6 Lushan, and 2017 Mw 6.5 Jiuzhaigou earthquakes to study the performance of FinDer for the current layout of the China Strong Motion Network. Overall, the FinDer line-source models agree well with the observed spatial distribution of aftershocks and with models determined from waveform inversion. However, because FinDer models are constructed to characterize seismic ground motions (as needed for EEW) rather than source parameters, the rupture length can be overestimated for events radiating high levels of high-frequency motions. If the strong-motion data used had been available in real time, 50%–80% of sites experiencing modified Mercalli intensity (MMI) IV–VII (light to very strong) and 30% experiencing VIII–IX (severe to violent) could have been issued warnings 10 and 5 s, respectively, before the arrival of the S wave. We also show that loss estimates based on the FinDer line source are more accurate than those based on point-source models. For the Wenchuan earthquake, for example, they predict a four to six times larger number of fatalities and injuries, which is consistent with official reports. These losses could be provided 0.5–3 hr faster than if they were based on more complex inversion rupture models.
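
To illustrate geometrically why a line-source model changes shaking and loss estimates (an illustrative sketch of the distance metric only, not FinDer itself), compare the epicentral distance with the closest distance to a finite rupture segment for a site near the far end of a long rupture:

```python
# Point-source (epicentral) distance vs closest distance to a finite line
# source, in map view (km). Rupture length and site location are illustrative.
import numpy as np

def point_distance(site, epicenter):
    return float(np.linalg.norm(np.asarray(site, float) - np.asarray(epicenter, float)))

def line_distance(site, end_a, end_b):
    # Closest distance from the site to the segment end_a--end_b.
    p, a, b = (np.asarray(v, float) for v in (site, end_a, end_b))
    ab = b - a
    t = np.clip(np.dot(p - a, ab) / np.dot(ab, ab), 0.0, 1.0)
    return float(np.linalg.norm(p - (a + t * ab)))

if __name__ == "__main__":
    epicenter, far_end = (0.0, 0.0), (250.0, 0.0)   # illustrative ~250 km rupture
    site = (240.0, 20.0)                            # site near the rupture's far end
    print(point_distance(site, epicenter))          # ~240.8 km (point source)
    print(line_distance(site, epicenter, far_end))  # ~20.0 km (line source)
```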


Electronics ◽  
2020 ◽  
Vol 9 (12) ◽  
pp. 2140
Author(s):  
Hyo Ryun Lee ◽  
Jihun Park ◽  
Young-Joo Suh

With the recent development of small, high-resolution radars, various human–computer interaction (HCI) applications using them have been developed. In particular, methods that apply hand gesture recognition based on short-range radar to electronic devices are being actively studied. In general, the time-delay and Doppler-shift characteristics of the transmitted signal reflected off an object are classified through deep learning to recognize the motion. However, the main obstacle to the commercialization of radar-based hand gesture recognition is that, even for the same type of hand gesture, recognition accuracy is degraded by slight differences in movement between individual users. To solve this problem, in this paper, domain adaptation is applied to hand gesture recognition to minimize the differences in users' gesture information between the training stage and the use stage. To verify the effectiveness of domain adaptation, a domain discriminator that deceives the classifier was applied to a deep learning network with a convolutional neural network (CNN) structure. Data for seven different hand gestures were collected from 10 participants and used for training, and the hand gestures of 10 users not included in the training data were used as input, confirming an average recognition accuracy of 98.8%.
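
As background for the time-delay and Doppler-shift features mentioned above, here is a generic range-Doppler processing sketch for one FMCW radar frame (illustrative parameters and function names, not the paper's processing chain):

```python
# Generic range-Doppler map for an FMCW radar frame: the FFT over fast time
# resolves range (time delay) and the FFT over slow time resolves Doppler
# (radial velocity of the hand). Parameters are illustrative.
import numpy as np

def range_doppler_map(frame: np.ndarray) -> np.ndarray:
    """frame: complex baseband samples, shape (n_chirps, n_samples_per_chirp)."""
    window = np.hanning(frame.shape[1])
    range_fft = np.fft.fft(frame * window, axis=1)        # fast time -> range bins
    doppler_fft = np.fft.fftshift(                        # slow time -> Doppler bins
        np.fft.fft(range_fft, axis=0), axes=0)
    return 20 * np.log10(np.abs(doppler_fft) + 1e-12)     # magnitude in dB

# Example: a 64-chirp x 128-sample frame yields a 64 x 128 map that can be fed
# to the CNN classifier as a single-channel image.
rd = range_doppler_map(np.random.randn(64, 128) + 1j * np.random.randn(64, 128))
```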


Author(s):  
Greg Smith ◽  
Masayoshi Shibatani

In recent years, various intelligent machine learning and deep learning algorithms have been developed and widely applied to gearbox fault detection and diagnosis. However, the real-time application of these algorithms has been limited, mainly because a model developed using data from one machine or one operating condition suffers serious diagnostic performance degradation when applied to another machine, or to the same machine under a different operating condition. The reason for this poor generalization is the distribution discrepancy between the training and testing data. This paper addresses the issue using a deep-learning-based cross-domain adaptation approach for gearbox fault diagnosis. Labelled data from the training dataset and unlabelled data from the testing dataset are used to achieve the cross-domain adaptation task. A deep convolutional neural network (CNN) is used as the main architecture. Maximum mean discrepancy (MMD) is used as a measure to minimize the distribution distance between the labelled training data and the unlabelled testing data. The discrepancy between the two domains is reduced in multiple layers of the designed CNN so that the representations learned from the training data transfer to the testing data. The proposed approach is evaluated using experimental data from a gearbox under significant speed variation and multiple health conditions. Benchmarking against both traditional machine learning methods and other domain adaptation methods demonstrates the superiority of the proposed method.
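
The sketch below shows a single-kernel maximum mean discrepancy term of the kind described above (a simplified, biased estimate with an assumed RBF bandwidth; not the paper's exact multi-layer formulation), which would be added to the supervised loss on labelled source data:

```python
# Single-RBF-kernel MMD between source and target feature batches (sketch).
import torch

def mmd_rbf(source: torch.Tensor, target: torch.Tensor, sigma: float = 1.0):
    """source: (n, d) features from labelled data; target: (m, d) unlabelled."""
    def kernel(a, b):
        d2 = torch.cdist(a, b).pow(2)
        return torch.exp(-d2 / (2 * sigma ** 2))
    # Biased estimate: mean intra-source + mean intra-target - 2 * cross terms.
    return kernel(source, source).mean() \
         + kernel(target, target).mean() \
         - 2 * kernel(source, target).mean()

# total_loss = classification_loss + lambda_mmd * mmd_rbf(feat_src, feat_tgt)
```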


2019 ◽  
Vol 35 (14) ◽  
pp. i260-i268 ◽  
Author(s):  
Ruogu Lin ◽  
Xiangrui Zeng ◽  
Kris Kitani ◽  
Min Xu

Abstract Motivation Since 2017, an increasing amount of attention has been paid to supervised, deep-learning-based in situ structural classification of macromolecules (i.e., subtomogram classification) in cellular electron cryo-tomography (CECT), owing to the substantially higher scalability of deep learning. However, the success of such a supervised approach relies heavily on the availability of large amounts of labeled training data. For CECT, creating valid training data from the same data source as the prediction data is usually laborious and computationally intensive. It would be beneficial to have training data from a separate data source where annotation is readily available or can be performed in a high-throughput fashion. However, cross-data-source prediction is often biased due to different image intensity distributions (a.k.a. domain shift). Results We adapt a deep-learning-based adversarial domain adaptation (3D-ADA) method to address the domain shift problem in CECT data analysis in a timely manner. 3D-ADA first uses a source domain feature extractor to extract discriminative features from the training data as the input to a classifier. It then adversarially trains a target domain feature extractor to reduce the distribution differences of the extracted features between training and prediction data. As a result, the same classifier can be directly applied to the prediction data. We tested 3D-ADA on both experimental and realistically simulated subtomogram datasets under different imaging conditions. 3D-ADA consistently improved cross-data-source prediction and outperformed two popular domain adaptation methods. Furthermore, we demonstrate that 3D-ADA can improve the cross-data-source recovery of novel macromolecular structures. Availability and implementation https://github.com/xulabs/projects Supplementary information Supplementary data are available at Bioinformatics online.
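
The following is a simplified sketch of the adversarial adaptation loop described above (hypothetical function and variable names, not the released 3D-ADA code): the target feature extractor is trained so that a discriminator cannot distinguish its features from those of the frozen source extractor, after which the source-trained classifier is reused on prediction data.

```python
# Simplified ADDA-style adaptation loop (hypothetical): src_loader yields
# (subtomogram, label) pairs, tgt_loader yields unlabeled subtomograms;
# src_enc and the classifier are assumed to be pre-trained and frozen.
import torch
import torch.nn as nn

def adapt_target_encoder(src_enc, tgt_enc, disc, src_loader, tgt_loader, lr=1e-4):
    bce = nn.BCEWithLogitsLoss()
    opt_t = torch.optim.Adam(tgt_enc.parameters(), lr=lr)
    opt_d = torch.optim.Adam(disc.parameters(), lr=lr)
    for (xs, _), xt in zip(src_loader, tgt_loader):   # labels unused here
        fs = src_enc(xs).detach()                     # frozen source features
        ft = tgt_enc(xt)

        # 1) Discriminator: label source features 1, target features 0.
        d_loss = bce(disc(fs), torch.ones(len(fs), 1)) + \
                 bce(disc(ft.detach()), torch.zeros(len(ft), 1))
        opt_d.zero_grad(); d_loss.backward(); opt_d.step()

        # 2) Target encoder: fool the discriminator so that target features
        #    become indistinguishable from source features.
        g_loss = bce(disc(ft), torch.ones(len(ft), 1))
        opt_t.zero_grad(); g_loss.backward(); opt_t.step()
# After adaptation, the source-trained classifier is applied to tgt_enc outputs.
```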


2020 ◽  
Vol 12 (3) ◽  
pp. 575 ◽  
Author(s):  
Yohei Koga ◽  
Hiroyuki Miyazaki ◽  
Ryosuke Shibasaki

Recently, object detectors based on deep learning have become widely used for vehicle detection and have contributed to drastic improvements in performance. However, deep learning requires a large amount of training data, and detection performance degrades notably when the target area of vehicle detection (the target domain) differs from the training data (the source domain). To address this problem, we propose an unsupervised domain adaptation (DA) method that does not require labeled training data and thus can maintain detection performance in the target domain at a low cost. We applied correlation alignment (CORAL) DA and adversarial DA to our region-based vehicle detector and improved the detection accuracy by over 10% in the target domain. We further improved adversarial DA by utilizing a reconstruction loss to facilitate the learning of semantic features. Our proposed method achieved slightly better accuracy than that obtained by training with the labeled data of the target domain. We demonstrated that our improved DA method could achieve, at a lower cost, almost the same level of accuracy as non-DA methods trained with a sufficient amount of labeled target-domain data.
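
For reference, the CORAL loss mentioned above can be written as the squared Frobenius distance between the source and target feature covariances; the sketch below is a generic formulation (assumed feature shapes), not the authors' detector code.

```python
# CORAL (correlation alignment) loss between source and target feature batches.
import torch

def coral_loss(source: torch.Tensor, target: torch.Tensor) -> torch.Tensor:
    """source: (n, d) features; target: (m, d) features."""
    def covariance(x):
        x = x - x.mean(dim=0, keepdim=True)
        return x.t() @ x / (x.shape[0] - 1)
    d = source.shape[1]
    return ((covariance(source) - covariance(target)) ** 2).sum() / (4 * d * d)

# total_loss = detection_loss + lambda_coral * coral_loss(feat_src, feat_tgt)
```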

