Direct micro-seismic event location and characterization from passive seismic data using convolutional neural networks

Geophysics ◽  
2021 ◽  
pp. 1-77
Author(s):  
Hanchen Wang ◽  
Tariq Alkhalifah

The ample size of time-lapse data often requires significant event detection and source location efforts, especially in areas like shale gas exploration regions where a large number of micro-seismic events are often recorded. In many cases, the real-time monitoring and locating of these events are essential to production decisions. Conventional methods face considerable drawbacks. For example, traveltime-based methods require traveltime picking of often noisy data, while migration and waveform inversion methods require expensive wavefield solutions and event detection. Both tasks require some human intervention, and this becomes a big problem when too many sources need to be located, which is common in micro-seismic monitoring. Machine learning has recently been used to identify micro-seismic events or locate their sources once they are identified and picked. We propose to use a novel artificial neural network framework to directly map seismic data, without any event picking or detection, to their potential source locations. We train two convolutional neural networks on labeled synthetic acoustic data containing simulated micro-seismic events to fulfill such requirements. One convolutional neural network, which has a global average pooling layer to reduce the computational cost while maintaining high-performance levels, aims to classify the number of events in the data. The other network predicts the source locations and other source features such as the source peak frequencies and amplitudes. To reduce the size of the input data to the network, we correlate the recorded traces with a central reference trace to allow the network to focus on the curvature of the input data near the zero-lag region. We train the networks to handle single, multi, and no event segments extracted from the data. Tests on a simple vertical varying model and a more realistic Otway field model demonstrate the approach's versatility and potential.
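
The correlation-based input reduction described above can be sketched as follows. This is a minimal pure-Python illustration, not the authors' implementation: the function names, the zero-padded correlation, and the ±max_lag window are assumptions.

```python
def crosscorr_at_lag(a, b, lag):
    """Zero-padded cross-correlation of equal-length traces a and b at one lag."""
    n = len(a)
    return sum(a[i] * b[i + lag] for i in range(n) if 0 <= i + lag < n)

def correlate_with_reference(traces, ref_index=None, max_lag=8):
    """Correlate every trace with a central reference trace and keep only
    the samples near zero lag, shrinking the input fed to the network."""
    if ref_index is None:
        ref_index = len(traces) // 2  # central trace as reference
    ref = traces[ref_index]
    return [[crosscorr_at_lag(tr, ref, lag)
             for lag in range(-max_lag, max_lag + 1)]
            for tr in traces]
```

A trace identical to the reference peaks at zero lag, while a time-shifted trace peaks at a nonzero lag, so the near-zero-lag window captures the moveout curvature the network needs.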

2021 ◽  
Author(s):  
Andreas Köhler ◽  
Steffen Mæland

We combine the empirical matched field (EMF) method and machine learning using Convolutional Neural Networks (CNNs) for calving event detection at the IMS station SPITS and GSN station KBS on the Arctic archipelago of Svalbard. EMF detection with seismic arrays seeks to identify all signals similar to a single template generated by seismic events in a confined target region. In contrast to master event cross-correlation detectors, the detection statistic is not the waveform similarity but the array beam power obtained using empirical phase delays (steering parameters) between the array stations. Unlike common delay-and-sum beamforming, the steering parameters do not need to represent a plane wave and are computed directly from the template signal without assuming a particular apparent velocity and back-azimuth. As for all detectors, the false alarm rate depends strongly on the beam power threshold setting and therefore needs appropriate tuning or, alternatively, post-processing. Here, we combine the EMF detector, using a low detection threshold, with a post-detection classification step. The classifier uses spectrograms of single-station three-component records and state-of-the-art CNNs pre-trained for image recognition. Spectrograms of the three components are combined as RGB images. We apply the methodology to detect calving events at tidewater glaciers in the Kongsfjord region in northwestern Svalbard. The EMF detector uses data from the SPITS array, about 100 km from the glaciers, while the CNN classifier processes data from the single three-component station KBS, 15 km away, using time windows in which the event is expected according to the EMF detection. The EMF detector combines templates for the P and S wave onsets of a confirmed, large calving event.
The CNN spectrogram classifier is trained using classes of confirmed calving signals from four different glaciers in the Kongsfjord region, seismic noise examples, and regional tectonic seismic events. By splitting the data into training and test sets, the CNN classifier yields a recognition rate of 89% on average. This is encouragingly high given the complex nature of calving signals and their visually similar waveforms. Subsequently, we process six months of continuous data from 2016 using the EMF-CNN method to produce a time series of glacier calving. About 90% of the confirmed calving signals used for the CNN training are detected by EMF processing, and around 80% are assigned to the correct glacier after CNN classification. Such calving time series allow us to estimate and monitor ice loss at tidewater glaciers, which in turn can help us better understand the impact of climate change in polar regions. Combining the superior detection capability of (less common) seismic arrays at a larger source distance with a powerful machine learning classifier at single three-component stations closer to the source is a promising approach not only for environmental monitoring but also for event detection and classification in a CTBTO verification context.
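
The idea of packing three-component spectrograms into a single RGB image can be sketched as follows. This is a hedged illustration only; the min-max normalization and the function names are assumptions, not the authors' pipeline.

```python
def normalize(spec):
    """Scale a 2D spectrogram (list of rows) to the [0, 1] range."""
    lo = min(min(row) for row in spec)
    hi = max(max(row) for row in spec)
    span = (hi - lo) or 1.0  # avoid division by zero for flat spectrograms
    return [[(v - lo) / span for v in row] for row in spec]

def stack_rgb(spec_z, spec_n, spec_e):
    """Combine Z, N, E component spectrograms into one pixel tuple per
    (frequency, time) bin, i.e. an RGB image for a pre-trained CNN."""
    return [[(z, n, e) for z, n, e in zip(rz, rn, re)]
            for rz, rn, re in zip(normalize(spec_z),
                                  normalize(spec_n),
                                  normalize(spec_e))]
```

This lets an off-the-shelf image-recognition CNN consume all three components at once, one seismic component per color channel.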


In this research study, image identification is performed with an advanced CNN (Convolutional Neural Network) using the TensorFlow framework. Python is used as the main programming language because TensorFlow is a Python library. The input data focuses on plant categories, identified from their leaves. A CNN is a strong choice for training and testing because it produces promising and continuously improving results on automated plant identification. Results are reported in terms of accuracy and time: the advanced CNN achieves accuracy above 95%, while the other approaches remain below 90% and take considerably more time.


Author(s):  
Yang Yi ◽  
Feng Ni ◽  
Yuexin Ma ◽  
Xinge Zhu ◽  
Yuankai Qi ◽  
...  

State-of-the-art hand gesture recognition methods have investigated the spatiotemporal features based on 3D convolutional neural networks (3DCNNs) or convolutional long short-term memory (ConvLSTM). However, they often suffer from the inefficiency due to the high computational complexity of their network structures. In this paper, we focus instead on the 1D convolutional neural networks and propose a simple and efficient architectural unit, Multi-Kernel Temporal Block (MKTB), that models the multi-scale temporal responses by explicitly applying different temporal kernels. Then, we present a Global Refinement Block (GRB), which is an attention module for shaping the global temporal features based on the cross-channel similarity. By incorporating the MKTB and GRB, our architecture can effectively explore the spatiotemporal features within tolerable computational cost. Extensive experiments conducted on public datasets demonstrate that our proposed model achieves the state-of-the-art with higher efficiency. Moreover, the proposed MKTB and GRB are plug-and-play modules and the experiments on other tasks, like video understanding and video-based person re-identification, also display their good performance in efficiency and capability of generalization.
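
The multi-kernel idea can be sketched minimally: apply 1D convolutions with different temporal kernel sizes to the same sequence and return one multi-scale response per kernel. This pure-Python sketch is illustrative only; the zero padding and names are assumptions, not the paper's MKTB implementation.

```python
def conv1d(x, kernel):
    """'Same'-length 1D convolution with zero padding."""
    k = len(kernel)
    pad = k // 2
    xp = [0.0] * pad + list(x) + [0.0] * pad
    return [sum(xp[i + j] * kernel[j] for j in range(k))
            for i in range(len(x))]

def multi_kernel_block(x, kernels):
    """One branch per temporal kernel size; branch outputs are returned
    together so a later layer can fuse the multi-scale responses."""
    return [conv1d(x, k) for k in kernels]
```

Each branch sees the same sequence at a different temporal receptive field, which is the explicit multi-scale modeling the block is built for.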


Author(s):  
Jaehun Kim ◽  
Stjepan Picek ◽  
Annelie Heuser ◽  
Shivam Bhasin ◽  
Alan Hanjalic

Profiled side-channel analysis based on deep learning, and more precisely Convolutional Neural Networks, is a paradigm showing significant potential. The results, although scarce for now, suggest that such techniques are even able to break cryptographic implementations protected with countermeasures. In this paper, we start by proposing a new Convolutional Neural Network instance able to reach high performance on a number of considered datasets. We compare our neural network with one designed for a particular dataset with a masking countermeasure, and we show that both are good designs but that neither can be considered superior to the other. Next, we address how the addition of artificial noise to the input signal can actually be beneficial to the performance of the neural network. Such noise addition is equivalent to a regularization term in the objective function. Using this technique, we are able to reduce the number of measurements needed to reveal the secret key by orders of magnitude for both neural networks. Our new convolutional neural network instance with added noise is able to break the implementation protected with the random delay countermeasure using only 3 traces in the attack phase. To further strengthen our experimental results, we investigate the performance with a varying number of training samples, noise levels, and epochs. Our findings show that adding noise is beneficial across all training set sizes and epochs.
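
The noise-addition step itself is simple to sketch. This is an illustrative fragment, not the paper's code; the Gaussian noise model and the sigma parameter are assumptions.

```python
import random

def add_gaussian_noise(trace, sigma, rng=None):
    """Return a copy of a side-channel trace with zero-mean Gaussian noise
    added to every sample. Applied to training inputs, this acts like a
    regularization term in the objective function."""
    rng = rng or random.Random(0)  # seeded for reproducibility
    return [v + rng.gauss(0.0, sigma) for v in trace]
```

During training, each measurement trace would be perturbed independently per epoch, so the network never sees exactly the same input twice.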


2019 ◽  
pp. 47-52
Author(s):  
R. Yu. Belorutsky ◽  
S. V. Zhitnik

The problem of recognizing human speech, in the form of the digits one to ten recorded with a dictaphone, is considered. Spectrograms of the sound signal are recognized by means of convolutional neural networks. Algorithms for input data pre-processing, network training, and word recognition are implemented. The recognition accuracy is estimated for different numbers of convolution layers; a suitable number is determined and a neural network structure is proposed. The recognition accuracy obtained when the network input is a spectrogram is compared with that obtained from the first two formants. The recognition algorithm is tested with male and female voices and different pronunciation durations.
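
The spectrogram input can be sketched with a plain DFT over overlapping frames. This is a stdlib-only illustration; the frame length, hop size, and names are arbitrary choices, not the paper's parameters.

```python
import cmath

def stft_magnitude(signal, frame_len=8, hop=4):
    """Magnitude spectrogram: split the signal into overlapping frames and
    take the DFT magnitude of each (time x frequency, one-sided)."""
    frames = [signal[i:i + frame_len]
              for i in range(0, len(signal) - frame_len + 1, hop)]
    spec = []
    for fr in frames:
        mags = [abs(sum(fr[n] * cmath.exp(-2j * cmath.pi * k * n / frame_len)
                        for n in range(frame_len)))
                for k in range(frame_len // 2 + 1)]
        spec.append(mags)
    return spec
```

A real pipeline would add windowing and a log scale, but this shows the time-frequency grid the CNN consumes.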


Author(s):  
Dolly Sapra ◽  
Andy D. Pimentel

Abstract The automated architecture search methodology for neural networks is known as Neural Architecture Search (NAS). In recent times, Convolutional Neural Networks (CNNs) designed through NAS methodologies have achieved very high performance in several fields, for instance image classification and natural language processing. Our work is in the same domain of NAS, where we traverse the search space of neural network architectures with the help of an evolutionary algorithm which has been augmented with a novel approach of piecemeal-training. In contrast to the previously published NAS techniques, wherein the training with given data is considered an isolated task to estimate the performance of neural networks, our work demonstrates that a neural network architecture and the related weights can be jointly learned by combining concepts of the traditional training process and evolutionary architecture search in a single algorithm. The consolidation has been realised by breaking down the conventional training technique into smaller slices and collating them together with an integrated evolutionary architecture search algorithm. The constraints on architecture search space are placed by limiting its various parameters within a specified range of values, consequently regulating the neural network’s size and memory requirements. We validate this concept on two vastly different datasets, namely, the CIFAR-10 dataset in the domain of image classification, and the PAMAP2 dataset in the Human Activity Recognition (HAR) domain. Starting from randomly initialized and untrained CNNs, the algorithm discovers models with competent architectures, which after complete training, reach an accuracy of 92.5% for CIFAR-10 and 94.36% for PAMAP2. We further extend the algorithm to include an additional conflicting search objective: the number of parameters of the neural network.
Our multi-objective algorithm produces a Pareto optimal set of neural networks, by optimizing the search for both the accuracy and the parameter count, thus emphasizing the versatility of our approach.
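
The evolutionary loop with short per-generation "slices" of work can be reduced to a toy sketch over a single scalar stand-in for an architecture. This greatly simplifies the paper's joint architecture-and-weight search; the elitist selection, the mutation scale, and all names are assumptions for illustration only.

```python
import random

def evolve(fitness, population, generations=10, rng=None):
    """Toy evolutionary search: each generation is one small slice of work.
    Keep the better half (elitism), derive children by slight mutation,
    and re-evaluate, instead of fully training each candidate in isolation."""
    rng = rng or random.Random(0)
    for _ in range(generations):
        ranked = sorted(population, key=fitness, reverse=True)
        parents = ranked[: len(ranked) // 2]          # survivors
        children = [p + rng.gauss(0.0, 0.1) for p in parents]  # mutations
        population = parents + children
    return max(population, key=fitness)
```

Because parents survive each generation, the best fitness found never decreases, mirroring how the paper's search never discards its best partially trained candidates.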


2022 ◽  
Author(s):  
Claudio Filipi Gonçalves dos Santos ◽  
João Paulo Papa

Several image processing tasks, such as image classification and object detection, have been significantly improved using Convolutional Neural Networks (CNN). Many architectures, such as ResNet and EfficientNet, have achieved outstanding results in at least one dataset at the time of their creation. A critical factor in training concerns the network’s regularization, which prevents the structure from overfitting. This work analyzes several regularization methods developed in the last few years, showing significant improvements for different CNN models. The works are classified into three main areas: the first is called “data augmentation”, where all the techniques focus on performing changes to the input data. The second, named “internal changes”, covers procedures that modify the feature maps generated by the neural network or the kernels. The last one, called “label”, concerns transforming the labels of a given input. This work presents two main differences compared with other available surveys about regularization: (i) the first concerns the papers gathered in the manuscript, which are not older than five years, and (ii) the second distinction is about reproducibility, i.e., all works referred to here have their code available in public repositories or have been directly implemented in some framework, such as TensorFlow or Torch.
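
As a concrete instance of the “label” category, label smoothing transforms a one-hot target into a softened distribution. This is a standard technique given here as a hedged sketch; the eps value is illustrative.

```python
def smooth_labels(one_hot, eps=0.1):
    """Label smoothing: move eps of the probability mass from the true
    class and spread it uniformly over all k classes."""
    k = len(one_hot)
    return [v * (1.0 - eps) + eps / k for v in one_hot]
```

The smoothed target still sums to one, but the network is no longer pushed toward infinitely confident logits, which regularizes training.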


2017 ◽  
Vol 2017 ◽  
pp. 1-8 ◽  
Author(s):  
Qiang Lan ◽  
Zelong Wang ◽  
Mei Wen ◽  
Chunyuan Zhang ◽  
Yijie Wang

Convolutional neural networks have proven to be highly successful in applications such as image classification, object tracking, and many other tasks based on 2D inputs. Recently, researchers have started to apply convolutional neural networks to video classification, which constitutes a 3D input and requires far larger amounts of memory and much more computation. FFT-based methods can reduce the amount of computation, but this generally comes at the cost of an increased memory requirement. On the other hand, the Winograd Minimal Filtering Algorithm (WMFA) can reduce the number of operations required and thus speed up the computation, without increasing the required memory. This strategy was shown to be successful for 2D neural networks. We implement the algorithm for 3D convolutional neural networks, apply it to a popular 3D convolutional neural network used to classify videos, and compare it to cuDNN. For our highly optimized implementation of the algorithm, we observe a twofold speedup for most of the 3D convolution layers of our test network compared to the cuDNN version.
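
The higher-dimensional variants build on the 1D minimal filtering identity F(2,3), which produces two outputs of a 3-tap filter with four multiplications instead of the six a direct computation needs. This is the standard Winograd transform; only the variable names are ours.

```python
def winograd_f23(d, g):
    """F(2,3): two outputs of a 3-tap convolution using 4 multiplications.
    d = [d0, d1, d2, d3] input tile, g = [g0, g1, g2] filter."""
    d0, d1, d2, d3 = d
    g0, g1, g2 = g
    m1 = (d0 - d2) * g0
    m2 = (d1 + d2) * (g0 + g1 + g2) / 2
    m3 = (d2 - d1) * (g0 - g1 + g2) / 2
    m4 = (d1 - d3) * g2
    return [m1 + m2 + m3, m2 - m3 - m4]
```

In practice the filter transform (g0 + g1 + g2)/2 etc. is precomputed once per filter, so only the four multiplies remain in the inner loop; nesting the transform over each dimension extends it to 2D and 3D convolutions.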


2021 ◽  
Vol 2062 (1) ◽  
pp. 012016
Author(s):  
Sunil Pandey ◽  
Naresh Kumar Nagwani ◽  
Shrish Verma

Abstract The training of deep learning convolutional neural networks is extremely compute intensive and takes a long time to complete on all but small datasets. This is a major limitation inhibiting the widespread adoption of convolutional neural networks in real-world applications, despite their better image classification performance compared with other techniques. Multidirectional research and development efforts are therefore being pursued with the objective of boosting the computational performance of convolutional neural networks. Against this background, the development of parallel and scalable deep learning convolutional neural network implementations for multisystem high performance computing architectures is important. Prior analysis based on computational experiments indicates that a combination of pipeline and task parallelism yields significant convolutional neural network performance gains of up to 18 times. This paper discusses the aspects that are important for implementing parallel and scalable convolutional neural networks on central processing unit based multisystem high performance computing architectures: computational pipelines, convolutional neural networks and their pipelines, multisystem high performance computing architectures, and parallel programming models.
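
Pipeline parallelism of the kind discussed can be sketched with one thread per stage connected by queues: while stage two processes item i, stage one is already working on item i+1. This is a stdlib-only toy, not the paper's implementation; stage functions and the None sentinel are assumptions.

```python
import queue
import threading

def pipeline(stages, items):
    """Run each stage function in its own thread, connected by FIFO queues.
    A None sentinel propagates through the pipeline to signal shutdown."""
    qs = [queue.Queue() for _ in range(len(stages) + 1)]

    def worker(fn, qin, qout):
        while True:
            x = qin.get()
            if x is None:          # sentinel: forward it and stop
                qout.put(None)
                break
            qout.put(fn(x))

    threads = [threading.Thread(target=worker, args=(fn, qs[i], qs[i + 1]))
               for i, fn in enumerate(stages)]
    for t in threads:
        t.start()
    for it in items:
        qs[0].put(it)
    qs[0].put(None)

    out = []
    while True:
        x = qs[-1].get()
        if x is None:
            break
        out.append(x)
    for t in threads:
        t.join()
    return out
```

With one worker per stage and FIFO queues, output order matches input order, and throughput is bounded by the slowest stage rather than the sum of all stages.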

