A data-driven approach for lightning nowcasting with deep learning

Author(s):  
Amirhossein Mostajabi
Ehsan Mansouri
Pedram Pad
Marcos Rubinstein
Andrea Dunbar
...  

Lightning is responsible, directly or indirectly, for significant human casualties and property damage worldwide [1,2]. It can injure and kill humans and animals, ignite fires, damage or destroy electronic devices, and cause electrical surges and system failures in airplanes and rockets [3–5]. These severe and costly outcomes can be averted by predicting lightning occurrence in advance and taking preventive action accordingly. A practical and fast lightning prediction method is therefore of considerable value.

Lightning forms in the atmosphere through a combination of complex dynamic and microphysical processes, making its occurrence difficult to predict with analytical or probabilistic approaches. In this work, we leverage advances in machine learning, deep learning, and pattern recognition to develop a lightning nowcasting model. Current numerical weather models rely on lightning parametrization and suffer from two drawbacks: the sequential nature of the model limits the computation speed, especially for nowcasting, and the recorded data are used only in the parametrization step and not in the prediction [6,7].

To cope with these drawbacks, we propose to leverage the large amounts of available data to develop a fully data-driven approach with enhanced prediction speed based on deep neural networks. The developed lightning nowcasting model is based on a residual U-net architecture [8]. The model consists of two paths from input to output: (i) a highway path that copies the input to the output, in the same way as the persistence baseline model does, and (ii) a fully convolutional U-net that learns to adjust the former path to reach the desired output. The U-net itself consists of a contracting path of alternating convolution and max-pooling layers, followed by an expanding path of alternating upsampling, convolution, and concatenation layers [9–11].

Our dataset consists of post-processed records of lightning occurrences in 15-minute intervals over 60 days, obtained from the GOES satellite over the Americas. We optimized the model using data from the northern part of South America, a region characterized by high lightning activity, and then applied it to other regions of the Americas. We use a 70-15-15% split for the training, validation, and test datasets. Once trained, the model produces a nowcast in fractions of a second and achieves an overall F1 score of 70% with a lead time of 30 minutes over South America, an increase of more than 25% in the F1 score compared to the persistence model used as our baseline forecast method.

To the best of our knowledge, our model is the first data-driven approach for lightning prediction. The developed model can pave the way to large-scale, efficient, and practical lightning prediction, which in turn can protect lives and save resources.
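The architecture described in this abstract can be illustrated with a short sketch. The PyTorch code below is a minimal, hypothetical rendering of a residual U-net with a highway path, not the authors' implementation; the input size, channel counts, and single skip connection are assumptions made for brevity.

```python
# Minimal sketch of a residual U-net nowcaster (illustrative, not the authors' code).
# The output is the input "persistence" field plus a learned correction from a
# small fully convolutional U-net; channel counts and depth are placeholders.
import torch
import torch.nn as nn

def conv_block(c_in, c_out):
    return nn.Sequential(
        nn.Conv2d(c_in, c_out, kernel_size=3, padding=1), nn.ReLU(),
        nn.Conv2d(c_out, c_out, kernel_size=3, padding=1), nn.ReLU(),
    )

class ResidualUNet(nn.Module):
    def __init__(self, channels=1, base=16):
        super().__init__()
        self.enc1 = conv_block(channels, base)            # contracting path
        self.enc2 = conv_block(base, 2 * base)
        self.pool = nn.MaxPool2d(2)
        self.up = nn.Upsample(scale_factor=2, mode="nearest")
        self.dec1 = conv_block(2 * base + base, base)     # expanding path (after concatenation)
        self.head = nn.Conv2d(base, channels, kernel_size=1)

    def forward(self, x):
        e1 = self.enc1(x)
        e2 = self.enc2(self.pool(e1))
        d1 = self.dec1(torch.cat([self.up(e2), e1], dim=1))  # upsample + skip connection
        return x + self.head(d1)                              # highway path + learned adjustment

# Example: one 128x128 lightning-occurrence frame for a 15-minute interval
frame = torch.zeros(1, 1, 128, 128)
nowcast = ResidualUNet()(frame)   # same spatial shape; output activation/thresholding omitted
```

The explicit highway path means the network only has to learn the deviation from persistence, which is what makes the comparison against the persistence baseline forecast natural.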

Algorithms
2021
Vol 14 (5)
pp. 154
Author(s):  
Marcus Walldén
Masao Okita
Fumihiko Ino
Dimitris Drikakis
Ioannis Kokkinakis

Increasing processing capabilities and input/output constraints of supercomputers have increased the use of co-processing approaches, i.e., visualizing and analyzing simulation data sets on the fly. We present a method that evaluates the importance of different regions of simulation data, and a data-driven approach that uses this method to accelerate the in-transit co-processing of large-scale simulations. The importance metrics allow multiple compression methods to be employed simultaneously on different data regions; our approach adaptively compresses data on the fly and uses load balancing to counteract memory imbalances. We demonstrate the method's efficiency on a fluid mechanics application, a Richtmyer–Meshkov instability simulation. The results show that the proposed method can expeditiously identify regions of interest, even when using multiple metrics. Our approach achieved a speedup of 1.29× in a lossless scenario, and the data decompression time was sped up by 2× compared to using a single compression method uniformly.
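To make the adaptive-compression idea concrete, here is a small Python sketch, not the paper's implementation, that picks a compression codec per data region from an importance metric; the variance-based metric, the 64×64 block size, and the threshold are placeholder assumptions.

```python
# Illustrative sketch of importance-driven, per-region compression (not the paper's code).
# Regions judged important get a stronger (slower) codec; the rest take a fast path.
import lzma
import zlib
import numpy as np

def region_importance(block: np.ndarray) -> float:
    # Placeholder metric: local variance as a proxy for "interesting" structure.
    return float(block.var())

def compress_region(block: np.ndarray, threshold: float) -> bytes:
    raw = block.astype(np.float32).tobytes()
    if region_importance(block) >= threshold:
        return b"L" + lzma.compress(raw)        # stronger, slower codec for important regions
    return b"Z" + zlib.compress(raw, 1)         # fast, lighter codec elsewhere

# Split a 2-D field into 64x64 blocks and compress each block adaptively.
field = np.random.rand(256, 256).astype(np.float32)
blocks = [field[i:i + 64, j:j + 64]
          for i in range(0, 256, 64) for j in range(0, 256, 64)]
payload = [compress_region(b, threshold=0.05) for b in blocks]
```

The one-byte tag prefixed to each region records which codec was used, so a decompressor can route each block to the matching decoder.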


2021
Vol 10 (1)
pp. e001087
Author(s):  
Tarek F Radwan
Yvette Agyako
Alireza Ettefaghian
Tahira Kamran
Omar Din
...  

A quality improvement (QI) scheme was launched in 2017, covering a large group of 25 general practices working with a deprived registered population. The aim was to improve the measurable quality of care in a population where type 2 diabetes (T2D) care had previously proved challenging. A complex set of QI interventions was co-designed by a team of primary care clinicians, educationalists, and managers. These interventions included organisation-wide goal setting, a data-driven approach, staff engagement, an educational programme for pharmacists, web-based QI learning at scale, and methods that ensured sustainability. The programme was used to optimise the management of T2D by improving the eight care processes and three treatment targets that form part of the annual national diabetes audit for patients with T2D. With the implemented interventions, there was significant improvement in all care processes and all treatment targets for patients with diabetes: achievement of all eight care processes improved by 46.0% (p<0.001), while achievement of all three treatment targets improved by 13.5% (p<0.001). The QI programme provides an example of a data-driven, large-scale, multicomponent intervention delivered in primary care in ethnically diverse and socially deprived areas.


PLoS Genetics
2021
Vol 17 (1)
pp. e1009315
Author(s):  
Ardalan Naseri
Junjie Shi
Xihong Lin
Shaojie Zhang
Degui Zhi

Inference of relationships from whole-genome genetic data of a cohort is a crucial prerequisite for genome-wide association studies. Typically, relationships are inferred by computing the kinship coefficient (ϕ) and the genome-wide probability of zero IBD sharing (π0) among all pairs of individuals. Current leading methods are based on pairwise comparisons, which may not scale up to very large cohorts (e.g., sample size >1 million). Here, we propose an efficient relationship inference method, RAFFI. RAFFI leverages the efficient RaPID method to call IBD segments first, then estimates ϕ and π0 from the detected IBD segments. This inference is achieved by a data-driven approach that adjusts the estimation based on phasing quality and genotyping quality. Using simulations, we show that RAFFI is robust against phasing/genotyping errors, admixture events, and varying marker densities, and achieves higher accuracy than KING, the current leading method, especially for more distant relatives. When applied to the phased UK Biobank data with ~500K individuals, RAFFI is approximately 18 times faster than KING. We expect RAFFI will offer fast and accurate relatedness inference for even larger cohorts.
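For readers unfamiliar with the quantities involved, the sketch below shows the standard relation between detected IBD segments and the pairwise ϕ and π0. It is not RAFFI's estimator, it omits the phasing- and genotyping-quality adjustments described above, and the genome-length constant is an approximation.

```python
# Hedged sketch: turning IBD segment lengths for one pair of individuals into the
# kinship coefficient (phi) and the probability of zero IBD sharing (pi0).
# Not the RAFFI implementation; no quality-based adjustment is applied here.
APPROX_AUTOSOMAL_CM = 3545.0  # rough total autosomal genetic length in cM (assumption)

def kinship_from_ibd(ibd1_cm, ibd2_cm, genome_cm=APPROX_AUTOSOMAL_CM):
    """ibd1_cm / ibd2_cm: lengths (in cM) of segments shared on one / both haplotypes,
    e.g., as reported by an IBD caller such as RaPID."""
    p_ibd1 = sum(ibd1_cm) / genome_cm          # fraction of genome IBD on one haplotype
    p_ibd2 = sum(ibd2_cm) / genome_cm          # fraction of genome IBD on both haplotypes
    phi = 0.25 * p_ibd1 + 0.5 * p_ibd2         # standard kinship relation
    pi0 = max(0.0, 1.0 - p_ibd1 - p_ibd2)      # remaining genome shares zero IBD
    return phi, pi0

# Example: ~1772 cM shared IBD1 and no IBD2 gives phi ~ 0.125,
# i.e., the second-degree-relative range.
print(kinship_from_ibd([900.0, 872.0], []))
```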


Author(s):  
Jorge Pulpeiro Gonzalez
King Ankobea-Ansah
Elena Escuder Milian
Carrie M. Hall

The gas exchange processes of engines are becoming increasingly complex since modern engines leverage technologies including variable valve actuation, turbochargers, and exhaust gas recirculation. Control of these many devices and of the underlying gas flows is essential for high-efficiency engine concepts. If these processes are to be controlled and estimated using model-based techniques, accurate models are required. This work explores a model framework that couples a data-driven model of the turbocharger with submodels of the intercooler, the intake and exhaust manifolds, and the engine processes to provide cylinder-specific predictions of the pressures and temperatures of the gases across the system. The model is developed and validated using data from a 2.0-liter VW turbocharged, direct-injection diesel engine and is shown to provide accurate predictions of critical gas properties.
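As a rough illustration of this kind of hybrid structure (and only that; this is not the authors' model), the sketch below couples a data-driven compressor surrogate, standing in for whatever map or regression is fitted to measured turbocharger data, with a simple physics-based filling/emptying manifold submodel; the map points, constants, and flows are made up for the example.

```python
# Illustrative hybrid sketch (not the authors' framework): a data-driven turbocharger
# surrogate feeding a physics-based intake-manifold filling/emptying submodel.
import numpy as np

R_AIR = 287.0  # specific gas constant of air, J/(kg K)

class CompressorSurrogate:
    """Data-driven piece: nearest measured map point stands in for a fitted model."""
    def __init__(self, speeds, pressure_ratios, mass_flows):
        self.speeds = speeds
        self.prs = pressure_ratios
        self.flows = mass_flows

    def mass_flow(self, speed, pr):
        i = int(np.argmin((self.speeds - speed) ** 2 + (self.prs - pr) ** 2))
        return self.flows[i]

def manifold_pressure_step(p, T, volume, m_dot_in, m_dot_out, dt):
    """Physics-based piece: isothermal filling/emptying, dp/dt = R*T/V * (m_in - m_out)."""
    return p + dt * R_AIR * T / volume * (m_dot_in - m_dot_out)

# Toy loop: intake-manifold pressure with a constant engine breathing flow (made-up numbers).
comp = CompressorSurrogate(np.array([80e3, 120e3]),   # shaft speed (arbitrary units)
                           np.array([1.5, 2.0]),      # pressure ratios
                           np.array([0.05, 0.09]))    # mass flows, kg/s
p, T, V = 1.0e5, 310.0, 3.0e-3
for _ in range(100):
    p = manifold_pressure_step(p, T, V, comp.mass_flow(100e3, p / 1.0e5), 0.06, dt=1e-3)
```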


2016
Vol 118
pp. 193-203
Author(s):  
Ehsan Taslimi Renani
Mohamad Fathi Mohamad Elias
Nasrudin Abd. Rahim

2018
Vol 115 (37)
pp. 9300-9305 ◽  
Author(s):  
Shuo Wang
Erik D. Herzog
István Z. Kiss
William J. Schwartz
Guy Bloch
...  

Extracting complex interactions (i.e., dynamic topologies) has been an essential, but difficult, step toward understanding large, complex, and diverse systems, including biological, financial, and electrical networks. However, reliable and efficient methods for recovering or estimating network topology remain a challenge due to the tremendous scale of emerging systems (e.g., brain and social networks) and the inherent nonlinearity within and between individual units. We develop a unified, data-driven approach to efficiently infer connections of networks (ICON). We apply ICON to determine the topology of networks of oscillators with different periodicities, node degrees, coupling functions, and time scales, arising in silico as well as in electrochemistry, neuronal networks, and groups of mice. The method formulates these large-scale, nonlinear estimation problems as a linear inverse problem that can be solved using parallel computing. Working with measured data from networks, ICON is robust and versatile enough to reliably reveal full and partial resonance among fast chemical oscillators, coherent circadian rhythms among hundreds of cells, and functional connectivity mediating social synchronization of circadian rhythmicity among mice over weeks.
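The linear-inverse reformulation at the heart of this approach can be illustrated with a Kuramoto-style phase model: once the unknown coupling functions are expanded in a fixed basis, the coupling coefficients enter linearly and can be recovered by least squares. The sketch below is only an illustration of that idea (a single sine basis term, synthetic data), not the ICON code.

```python
# Sketch of the linear-inverse idea (not the ICON implementation): with a Kuramoto-type
# model d(theta_i)/dt = omega_i + sum_j K_ij * sin(theta_j - theta_i), the unknowns
# (omega_i, K_ij) enter linearly once the basis sin(theta_j - theta_i) is fixed,
# so each node's couplings can be recovered by ordinary least squares.
import numpy as np

def infer_couplings(phases, dt):
    """phases: (T, N) observed phase time series; returns estimated (omega, K)."""
    T, N = phases.shape
    dtheta = np.gradient(phases, dt, axis=0)             # numerical phase velocities
    omega, K = np.zeros(N), np.zeros((N, N))
    for i in range(N):
        basis = np.sin(phases - phases[:, i:i + 1])      # column j: sin(theta_j - theta_i)
        A = np.column_stack([np.ones(T), np.delete(basis, i, axis=1)])
        coef, *_ = np.linalg.lstsq(A, dtheta[:, i], rcond=None)
        omega[i] = coef[0]
        K[i, np.arange(N) != i] = coef[1:]
    return omega, K

# Synthetic test: five oscillators, one reciprocal connection between nodes 0 and 1.
rng = np.random.default_rng(0)
N, T, dt = 5, 2000, 0.01
K_true = np.zeros((N, N))
K_true[0, 1] = K_true[1, 0] = 1.0
omega_true = rng.uniform(0.5, 1.5, N)
theta = np.zeros((T, N))
theta[0] = rng.uniform(0.0, 2.0 * np.pi, N)
for t in range(1, T):
    drift = omega_true + (K_true * np.sin(theta[t - 1] - theta[t - 1][:, None])).sum(axis=1)
    theta[t] = theta[t - 1] + dt * drift
omega_hat, K_hat = infer_couplings(theta, dt)            # K_hat recovers the 0<->1 link
```

Per the abstract, the full method handles general coupling functions and much larger systems, with the resulting linear inverse problem solved using parallel computing.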

