Spatial Variability Aware Deep Neural Networks (SVANN): A General Approach

Spatial variability is a prominent feature of various geographic phenomena such as climatic zones, USDA plant hardiness zones, and terrestrial habitat types (e.g., forest, grasslands, wetlands, and deserts). However, current deep learning methods follow a spatial-one-size-fits-all (OSFA) approach to train single deep neural network models that do not account for spatial variability. Quantification of spatial variability can be challenging due to the influence of many geophysical factors. In preliminary work, we proposed a spatial variability aware neural network (SVANN-I, formerly called SVANN ) approach where weights are a function of location but the neural network architecture is location independent. In this work, we explore a more flexible SVANN-E approach where neural network architecture varies across geographic locations. In addition, we provide a taxonomy of SVANN types and a physics inspired interpretation model. Experiments with aerial imagery based wetland mapping show that SVANN-I outperforms OSFA and SVANN-E performs the best of all.

Download Full-text

Developing a seq2seq neural network using visual attention to transform mathematical expressions from images to LaTeX.

Doklady BGUIR ◽

10.35596/1729-7648-2021-19-8-40-44 ◽

2022 ◽

Vol 19 (8) ◽

pp. 40-44

Author(s):

P. A. Vyaznikov ◽

I. D. Kotilevets

Keyword(s):

Neural Network ◽

Visual Attention ◽

Network Architecture ◽

Network Models ◽

Neural Network Architecture ◽

Neural Network Models ◽

The Neural Network ◽

Mathematical Expressions ◽

Series Of Experiments ◽

Visual Attention Mechanism

The paper presents the methods of development and the results of research on the effectiveness of the seq2seq neural network architecture using Visual Attention mechanism to solve the im2latex problem. The essence of the task is to create a neural network capable of converting an image with mathematical expressions into a similar expression in the LaTeX markup language. This problem belongs to the Image Captioning type: the neural network scans the image and, based on the extracted features, generates a description in natural language. The proposed solution uses the seq2seq architecture, which contains the Encoder and Decoder mechanisms, as well as Bahdanau Attention. A series of experiments was conducted on training and measuring the effectiveness of several neural network models.

Download Full-text

A data-driven neural network architecture for sentiment analysis

Data Technologies and Applications ◽

10.1108/dta-03-2018-0017 ◽

2019 ◽

Vol 53 (1) ◽

pp. 2-19 ◽

Cited By ~ 1

Author(s):

Erion Çano ◽

Maurizio Morisio

Keyword(s):

Neural Network ◽

Sentiment Analysis ◽

Network Architecture ◽

Network Models ◽

Data Sets ◽

Feature Maps ◽

Neural Network Architecture ◽

Neural Network Models ◽

Content Type ◽

Max Pooling

Purpose The fabulous results of convolution neural networks in image-related tasks attracted attention of text mining, sentiment analysis and other text analysis researchers. It is, however, difficult to find enough data for feeding such networks, optimize their parameters, and make the right design choices when constructing network architectures. The purpose of this paper is to present the creation steps of two big data sets of song emotions. The authors also explore usage of convolution and max-pooling neural layers on song lyrics, product and movie review text data sets. Three variants of a simple and flexible neural network architecture are also compared. Design/methodology/approach The intention was to spot any important patterns that can serve as guidelines for parameter optimization of similar models. The authors also wanted to identify architecture design choices which lead to high performing sentiment analysis models. To this end, the authors conducted a series of experiments with neural architectures of various configurations. Findings The results indicate that parallel convolutions of filter lengths up to 3 are usually enough for capturing relevant text features. Also, max-pooling region size should be adapted to the length of text documents for producing the best feature maps. Originality/value Top results the authors got are obtained with feature maps of lengths 6–18. An improvement on future neural network models for sentiment analysis could be generating sentiment polarity prediction of documents using aggregation of predictions on smaller excerpt of the entire text.

Download Full-text

A systematic approach for segmenting voiced/unvoiced signals using fuzzy-logic system and general fusion of neural network models for phonemes-based speech recognition

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-200780 ◽

2020 ◽

Vol 39 (5) ◽

pp. 7411-7429

Author(s):

Sathees Kumar Nataraj ◽

M. P. Paulraj ◽

Ahmad Nazri Bin Abdullah ◽

Sazali Bin Yaacob

Keyword(s):

Neural Network ◽

Network Architecture ◽

Spectral Band ◽

False Negative ◽

Network Models ◽

Neural Network Models ◽

Robust Model ◽

The Neural Network ◽

Total Data ◽

Trained Network

In this paper, a speech-to-text translation model has been developed for Malaysian speakers based on 41 classes of Phonemes. A simple data acquisition algorithm has been used to develop a MATLAB graphical user interface (GUI) for recording the isolated word speech signals from 35 non-native Malaysian speakers. The collected database consists of 86 words with 41 classes of phoneme based on Affricatives, Diphthongs, Fricatives, Liquid, Nasals, Semivowels and Glides, Stop and Vowels. The speech samples are preprocessed to eliminate the undesirable artifacts and the fuzzy voice classifier has been employed to classify the samples into voiced sequence and unvoiced sequence. The voiced sequences are divided into frame segments and for each frame, the Linear Predictive co-efficients features are obtained from the voiced sequence. Then the feature sets are formed by deriving the LPC features from all the extracted voiced sequences, and used for classification. The isolated words chosen based on the phonemes are associated with the extracted features to establish classification system input-output mapping. The data are then normalized and randomized to rearrange the values into definite range. The Multilayer Neural Network (MLNN) model has been developed with four combinations of input and hidden activation functions. The neural network models are trained with 60%, 70% and 80% of the total data samples. The neural network architecture was aimed at creating a robust model with 60%, 70%, and 80% of the feature set with 25 trials. The trained network model is validated by simulating the network with the remaining 40%, 30%, and 20% of the set. The reliability of trained network models were compared by measuring true-positive, false-negative, and network classification accuracy. The LPC features show better discrimination and the MLNN neural network models trained using the LPC spectral band features gives better recognition.

Download Full-text

Neural Network Modeling of Turbofan Parameters

Volume 4: Manufacturing Materials and Metallurgy; Ceramics; Structures and Dynamics; Controls, Diagnostics and Instrumentation; Education ◽

10.1115/2000-gt-0036 ◽

2000 ◽

Cited By ~ 4

Author(s):

Mark J. Embrechts ◽

Aaron L. Schweizerhof ◽

Mark Bushman ◽

Mike H. Sabatella

Keyword(s):

Neural Network ◽

Network Architecture ◽

State Of The Art ◽

Network Models ◽

Flight Test ◽

Neural Network Modeling ◽

Training Algorithms ◽

Neural Network Models ◽

Turbofan Engine ◽

The Neural Network

This research involves the development and validation of neural network models for several engine parameters for a turbofan engine. The investigation encompasses data collection, data translation, development of the neural network models, and testing. The data used to train and validate the neural network models was acquired from state of the art models as well as flight tests. During this study, different neural network architectures and training algorithms are exercised and evaluated for a turbofan engine operating at steady-state conditions. In addition, studies are performed to optimize the neural network architecture and resolution of data used for training. The resulting models are thoroughly validated using data for approximately sixty thousand flight conditions. The neural network models trained and tested with data acquired from state of the art models are capable of predicting their respective parameters with a maximum of 5.3 percent error. A neural network model created with a small set of flight test data and validated with a slightly larger set of data resulted in a maximum error of 4.6 percent.

Download Full-text

SCORING MODELING BASED ON NEURAL NETWORKS FOR DETERMINING A BANK BORROWER'S RATING

Economy of Ukraine ◽

10.15407/economyukr.2020.10.054 ◽

2020 ◽

Vol 2020 (10) ◽

pp. 54-62

Author(s):

Oleksii VASYLIEV ◽

Keyword(s):

Neural Network ◽

Neural Networks ◽

Network Architecture ◽

Statistical Data ◽

Activation Function ◽

Decision Making Process ◽

Neural Network Architecture ◽

Acceptable Accuracy ◽

The Neural Network ◽

Sigmoid Activation Function

The problem of applying neural networks to calculate ratings used in banking in the decision-making process on granting or not granting loans to borrowers is considered. The task is to determine the rating function of the borrower based on a set of statistical data on the effectiveness of loans provided by the bank. When constructing a regression model to calculate the rating function, it is necessary to know its general form. If so, the task is to calculate the parameters that are included in the expression for the rating function. In contrast to this approach, in the case of using neural networks, there is no need to specify the general form for the rating function. Instead, certain neural network architecture is chosen and parameters are calculated for it on the basis of statistical data. Importantly, the same neural network architecture can be used to process different sets of statistical data. The disadvantages of using neural networks include the need to calculate a large number of parameters. There is also no universal algorithm that would determine the optimal neural network architecture. As an example of the use of neural networks to determine the borrower's rating, a model system is considered, in which the borrower's rating is determined by a known non-analytical rating function. A neural network with two inner layers, which contain, respectively, three and two neurons and have a sigmoid activation function, is used for modeling. It is shown that the use of the neural network allows restoring the borrower's rating function with quite acceptable accuracy.

Download Full-text

Digital twin of equipment as a basis for the consumer in digital production

Automation. Modern Techologies ◽

10.36652/0869-4931-2020-74-9-394-402 ◽

2020 ◽

Keyword(s):

Neural Network ◽

Tool Wear ◽

Chip Formation ◽

Network Models ◽

Machining Accuracy ◽

Neural Network Models ◽

Digital Twin ◽

The Neural Network ◽

Digital Production ◽

Cyberphysical System

The neural network models series used in the development of an aggregated digital twin of equipment as a cyber-physical system are presented. The twins of machining accuracy, chip formation and tool wear are examined in detail. On their basis, systems for stabilization of the chip formation process during cutting and diagnose of the cutting too wear are developed. Keywords cyberphysical system; neural network model of equipment; big data, digital twin of the chip formation; digital twin of the tool wear; digital twin of nanostructured coating choice

Download Full-text

Towards Enhanced Performance of Neural-Network-Based Fault Detection Using an Sequential D-Optimum Experimental Design

Applied Sciences ◽

10.3390/app8081290 ◽

2018 ◽

Vol 8 (8) ◽

pp. 1290 ◽

Cited By ~ 2

Author(s):

Beata Mrugalska

Keyword(s):

Neural Network ◽

Experimental Design ◽

Fault Detection ◽

Network Models ◽

Neural Model ◽

Neural Network Models ◽

Optimum Experimental Design ◽

The Neural Network ◽

Adaptive Thresholds ◽

Linear Neural Network

Increasing expectations of industrial system reliability require development of more effective and robust fault diagnosis methods. The paper presents a framework for quality improvement on the neural model applied for fault detection purposes. In particular, the proposed approach starts with an adaptation of the modified quasi-outer-bounding algorithm towards non-linear neural network models. Subsequently, its convergence is proven using quadratic boundedness paradigm. The obtained algorithm is then equipped with the sequential D-optimum experimental design mechanism allowing gradual reduction of the neural model uncertainty. Finally, an emerging robust fault detection framework on the basis of the neural network uncertainty description as the adaptive thresholds is proposed.

Download Full-text

Stock Market Prediction Using Artificial Neural Networks

Advanced Engineering Forum ◽

10.4028/www.scientific.net/aef.6-7.1055 ◽

2012 ◽

Vol 6-7 ◽

pp. 1055-1060 ◽

Cited By ~ 4

Author(s):

Yang Bing ◽

Jian Kun Hao ◽

Si Chang Zhang

Keyword(s):

Neural Network ◽

Prediction Models ◽

Learning Algorithm ◽

Stock Exchange ◽

Back Propagation ◽

Composite Index ◽

Network Models ◽

Back Propagation Neural Network ◽

Neural Network Models ◽

The Neural Network

In this study we apply back propagation Neural Network models to predict the daily Shanghai Stock Exchange Composite Index. The learning algorithm and gradient search technique are constructed in the models. We evaluate the prediction models and conclude that the Shanghai Stock Exchange Composite Index is predictable in the short term. Empirical study shows that the Neural Network models is successfully applied to predict the daily highest, lowest, and closing value of the Shanghai Stock Exchange Composite Index, but it can not predict the return rate of the Shanghai Stock Exchange Composite Index in short terms.

Download Full-text

Constructive Learning of Deep Neural Networks for Bigdata Analysis

International Journal of Computer Applications Technology and Research ◽

10.7753/ijcatr0912.1001 ◽

2020 ◽

Vol 9 (12) ◽

pp. 311-322

Author(s):

Soha Abd Mohamed El-Moamen ◽

Marghany Hassan Mohamed ◽

Mohammed F. Farghally

Keyword(s):

Neural Network ◽

Lung Cancer ◽

Binary Classification ◽

Network Models ◽

Classification Model ◽

Neural Network Models ◽

Constructive Learning ◽

The Neural Network ◽

Rapid Pace ◽

Better Than

The need for tracking and evaluation of patients in real-time has contributed to an increase in knowing people’s actions to enhance care facilities. Deep learning is good at both a rapid pace in collecting frameworks of big data healthcare and good predictions for detection the lung cancer early. In this paper, we proposed a constructive deep neural network with Apache Spark to classify images and levels of lung cancer. We developed a binary classification model using threshold technique classifying nodules to benign or malignant. At the proposed framework, the neural network models training, defined using the Keras API, is performed using BigDL in a distributed Spark clusters. The proposed algorithm has metrics AUC-0.9810, a misclassifying rate from which it has been shown that our suggested classifiers perform better than other classifiers.

Download Full-text

Robotic grasp detection using a novel two-stage approach

ASP Transactions on Internet of Things ◽

10.52810/tiot.2021.100031 ◽

2021 ◽

Vol 1 (1) ◽

pp. 19-29

Author(s):

Zhe Chu ◽

Mengkai Hu ◽

Xiangyu Chen

Keyword(s):

Neural Network ◽

Neural Networks ◽

Network Models ◽

Particle Swarm Optimizer ◽

Neural Network Models ◽

Two Stage ◽

The Neural Network ◽

End To End ◽

Small Change ◽

Robotic Grasp

Recently, deep learning has been successfully applied to robotic grasp detection. Based on convolutional neural networks (CNNs), there have been lots of end-to-end detection approaches. But end-to-end approaches have strict requirements for the dataset used for training the neural network models and it’s hard to achieve in practical use. Therefore, we proposed a two-stage approach using particle swarm optimizer (PSO) candidate estimator and CNN to detect the most likely grasp. Our approach achieved an accuracy of 92.8% on the Cornell Grasp Dataset, which leaped into the front ranks of the existing approaches and is able to run at real-time speeds. After a small change of the approach, we can predict multiple grasps per object in the meantime so that an object can be grasped in a variety of ways.

Download Full-text