scholarly journals Abnormal Detection to Big Data Using Deep Neural Networks

2021 ◽  
Vol 2068 (1) ◽  
pp. 012025
Author(s):  
Jian Zheng ◽  
Zhaoni Li ◽  
Jiang Li ◽  
Hongling Liu

Abstract It is difficult to detect the anomalies in big data using traditional methods due to big data has the characteristics of mass and disorder. For the common methods, they divide big data into several small samples, then analyze these divided small samples. However, this manner increases the complexity of segmentation algorithms, moreover, it is difficult to control the risk of data segmentation. To address this, here proposes a neural network approch based on Vapnik risk model. Firstly, the sample data is randomly divided into small data blocks. Then, a neural network learns these divided small sample data blocks. To reduce the risks in the process of data segmentation, the Vapnik risk model is used to supervise data segmentation. Finally, the proposed method is verify on the historical electricity price data of Mountain View, California. The results show that our method is effectiveness.

2021 ◽  
Vol 13 (23) ◽  
pp. 4864
Author(s):  
Langfu Cui ◽  
Qingzhen Zhang ◽  
Liman Yang ◽  
Chenggang Bai

An inertial platform is the key component of a remote sensing system. During service, the performance of the inertial platform appears in degradation and accuracy reduction. For better maintenance, the inertial platform system is checked and maintained regularly. The performance change of an inertial platform can be evaluated by detection data. Due to limitations of detection conditions, inertial platform detection data belongs to small sample data. In this paper, in order to predict the performance of an inertial platform, a prediction model for an inertial platform is designed combining a sliding window, grey theory and neural network (SGMNN). The experiments results show that the SGMNN model performs best in predicting the inertial platform drift rate compared with other prediction models.


Sensors ◽  
2018 ◽  
Vol 18 (7) ◽  
pp. 2399 ◽  
Author(s):  
Cunwei Sun ◽  
Yuxin Yang ◽  
Chang Wen ◽  
Kai Xie ◽  
Fangqing Wen

The convolutional neural network (CNN) has made great strides in the area of voiceprint recognition; but it needs a huge number of data samples to train a deep neural network. In practice, it is too difficult to get a large number of training samples, and it cannot achieve a better convergence state due to the limited dataset. In order to solve this question, a new method using a deep migration hybrid model is put forward, which makes it easier to realize voiceprint recognition for small samples. Firstly, it uses Transfer Learning to transfer the trained network from the big sample voiceprint dataset to our limited voiceprint dataset for the further training. Fully-connected layers of a pre-training model are replaced by restricted Boltzmann machine layers. Secondly, the approach of Data Augmentation is adopted to increase the number of voiceprint datasets. Finally, we introduce fast batch normalization algorithms to improve the speed of the network convergence and shorten the training time. Our new voiceprint recognition approach uses the TLCNN-RBM (convolutional neural network mixed restricted Boltzmann machine based on transfer learning) model, which is the deep migration hybrid model that is used to achieve an average accuracy of over 97%, which is higher than that when using either CNN or the TL-CNN network (convolutional neural network based on transfer learning). Thus, an effective method for a small sample of voiceprint recognition has been provided.


2020 ◽  
Vol 67 (2) ◽  
pp. 114-151
Author(s):  
Daniel Kaszyński ◽  
Bogumił Kamiński ◽  
Bartosz Pankratz

The market risk management process includes the quantification of the risk connected with defined portfolios of assets and the diagnostics of the risk model. Value at Risk (VaR) is one of the most common market risk measures. Since the distributions of the daily P&L of financial instruments are unobservable, literature presents a broad range of backtests for VaR diagnostics. In this paper, we propose a new methodological approach to the assessment of the size of VaR backtests, and use it to evaluate the size of the most distinctive and popular backtests. The focus of the paper is directed towards the evaluation of the size of the backtests for small-sample cases – a typical situation faced during VaR backtesting in banking practice. The results indicate significant differences between tests in terms of the p-value distribution. In particular, frequency-based tests exhibit significantly greater discretisation effects than duration-based tests. This difference is especially apparent in the case of small samples. Our findings prove that from among the considered tests, the Kupiec TUFF and the Haas Discrete Weibull have the best properties. On the other hand, backtests which are very popular in banking practice, that is the Kupiec POF and Christoffersen’s Conditional Coverage, show significant discretisation, hence deviations from the theoretical size.


2014 ◽  
Vol 889-890 ◽  
pp. 1569-1573
Author(s):  
Shou Qi Cao ◽  
Ya Wen Zhu

Nowadays, zero inventory and just in time production and management technology are got attention by more and more domestic and foreign automobile enterprises. Accurate prediction of car sales is the key technology for ensuring zero inventory and product planning of enterprises. In this paper, a prediction model about car sales based on grey neural network was researched. Combining the advantages of grey model in the way of dealing of small sample data, and BP neural network in finding regularity of weak correlation data, the accuracy of the prediction results was improved. Finally, the model was validated by practical application of automobile enterprise production data.


Author(s):  
Jian Li ◽  
Haibin Liu ◽  
Zhijun Yang ◽  
Lei Han

2021 ◽  
Vol 2021 ◽  
pp. 1-12
Author(s):  
Yin’e Zhang ◽  
Yong Ping Liu

The prevention and control of navel orange pests and diseases is an important measure to ensure the yield of navel oranges. Aiming at the problems of slow speed, strong subjectivity, high requirements for professional knowledge required, and high identification costs in the identification methods of navel orange pests and diseases, this paper proposes a method based on DenseNet and attention. The power mechanism fusion (DCPSNET) identification method of navel orange diseases and pests improves the traditional deep dense network DenseNet model to realize accurate and efficient identification of navel orange diseases and pests. Due to the difficulty in collecting data of navel orange pests and diseases, this article uses image enhancement technology to expand. The experimental results show that, in the case of small samples, compared with the traditional model, the DCPSNET model can accurately identify different types of navel orange diseases and pests images and the accuracy of identifying six types of navel orange diseases and pests on the test set is as high as 96.90%. The method proposed in this paper has high recognition accuracy, realizes the intelligent recognition of navel orange diseases and pests, and also provides a way for high-precision recognition of small sample data sets.


2019 ◽  
Vol 2019 ◽  
pp. 1-13
Author(s):  
Hongsheng Jiang ◽  
Sujun Dong ◽  
Zheng Liu ◽  
Yue He ◽  
Fengming Ai

Centrifugal compressor is widely used in various engineering domains, and predicting the performance of a centrifugal compressor is an essential task for its conceptual design, optimization, and system simulation. For years, researchers seek to implement this mission through various kinds of methods, including interpolation, curve fitting, neural network, and other statistics-based algorithms. However, these methods usually need a large amount of data, and obtaining data may cost considerable computing or experimental resources. This paper focuses on constructing the performance maps of pressure ratio and isentropic efficiency using a limited number of sample data while maintaining accuracy. Firstly, sample data are generated from simulation using Vista CCD. Then, corrected flow rate and corrected rotational speed are used as independent variables, and the regression expressions with physical meaning of pressure ratio and isentropic efficiency are derived and simplified through thermodynamic analysis and loss analysis of centrifugal compressor, resulting in two loss-analysis-based models. Meanwhile, kriging models based on a second-order polynomial and neural network models are built. Results show that, when predicting inside data boundary, the loss-analysis-based model and the kriging model produce higher accuracy prediction even in a small data set, and the predicting result is stable, while the neural network model provides better results only in a more extensive data set with more speed lines. For the prediction outside the data boundary, the loss-analysis-based model can provide relatively accurate results. Besides, it takes less time to train and utilize a loss-analysis-based model than other models.


Sign in / Sign up

Export Citation Format

Share Document