imbalanced training data
Recently Published Documents


TOTAL DOCUMENTS

21
(FIVE YEARS 9)

H-INDEX

4
(FIVE YEARS 1)

Author(s):  
Jinyan Li ◽  
Yaoyang Wu ◽  
Simon Fong ◽  
Antonio J. Tallón-Ballesteros ◽  
Xin-she Yang ◽  
...  

2021 ◽  
Vol 2021 ◽  
pp. 1-17
Author(s):  
Zu-Min Wang ◽  
Ji-Yu Tian ◽  
Jing Qin ◽  
Hui Fang ◽  
Li-Ming Chen

Network intrusion detection remains one of the major challenges in cybersecurity. In recent years, many machine-learning-based methods have been designed to capture the dynamic and complex intrusion patterns to improve the performance of intrusion detection systems. However, two issues, including imbalanced training data and new unknown attacks, still hinder the development of a reliable network intrusion detection system. In this paper, we propose a novel few-shot learning-based Siamese capsule network to tackle the scarcity of abnormal network traffic training data and enhance the detection of unknown attacks. In specific, the well-designed deep learning network excels at capturing dynamic relationships across traffic features. In addition, an unsupervised subtype sampling scheme is seamlessly integrated with the Siamese network to improve the detection of network intrusion attacks under the circumstance of imbalanced training data. Experimental results have demonstrated that the metric learning framework is more suitable to extract subtle and distinctive features to identify both known and unknown attacks after the sampling scheme compared to other supervised learning methods. Compared to the state-of-the-art methods, our proposed method achieves superior performance to effectively detect both types of attacks.


2021 ◽  
Vol 11 (6) ◽  
pp. 2866
Author(s):  
Damheo Lee ◽  
Donghyun Kim ◽  
Seung Yun ◽  
Sanghun Kim

In this paper, we propose a new method for code-switching (CS) automatic speech recognition (ASR) in Korean. First, the phonetic variations in English pronunciation spoken by Korean speakers should be considered. Thus, we tried to find a unified pronunciation model based on phonetic knowledge and deep learning. Second, we extracted the CS sentences semantically similar to the target domain and then applied the language model (LM) adaptation to solve the biased modeling toward Korean due to the imbalanced training data. In this experiment, training data were AI Hub (1033 h) in Korean and Librispeech (960 h) in English. As a result, when compared to the baseline, the proposed method improved the error reduction rate (ERR) by up to 11.6% with phonetic variant modeling and by 17.3% when semantically similar sentences were applied to the LM adaptation. If we considered only English words, the word correction rate improved up to 24.2% compared to that of the baseline. The proposed method seems to be very effective in CS speech recognition.


2020 ◽  
Vol 124 ◽  
pp. 103611 ◽  
Author(s):  
Elias Martins Guerra Prado ◽  
Carlos Roberto de Souza Filho ◽  
Emmanuel John M. Carranza ◽  
João Gabriel Motta

Author(s):  
Amir Laadhar ◽  
Faiza Ghozzi ◽  
Imen Megdiche ◽  
Franck Ravat ◽  
Olivier Teste ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document