Using a Convolutional Neural Network and Convolutional Long Short-term Memory to Automatically Detect Aneurysms on 2D-DSA Images (Preprint)

2021 ◽  
Author(s):  
JunHua Liao ◽  
LunXin Liu ◽  
HaiHan Duan ◽  
YunZhi Huang ◽  
LiangXue Zhou ◽  
...  

BACKGROUND It is hard to distinguish cerebral aneurysms from overlapping vessels based on 2D DSA images due to their lack of spatial information. OBJECTIVE The aim of this study was to construct a deep learning diagnostic system to improve the ability to detect posterior communicating artery (PCoA) aneurysms on 2D-DSA images and validate the efficiency of the deep learning diagnostic system in 2D-DSA aneurysm detection. METHODS We proposed a two-stage detecting system. First, we established the regional localization stage (RLS) to automatically locate specific detection regions of raw 2D-DSA sequences. Then, in the intracranial aneurysm detection stage (IADS), we constructed the Bi-input+RetinaNet+C-LSTM framework to compare the performance of aneurysm detection with the existing three frameworks. Each of the frameworks had a fivefold cross-validation scheme. The area under the curve (AUC), the receiver operating characteristic (ROC) curve, and mean average precision (mAP) were used to validate the efficiency of different frameworks. The sensitivity, specificity and accuracy were used to identify the abilities of different frameworks. RESULTS A total of 255 patients with PCoA aneurysms and 20 patients without aneurysms were included in this study. The best AUC results of RetinaNet, RetinaNet+C-LSTM, Bi-input+RetinaNet and Bi-input+RetinaNet+C-LSTM were 0.95, 0.96, 0.92 and 0.97, respectively. The sensitivities of RetinaNet, RetinaNet+C-LSTM, Bi-input+RetinaNet, Bi-input+RetinaNet+C-LSTM, and human experts were 89.00% (67.02% to 98.43%), 88.00% (65.76% to 98.06%), 87.00% (64.53% to 97.66%), 89.00% (67.02% to 98.43%), and 90% (68.30% to 98.77%), respectively. The specificity of RetinaNet, RetinaNet+C-LSTM, Bi-input+RetinaNet, Bi-input+RetinaNet+C-LSTM, and human expert were 80.00% (56.34% to 94.27%), 89.00% (67.02% to 98.43%), 86.00% (63.31% to 97.24%), 93.00% (72.30% to 99.56%), and 90% (68.30% to 98.77%), respectively. The accuracies of RetinaNet, RetinaNet+C-LSTM, Bi-input+RetinaNet, Bi-input+RetinaNet+C-LSTM, and human experts were 84.50% (69.57% to 93.97%), 88.50% (74.44% to 96.39%), 86.50% (71.97% to 95.22%), 91.00% (77.63% to 97.72%), and 90.00% (76.34% to 97.21%), respectively. CONCLUSIONS A two-stage aneurysm detection system can reduce the time cost and the computational load. According to our results, more spatial and temporal information can help improve the performances of the frameworks so that Bi-input+RetinaNet+C-LSTM has the best performance compared to the other frameworks. Our study demonstrates that our system can assist doctors in detecting intracranial aneurysms on 2D-DSA images.

2021 ◽  
Author(s):  
JunHua Liao ◽  
LunXin Liu ◽  
HaiHan Duan ◽  
YunZhi Huang ◽  
LiangXue Zhou ◽  
...  

BACKGROUND It is hard to distinguish cerebral aneurysm from overlap vessels based on the 2D DSA images, for its lack the spatial information. OBJECTIVE The aim of this study is to construct a deep learning diagnostic system to improve the ability of detecting the PCoA aneurysm on 2D-DSA images and validate the efficiency of deep learning diagnostic system in 2D-DSA aneurysm detecting. METHODS We proposed a two stage detecting system. First, we established the regional localization stage (RLS) to automatically locate specific detection region of raw 2D-DSA sequences. And then, in the intracranial aneurysm detection stage (IADS) ,we build three different frames, RetinaNet, RetinaNet+LSTM, Bi-input+RetinaNet+LSTM, to detect the aneurysms. Each of the frame had fivefold cross-validation scheme. The area under curve (AUC), the receiver operating characteristic (ROC) curve, and mean average precision (mAP) were used to validate the efficiency of different frames. The sensitivity, specificity and accuracy were used to identify the ability of different frames. RESULTS 255 patients with PCoA aneurysms and 20 patients without aneurysm were included in this study. The best results of AUC of the RetinaNet, RetinaNet+LSTM, and Bi-input+RetinaNet+LSTM were 0.95, 0.96, and 0.97, respectively. The sensitivity of the RetinaNet, RetinaNet+LSTM, and Bi-input+RetinaNet+LSTM were 81.65% (59.40% to 94.76%), 87.91% (64.24% to 98.27%), 84.50% (69.57% to 93.97%), respectively. The specificity of the RetinaNet, RetinaNet+LSTM, and Bi-input+RetinaNet+LSTM were 88.89% (66.73% to 98.41%), 88.12% (66.06% to 98.08%), and 88.50% (74.44% to 96.39%), respectively. The accuracy of the RetinaNet, RetinaNet+LSTM, and Bi-input+RetinaNet+LSTM were 92.71% (71.29% to 99.54%), 89.42% (68.13% to 98.49%), and 91.00% (77.63% to 97.72%), respectively. CONCLUSIONS Two stage aneurysm detecting system can reduce time cost and the computation load. According to our results, more spatial and temporal information can help improve the performance of the frames, so that Bi-input+RetinaNet+LSTM has the best performance compared to other frames. And our study can demonstrate that our system was feasible to assist doctor to detect intracranial aneurysm on 2D-DSA images.


2018 ◽  
Vol 10 (11) ◽  
pp. 1827 ◽  
Author(s):  
Ahram Song ◽  
Jaewan Choi ◽  
Youkyung Han ◽  
Yongil Kim

Hyperspectral change detection (CD) can be effectively performed using deep-learning networks. Although these approaches require qualified training samples, it is difficult to obtain ground-truth data in the real world. Preserving spatial information during training is difficult due to structural limitations. To solve such problems, our study proposed a novel CD method for hyperspectral images (HSIs), including sample generation and a deep-learning network, called the recurrent three-dimensional (3D) fully convolutional network (Re3FCN), which merged the advantages of a 3D fully convolutional network (FCN) and a convolutional long short-term memory (ConvLSTM). Principal component analysis (PCA) and the spectral correlation angle (SCA) were used to generate training samples with high probabilities of being changed or unchanged. The strategy assisted in training fewer samples of representative feature expression. The Re3FCN was mainly comprised of spectral–spatial and temporal modules. Particularly, a spectral–spatial module with a 3D convolutional layer extracts the spectral–spatial features from the HSIs simultaneously, whilst a temporal module with ConvLSTM records and analyzes the multi-temporal HSI change information. The study first proposed a simple and effective method to generate samples for network training. This method can be applied effectively to cases with no training samples. Re3FCN can perform end-to-end detection for binary and multiple changes. Moreover, Re3FCN can receive multi-temporal HSIs directly as input without learning the characteristics of multiple changes. Finally, the network could extract joint spectral–spatial–temporal features and it preserved the spatial structure during the learning process through the fully convolutional structure. This study was the first to use a 3D FCN and a ConvLSTM for the remote-sensing CD. To demonstrate the effectiveness of the proposed CD method, we performed binary and multi-class CD experiments. Results revealed that the Re3FCN outperformed the other conventional methods, such as change vector analysis, iteratively reweighted multivariate alteration detection, PCA-SCA, FCN, and the combination of 2D convolutional layers-fully connected LSTM.


Sensors ◽  
2021 ◽  
Vol 21 (16) ◽  
pp. 5446
Author(s):  
Hyojung Ahn ◽  
Inchoon Yeo

As the workforce shrinks, the demand for automatic, labor-saving, anomaly detection technology that can perform maintenance on advanced equipment such as vehicles has been increasing. In a vehicular environment, noise in the cabin, which directly affects users, is considered an important factor in lowering the emotional satisfaction of the driver and/or passengers in the vehicles. In this study, we provide an efficient method that can collect acoustic data, measured using a large number of microphones, in order to detect abnormal operations inside the machine via deep learning in a quick and highly accurate manner. Unlike most current approaches based on Long Short-Term Memory (LSTM) or autoencoders, we propose an anomaly detection (AD) algorithm that can overcome the limitations of noisy measurement and detection system anomalies via noise signals measured inside the mechanical system. These features are utilized to train a variety of anomaly detection models for demonstration in noisy environments with five different errors in machine operation, achieving an accuracy of approximately 90% or more.


F1000Research ◽  
2021 ◽  
Vol 10 ◽  
pp. 1010
Author(s):  
Nouar AlDahoul ◽  
Hezerul Abdul Karim ◽  
Abdulaziz Saleh Ba Wazir ◽  
Myles Joshua Toledo Tan ◽  
Mohammad Faizal Ahmad Fauzi

Background: Laparoscopy is a surgery performed in the abdomen without making large incisions in the skin and with the aid of a video camera, resulting in laparoscopic videos. The laparoscopic video is prone to various distortions such as noise, smoke, uneven illumination, defocus blur, and motion blur. One of the main components in the feedback loop of video enhancement systems is distortion identification, which automatically classifies the distortions affecting the videos and selects the video enhancement algorithm accordingly. This paper aims to address the laparoscopic video distortion identification problem by developing fast and accurate multi-label distortion classification using a deep learning model. Current deep learning solutions based on convolutional neural networks (CNNs) can address laparoscopic video distortion classification, but they learn only spatial information. Methods: In this paper, utilization of both spatial and temporal features in a CNN-long short-term memory (CNN-LSTM) model is proposed as a novel solution to enhance the classification. First, pre-trained ResNet50 CNN was used to extract spatial features from each video frame by transferring representation from large-scale natural images to laparoscopic images. Next, LSTM was utilized to consider the temporal relation between the features extracted from the laparoscopic video frames to produce multi-label categories. A novel laparoscopic video dataset proposed in the ICIP2020 challenge was used for training and evaluation of the proposed method. Results: The experiments conducted show that the proposed CNN-LSTM outperforms the existing solutions in terms of accuracy (85%), and F1-score (94.2%). Additionally, the proposed distortion identification model is able to run in real-time with low inference time (0.15 sec). Conclusions: The proposed CNN-LSTM model is a feasible solution to be utilized in laparoscopic videos for distortion identification.


BMJ Open ◽  
2020 ◽  
Vol 10 (9) ◽  
pp. e036423
Author(s):  
Zhigang Song ◽  
Chunkai Yu ◽  
Shuangmei Zou ◽  
Wenmiao Wang ◽  
Yong Huang ◽  
...  

ObjectivesThe microscopic evaluation of slides has been gradually moving towards all digital in recent years, leading to the possibility for computer-aided diagnosis. It is worthwhile to know the similarities between deep learning models and pathologists before we put them into practical scenarios. The simple criteria of colorectal adenoma diagnosis make it to be a perfect testbed for this study.DesignThe deep learning model was trained by 177 accurately labelled training slides (156 with adenoma). The detailed labelling was performed on a self-developed annotation system based on iPad. We built the model based on DeepLab v2 with ResNet-34. The model performance was tested on 194 test slides and compared with five pathologists. Furthermore, the generalisation ability of the learning model was tested by extra 168 slides (111 with adenoma) collected from two other hospitals.ResultsThe deep learning model achieved an area under the curve of 0.92 and obtained a slide-level accuracy of over 90% on slides from two other hospitals. The performance was on par with the performance of experienced pathologists, exceeding the average pathologist. By investigating the feature maps and cases misdiagnosed by the model, we found the concordance of thinking process in diagnosis between the deep learning model and pathologists.ConclusionsThe deep learning model for colorectal adenoma diagnosis is quite similar to pathologists. It is on-par with pathologists’ performance, makes similar mistakes and learns rational reasoning logics. Meanwhile, it obtains high accuracy on slides collected from different hospitals with significant staining configuration variations.


2020 ◽  
Vol 17 (4A) ◽  
pp. 655-661
Author(s):  
Mohammad Shurman ◽  
Rami Khrais ◽  
Abdulrahman Yateem

In the recent years, Denial-of-Service (DoS) or Distributed Denial-of-Service (DDoS) attack has spread greatly and attackers make online systems unavailable to legitimate users by sending huge number of packets to the target system. In this paper, we proposed two methodologies to detect Distributed Reflection Denial of Service (DrDoS) attacks in IoT. The first methodology uses hybrid Intrusion Detection System (IDS) to detect IoT-DoS attack. The second methodology uses deep learning models, based on Long Short-Term Memory (LSTM) trained with latest dataset for such kinds of DrDoS. Our experimental results demonstrate that using the proposed methodologies can detect bad behaviour making the IoT network safe of Dos and DDoS attacks


2021 ◽  
Vol 11 (21) ◽  
pp. 10249
Author(s):  
Chien-Nguyen Nhu ◽  
Minho Park

Cloud computing is currently considered the most cost-effective platform for offering business and consumer IT services over the Internet. However, it is prone to new vulnerabilities. A new type of attack called an economic denial of sustainability (EDoS) attack exploits the pay-per-use model to scale up the resource usage over time to the extent that the cloud user has to pay for the unexpected usage charge. To prevent EDoS attacks, a few solutions have been proposed, including hard-threshold and machine learning-based solutions. Among them, long short-term memory (LSTM)-based solutions achieve much higher accuracy and false-alarm rates than hard-threshold and other machine learning-based solutions. However, LSTM requires a long sequence length of the input data, leading to a degraded performance owing to increases in the calculations, the detection time, and consuming a large number of computing resources of the defense system. We, therefore, propose a two-phase deep learning-based EDoS detection scheme that uses an LSTM model to detect each abnormal flow in network traffic; however, the LSTM model requires only a short sequence length of five of the input data. Thus, the proposed scheme can take advantage of the efficiency of the LSTM algorithm in detecting each abnormal flow in network traffic, while reducing the required sequence length of the input data. A comprehensive performance evaluation shows that our proposed scheme outperforms the existing solutions in terms of accuracy and resource consumption.


Author(s):  
Aakshi Mittal ◽  
Mohit Dua

AbstractDetection of spoof is essential for improving the performance of current scenario of Automatic Speaker Verification (ASV) systems. Empowerment to both frontend and backend parts can build the robust ASV systems. First, this paper discuses performance comparison of static and static–dynamic Constant Q Cepstral Coefficients (CQCC) frontend features by using Long Short Term Memory (LSTM) with Time Distributed Wrappers model at the backend. Second, it performs comparative analysis of ASV systems built using three deep learning models LSTM with Time Distributed Wrappers, LSTM and Convolutional Neural Network at backend and using static–dynamic CQCC features at frontend. Third, it discusses implementation of two spoof detection systems for ASV by using same static–dynamic CQCC features at frontend and different combination of deep learning models at backend. Out of these two, the first one is a voting protocol based two-level spoof detection system that uses CNN, LSTM model at first level and LSTM with Time Distributed Wrappers model at second level. The second one is a two-level spoof detection system with user identification and verification protocol, which uses LSTM model for user identification at first level and LSTM with Time Distributed Wrappers for verification at the second level. For implementing the proposed work, a variation in ASVspoof 2019 dataset has been used to introduce all types of spoofing attacks such as Speech Synthesis (SS), Voice Conversion (VC) and replay in single set of dataset. The results show that, at frontend, static–dynamic CQCC feature outperform static CQCC features and at the backend, hybrid combination of deep learning models increases accuracy of spoof detection systems.


PLoS ONE ◽  
2021 ◽  
Vol 16 (3) ◽  
pp. e0245259
Author(s):  
Fufeng Qiao

A DCNN-LSTM (Deep Convolutional Neural Network-Long Short Term Memory) model is proposed to recognize and track table tennis’s real-time trajectory in complex environments, aiming to help the audiences understand competition details and provide a reference for training enthusiasts using computers. Real-time motion features are extracted via deep reinforcement networks. DCNN tracks the recognized objects, and the LSTM algorithm predicts the ball’s trajectory. The model is tested on a self-built video dataset and existing systems and compared with other algorithms to verify its effectiveness. Finally, an overall tactical detection system is built to measure ball rotation and predict ball trajectory. Results demonstrate that in feature extraction, the Deep Deterministic Policy Gradient (DDPG) algorithm has the best performance, with a maximum accuracy rate of 89% and a minimum mean square error of 0.2475. The accuracy of target tracking effect and trajectory prediction is as high as 90%. Compared with traditional methods, the performance of the DCNN-LSTM model based on deep learning is improved by 23.17%. The implemented automatic detection system of table tennis tactical indicators can deal with the problems of table tennis tracking and rotation measurement. It can provide a theoretical foundation and practical value for related research in real-time dynamic detection of balls.


Sign in / Sign up

Export Citation Format

Share Document