scholarly journals Deep Generalized Max Pooling

Author(s):  
Vincent Christlein ◽  
Lukas Spranger ◽  
Mathias Seuret ◽  
Anguelos Nicolaou ◽  
Pavel Kral ◽  
...  
Keyword(s):  
Author(s):  
Yaotian Ling ◽  
Zongwei Wang ◽  
Yunfan Yang ◽  
Zhizhen Yu ◽  
Qilin Zheng ◽  
...  

2019 ◽  
Vol 53 (1) ◽  
pp. 2-19 ◽  
Author(s):  
Erion Çano ◽  
Maurizio Morisio

Purpose The fabulous results of convolution neural networks in image-related tasks attracted attention of text mining, sentiment analysis and other text analysis researchers. It is, however, difficult to find enough data for feeding such networks, optimize their parameters, and make the right design choices when constructing network architectures. The purpose of this paper is to present the creation steps of two big data sets of song emotions. The authors also explore usage of convolution and max-pooling neural layers on song lyrics, product and movie review text data sets. Three variants of a simple and flexible neural network architecture are also compared. Design/methodology/approach The intention was to spot any important patterns that can serve as guidelines for parameter optimization of similar models. The authors also wanted to identify architecture design choices which lead to high performing sentiment analysis models. To this end, the authors conducted a series of experiments with neural architectures of various configurations. Findings The results indicate that parallel convolutions of filter lengths up to 3 are usually enough for capturing relevant text features. Also, max-pooling region size should be adapted to the length of text documents for producing the best feature maps. Originality/value Top results the authors got are obtained with feature maps of lengths 6–18. An improvement on future neural network models for sentiment analysis could be generating sentiment polarity prediction of documents using aggregation of predictions on smaller excerpt of the entire text.


2021 ◽  
Author(s):  
Seshadri Ramana K ◽  
Bala Chowdappa K ◽  
Obulesu ooruchintala ◽  
Deena Babu Mandru ◽  
kallam suresh

Abstract Cancer is uncontrolled cell growth in any part of the body. Early cancer detection aims to identify patients who exhibit symptoms early on in order to maximise their chances of a successful treatment. Cancer disease mortality is decreased through early detection and treatment. Numerous researchers proposed a variety of image processing and machine learning approaches for cancer detection. However, existing systems did not improve detection accuracy or efficiency. A Deep Convolutional Neural Learning Classifier Model based on the Least Mean Square Filterative Ricker Wavelet Transform (L-DCNLC) is proposed to address the aforementioned issues. The L-DCNLC Model's primary objective is to detect cancer earlier by utilising a fully connected max pooling deep convolutional network with increased accuracy and reduced time consumption. The fully connected max pooling deep convolutional network is composed of one input layer, three hidden layers, and one output layer. Initially, the input layer of the L-DCNLC Model considers the number of patient images in the database as input.


2019 ◽  
Author(s):  
MD Sharique ◽  
Bondi Uday Pundarikaksha ◽  
Pradeeba Sridar ◽  
R S Rama Krishnan ◽  
Ramarathnam Krishnakumar

AbstractStroke is one of the leading causes of disability. Segmentation of ischemic stroke could help in planning an optimal treatment. Currently, radiologists use manual segmentation, which can often be time-consuming, laborious and error-prone. Automatic segmentation of ischemic stroke in MRI brain images is a challenging problem due to its small size, multiple occurrences and the need to use multiple image modalities. In this paper, we propose a new architecture for image segmentation, called Parallel Capsule Net, which uses max pooling in every parallel pathways along with dense connections between the parallel layers. We hypothesise that the spatial information lost due to max pooling in these layers can be retrieved by the use of such dense connections. In order to combine the information encoded by the parallel layers, outputs of the layers are concatenated before upsampling. We also propose the use of a modified loss function which consists of a regional term (Generalized Dice loss + Focal Loss) and a boundary term (Boundary loss) to address the problem of class imbalance which is prevalent in medical images. We achieved a competitive Dice score of 0.754, on ISLES SISS data set, compared to a score of 0.67 reported in earlier studies. We also obtained a Dice score of 0.902 with another popular data set, ATLAS. The proposed parallel capsule net can be extended to other similar medical image segmentation problems.


Sensors ◽  
2020 ◽  
Vol 20 (10) ◽  
pp. 2875 ◽  
Author(s):  
Vessela Krasteva ◽  
Sarah Ménétré ◽  
Jean-Philippe Didon ◽  
Irena Jekova

Deep neural networks (DNN) are state-of-the-art machine learning algorithms that can be learned to self-extract significant features of the electrocardiogram (ECG) and can generally provide high-output diagnostic accuracy if subjected to robust training and optimization on large datasets at high computational cost. So far, limited research and optimization of DNNs in shock advisory systems is found on large ECG arrhythmia databases from out-of-hospital cardiac arrests (OHCA). The objective of this study is to optimize the hyperparameters (HPs) of deep convolutional neural networks (CNN) for detection of shockable (Sh) and nonshockable (NSh) rhythms, and to validate the best HP settings for short and long analysis durations (2–10 s). Large numbers of (Sh + NSh) ECG samples were used for training (720 + 3170) and validation (739 + 5921) from Holters and defibrillators in OHCA. An end-to-end deep CNN architecture was implemented with one-lead raw ECG input layer (5 s, 125 Hz, 2.5 uV/LSB), configurable number of 5 to 23 hidden layers and output layer with diagnostic probability p ∈ [0: Sh,1: NSh]. The hidden layers contain N convolutional blocks × 3 layers (Conv1D (filters = Fi, kernel size = Ki), max-pooling (pool size = 2), dropout (rate = 0.3)), one global max-pooling and one dense layer. Random search optimization of HPs = {N, Fi, Ki}, i = 1, … N in a large grid of N = [1, 2, … 7], Fi = [5;50], Ki = [5;100] was performed. During training, the model with maximal balanced accuracy BAC = (Sensitivity + Specificity)/2 over 400 epochs was stored. The optimization principle is based on finding the common HPs space of a few top-ranked models and prediction of a robust HP setting by their median value. The optimal models for 1–7 CNN layers were trained with different learning rates LR = [10−5; 10−2] and the best model was finally validated on 2–10 s analysis durations. A number of 4216 random search models were trained. The optimal models with more than three convolutional layers did not exhibit substantial differences in performance BAC = (99.31–99.5%). Among them, the best model was found with {N = 5, Fi = {20, 15, 15, 10, 5}, Ki = {10, 10, 10, 10, 10}, 7521 trainable parameters} with maximal validation performance for 5-s analysis (BAC = 99.5%, Se = 99.6%, Sp = 99.4%) and tolerable drop in performance (<2% points) for very short 2-s analysis (BAC = 98.2%, Se = 97.6%, Sp = 98.7%). DNN application in future-generation shock advisory systems can improve the detection performance of Sh and NSh rhythms and can considerably shorten the analysis duration complying with resuscitation guidelines for minimal hands-off pauses.


Sign in / Sign up

Export Citation Format

Share Document