max pooling
Recently Published Documents


TOTAL DOCUMENTS

108
(FIVE YEARS 63)

H-INDEX

17
(FIVE YEARS 5)

2022 ◽  
Vol 2161 (1) ◽  
pp. 012016
Author(s):  
Salim Ahmed Ali ◽  
B G Prasad

Abstract Semantic segmentation is an important technology commonly used in medical imaging, autonomous driving vehicles, and backgrounds for virtual meetings. Scale Aware approaches have become the standard when it comes to the semantic segmentation domain of Machine Learning. Multiple image scales are passed through the network allowing the result to use the regular CNN layers such as max-pooling as well as convolution layers. Also, a cascading hierarchy of attention has been shown to improve the accuracy of models for such segmentation tasks. The combination of both these approaches has been shown to greatly improve the accuracy of such models. A side effect of using the cascading approach is that the model turns out to use less memory in comparison to previous approaches. Auto-labelling engines are also helpful in generalizing the model further. The cityscapes dataset used here is a useful data bank as it consists of a myriad of situations where the model can be trained and tested on. This paper presents the tested results of such a segmentation model and incremental modifications to the model pipeline to understand and improve upon the existing architecture.


2021 ◽  
Vol 2021 ◽  
pp. 1-8
Author(s):  
Bingshuai Liu ◽  
Jiawei Zheng ◽  
Hongwei Zhang ◽  
Peijie Chen ◽  
Shipeng Li ◽  
...  

In this paper, we proposed an improved 2D U-Net model integrated squeeze-and-excitation layer for prostate cancer segmentation. The proposed model combined a more complex 2D U-Net model and squeeze-and-excitation technique. The model consisted of an encoder stage and a decoder stage. The encoder stage aims to extract features of the input, which contains CONV blocks, SE layers, and max-pooling layers for improving the feature extraction capability of the model. The decoder aims to map the extracted features to the original image with CONV blocks, SE layers, and upsampling layers. The SE layer is implemented to learn more global and local features. Experiments on the public dataset PROMISE12 have demonstrated that the proposed model could achieve state-of-the-art segmentation performance compared with other traditional methods.


2021 ◽  
Vol 11 (23) ◽  
pp. 11255
Author(s):  
Marjan Kamyab ◽  
Guohua Liu ◽  
Michael Adjeisah

Sentiment analysis (SA) detects people’s opinions from text engaging natural language processing (NLP) techniques. Recent research has shown that deep learning models, i.e., Convolutional Neural Network (CNN), Recurrent Neural Network (RNN), and Transformer-based provide promising results for recognizing sentiment. Nonetheless, CNN has the advantage of extracting high-level features by using convolutional and max-pooling layers; it cannot efficiently learn a sequence of correlations. At the same time, Bidirectional RNN uses two RNN directions to improve extracting long-term dependencies. However, it cannot extract local features in parallel, and Transformer-based like Bidirectional Encoder Representations from Transformers (BERT) are the computational resources needed to fine-tune, facing an overfitting problem on small datasets. This paper proposes a novel attention-based model that utilizes CNNs with LSTM (named ACL-SA). First, it applies a preprocessor to enhance the data quality and employ term frequency-inverse document frequency (TF-IDF) feature weighting and pre-trained Glove word embedding approaches to extract meaningful information from textual data. In addition, it utilizes CNN’s max-pooling to extract contextual features and reduce feature dimensionality. Moreover, it uses an integrated bidirectional LSTM to capture long-term dependencies. Furthermore, it applies the attention mechanism at the CNN’s output layer to emphasize each word’s attention level. To avoid overfitting, the Guasiannoise and GuasianDroupout are adopted as regularization. The model’s robustness is evaluated on four English standard datasets, i.e., Sentiment140, US-airline, Sentiment140-MV, SA4A with various performance matrices, and compared efficiency with existing baseline models and approaches. The experiment results show that the proposed method significantly outperforms the state-of-the-art models.


2021 ◽  
pp. 28-34
Author(s):  
Andryi V. Manokhin ◽  
◽  
Natalia A. Rybachok ◽  

The article highlights aspects of the use of deep machine learning to recognize the accents of the English language. The software has been developed to determine the percentage of how close audio recordings are to each of 8 most common English accents. A convolutional neural network consisting of 2 convolutional layers, 1 max pooling layer, and 2 dense layers was trained across 2 epochs on a set of 5,516 audio recordings taken from the English Multi-speaker Corpus for Voice Cloning resource. The forecasting accuracy of 89.07% was achieved on the test data presented by 11 thousand MFCC matrices with a dimension of 50×87.


Author(s):  
Lixing Li ◽  
Deyang Chen ◽  
Xiaoyong Xue ◽  
Xiaoyang Zeng

2021 ◽  
Author(s):  
Seshadri Ramana K ◽  
Bala Chowdappa K ◽  
Obulesu ooruchintala ◽  
Deena Babu Mandru ◽  
kallam suresh

Abstract Cancer is uncontrolled cell growth in any part of the body. Early cancer detection aims to identify patients who exhibit symptoms early on in order to maximise their chances of a successful treatment. Cancer disease mortality is decreased through early detection and treatment. Numerous researchers proposed a variety of image processing and machine learning approaches for cancer detection. However, existing systems did not improve detection accuracy or efficiency. A Deep Convolutional Neural Learning Classifier Model based on the Least Mean Square Filterative Ricker Wavelet Transform (L-DCNLC) is proposed to address the aforementioned issues. The L-DCNLC Model's primary objective is to detect cancer earlier by utilising a fully connected max pooling deep convolutional network with increased accuracy and reduced time consumption. The fully connected max pooling deep convolutional network is composed of one input layer, three hidden layers, and one output layer. Initially, the input layer of the L-DCNLC Model considers the number of patient images in the database as input.


2021 ◽  
Vol 2021 ◽  
pp. 1-12
Author(s):  
Mustafa Mhamed ◽  
Richard Sutcliffe ◽  
Xia Sun ◽  
Jun Feng ◽  
Eiad Almekhlafi ◽  
...  

Sentiment analysis is an essential process which is important to many natural language applications. In this paper, we apply two models for Arabic sentiment analysis to the ASTD and ATDFS datasets, in both 2-class and multiclass forms. Model MC1 is a 2-layer CNN with global average pooling, followed by a dense layer. MC2 is a 2-layer CNN with max pooling, followed by a BiGRU and a dense layer. On the difficult ASTD 4-class task, we achieve 73.17%, compared to 65.58% reported by Attia et al., 2018. For the easier 2-class task, we achieve 90.06% with MC1 compared to 85.58% reported by Kwaik et al., 2019. We carry out experiments on various data splits, to match those used by other researchers. We also pay close attention to Arabic preprocessing and include novel steps not reported in other works. In an ablation study, we investigate the effect of two steps in particular, the processing of emoticons and the use of a custom stoplist. On the 4-class task, these can make a difference of up to 4.27% and 5.48%, respectively. On the 2-class task, the maximum improvements are 2.95% and 3.87%.


2021 ◽  
pp. 107456
Author(s):  
Hongfeng You ◽  
Long Yu ◽  
Shengwei Tian ◽  
Xiang Ma ◽  
Yan Xing ◽  
...  
Keyword(s):  

2021 ◽  
Vol 9 ◽  
Author(s):  
Shui-Hua Wang ◽  
Suresh Chandra Satapathy ◽  
Donovan Anderson ◽  
Shi-Xin Chen ◽  
Yu-Dong Zhang

Aim: Coronavirus disease 2019 (COVID-19) is a form of disease triggered by a new strain of coronavirus. This paper proposes a novel model termed “deep fractional max pooling neural network (DFMPNN)” to diagnose COVID-19 more efficiently.Methods: This 12-layer DFMPNN replaces max pooling (MP) and average pooling (AP) in ordinary neural networks with the help of a novel pooling method called “fractional max-pooling” (FMP). In addition, multiple-way data augmentation (DA) is employed to reduce overfitting. Model averaging (MA) is used to reduce randomness.Results: We ran our algorithm on a four-category dataset that contained COVID-19, community-acquired pneumonia, secondary pulmonary tuberculosis (SPT), and healthy control (HC). The 10 runs on the test set show that the micro-averaged F1 (MAF) score of our DFMPNN is 95.88%.Discussions: This proposed DFMPNN is superior to 10 state-of-the-art models. Besides, FMP outperforms traditional MP, AP, and L2-norm pooling (L2P).


2021 ◽  
Vol 15 ◽  
Author(s):  
Jianfeng Wu ◽  
Qunxi Dong ◽  
Jie Gui ◽  
Jie Zhang ◽  
Yi Su ◽  
...  

Biomarker assisted preclinical/early detection and intervention in Alzheimer’s disease (AD) may be the key to therapeutic breakthroughs. One of the presymptomatic hallmarks of AD is the accumulation of beta-amyloid (Aβ) plaques in the human brain. However, current methods to detect Aβ pathology are either invasive (lumbar puncture) or quite costly and not widely available (amyloid PET). Our prior studies show that magnetic resonance imaging (MRI)-based hippocampal multivariate morphometry statistics (MMS) are an effective neurodegenerative biomarker for preclinical AD. Here we attempt to use MRI-MMS to make inferences regarding brain Aβ burden at the individual subject level. As MMS data has a larger dimension than the sample size, we propose a sparse coding algorithm, Patch Analysis-based Surface Correntropy-induced Sparse-coding and Max-Pooling (PASCS-MP), to generate a low-dimensional representation of hippocampal morphometry for each individual subject. Then we apply these individual representations and a binary random forest classifier to predict brain Aβ positivity for each person. We test our method in two independent cohorts, 841 subjects from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) and 260 subjects from the Open Access Series of Imaging Studies (OASIS). Experimental results suggest that our proposed PASCS-MP method and MMS can discriminate Aβ positivity in people with mild cognitive impairment (MCI) [Accuracy (ACC) = 0.89 (ADNI)] and in cognitively unimpaired (CU) individuals [ACC = 0.79 (ADNI) and ACC = 0.81 (OASIS)]. These results compare favorably relative to measures derived from traditional algorithms, including hippocampal volume and surface area, shape measures based on spherical harmonics (SPHARM) and our prior Patch Analysis-based Surface Sparse-coding and Max-Pooling (PASS-MP) methods.


Sign in / Sign up

Export Citation Format

Share Document