scholarly journals Reliable Classification of FAQs with Spelling Errors Using an Encoder-Decoder Neural Network in Korean

2019 ◽  
Vol 9 (22) ◽  
pp. 4758 ◽  
Author(s):  
Youngjin Jang ◽  
Harksoo Kim

To resolve lexical disagreement problems between queries and frequently asked questions (FAQs), we propose a reliable sentence classification model based on an encoder-decoder neural network. The proposed model uses three types of word embeddings; fixed word embeddings for representing domain-independent meanings of words, fined-tuned word embeddings for representing domain-specific meanings of words, and character-level word embeddings for bridging lexical gaps caused by spelling errors. It also uses class embeddings to represent domain knowledge associated with each category. In the experiments with an FAQ dataset about online banking, the proposed embedding methods contributed to an improved performance of the sentence classification. In addition, the proposed model showed better performance (with an accuracy of 0.810 in the classification of 411 categories) than that of the comparison model.

2021 ◽  
Vol 16 ◽  
Author(s):  
Di Gai ◽  
Xuanjing Shen ◽  
Haipeng Chen

Background: The effective classification of the melting curve is conducive to measure the specificity of the amplified products and the influence of invalid data on subsequent experiments is excluded. Objective: In this paper, a convolutional neural network (CNN) classification model based on dynamic filter is proposed, which can categorize the number of peaks in the melting curve image and distinguish the pollution data represented by the noise peaks. Method: The main advantage of the proposed model is that it adopts the filter which changes with the input and uses the dynamic filter to capture more information in the image, making the network learning more accurate. In addition, the residual module is used to extract the characteristics of the melting curve, and the pooling operation is replaced with an atrous convolution to prevent the loss of context information. Result: In order to train the proposed model, a novel melting curve dataset is created, which includes a balanced dataset and an unbalanced dataset. The proposed method uses six classification-based assessment criteria to compare with seven representative methods based on deep learning. Experimental results show that proposed method is not only markedly outperforms the other state-of-the-art methods in accuracy, but also has much less running time. Conclusion: It evidently proves that the proposed method is suitable for judging the specificity of amplification products according to the melting curve. Simultaneously, it overcomes the difficulties of manual selection with low efficiency and artificial bias.


2020 ◽  
Vol 17 (4) ◽  
pp. 497-506
Author(s):  
Sunil Patel ◽  
Ramji Makwana

Automatic classification of dynamic hand gesture is challenging due to the large diversity in a different class of gesture, Low resolution, and it is performed by finger. Due to a number of challenges many researchers focus on this area. Recently deep neural network can be used for implicit feature extraction and Soft Max layer is used for classification. In this paper, we propose a method based on a two-dimensional convolutional neural network that performs detection and classification of hand gesture simultaneously from multimodal Red, Green, Blue, Depth (RGBD) and Optical flow Data and passes this feature to Long-Short Term Memory (LSTM) recurrent network for frame-to-frame probability generation with Connectionist Temporal Classification (CTC) network for loss calculation. We have calculated an optical flow from Red, Green, Blue (RGB) data for getting proper motion information present in the video. CTC model is used to efficiently evaluate all possible alignment of hand gesture via dynamic programming and check consistency via frame-to-frame for the visual similarity of hand gesture in the unsegmented input stream. CTC network finds the most probable sequence of a frame for a class of gesture. The frame with the highest probability value is selected from the CTC network by max decoding. This entire CTC network is trained end-to-end with calculating CTC loss for recognition of the gesture. We have used challenging Vision for Intelligent Vehicles and Applications (VIVA) dataset for dynamic hand gesture recognition captured with RGB and Depth data. On this VIVA dataset, our proposed hand gesture recognition technique outperforms competing state-of-the-art algorithms and gets an accuracy of 86%


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Ha Min Son ◽  
Wooho Jeon ◽  
Jinhyun Kim ◽  
Chan Yeong Heo ◽  
Hye Jin Yoon ◽  
...  

AbstractAlthough computer-aided diagnosis (CAD) is used to improve the quality of diagnosis in various medical fields such as mammography and colonography, it is not used in dermatology, where noninvasive screening tests are performed only with the naked eye, and avoidable inaccuracies may exist. This study shows that CAD may also be a viable option in dermatology by presenting a novel method to sequentially combine accurate segmentation and classification models. Given an image of the skin, we decompose the image to normalize and extract high-level features. Using a neural network-based segmentation model to create a segmented map of the image, we then cluster sections of abnormal skin and pass this information to a classification model. We classify each cluster into different common skin diseases using another neural network model. Our segmentation model achieves better performance compared to previous studies, and also achieves a near-perfect sensitivity score in unfavorable conditions. Our classification model is more accurate than a baseline model trained without segmentation, while also being able to classify multiple diseases within a single image. This improved performance may be sufficient to use CAD in the field of dermatology.


Sensors ◽  
2021 ◽  
Vol 21 (8) ◽  
pp. 2648
Author(s):  
Muhammad Aamir ◽  
Tariq Ali ◽  
Muhammad Irfan ◽  
Ahmad Shaf ◽  
Muhammad Zeeshan Azam ◽  
...  

Natural disasters not only disturb the human ecological system but also destroy the properties and critical infrastructures of human societies and even lead to permanent change in the ecosystem. Disaster can be caused by naturally occurring events such as earthquakes, cyclones, floods, and wildfires. Many deep learning techniques have been applied by various researchers to detect and classify natural disasters to overcome losses in ecosystems, but detection of natural disasters still faces issues due to the complex and imbalanced structures of images. To tackle this problem, we propose a multilayered deep convolutional neural network. The proposed model works in two blocks: Block-I convolutional neural network (B-I CNN), for detection and occurrence of disasters, and Block-II convolutional neural network (B-II CNN), for classification of natural disaster intensity types with different filters and parameters. The model is tested on 4428 natural images and performance is calculated and expressed as different statistical values: sensitivity (SE), 97.54%; specificity (SP), 98.22%; accuracy rate (AR), 99.92%; precision (PRE), 97.79%; and F1-score (F1), 97.97%. The overall accuracy for the whole model is 99.92%, which is competitive and comparable with state-of-the-art algorithms.


2019 ◽  
Vol 14 (1) ◽  
pp. 124-134 ◽  
Author(s):  
Shuai Zhang ◽  
Yong Chen ◽  
Xiaoling Huang ◽  
Yishuai Cai

Online feedback is an effective way of communication between government departments and citizens. However, the daily high number of public feedbacks has increased the burden on government administrators. The deep learning method is good at automatically analyzing and extracting deep features of data, and then improving the accuracy of classification prediction. In this study, we aim to use the text classification model to achieve the automatic classification of public feedbacks to reduce the work pressure of administrator. In particular, a convolutional neural network model combined with word embedding and optimized by differential evolution algorithm is adopted. At the same time, we compared it with seven common text classification models, and the results show that the model we explored has good classification performance under different evaluation metrics, including accuracy, precision, recall, and F1-score.


Author(s):  
Noha Ali ◽  
Ahmed H. AbuEl-Atta ◽  
Hala H. Zayed

<span id="docs-internal-guid-cb130a3a-7fff-3e11-ae3d-ad2310e265f8"><span>Deep learning (DL) algorithms achieved state-of-the-art performance in computer vision, speech recognition, and natural language processing (NLP). In this paper, we enhance the convolutional neural network (CNN) algorithm to classify cancer articles according to cancer hallmarks. The model implements a recent word embedding technique in the embedding layer. This technique uses the concept of distributed phrase representation and multi-word phrases embedding. The proposed model enhances the performance of the existing model used for biomedical text classification. The result of the proposed model overcomes the previous model by achieving an F-score equal to 83.87% using an unsupervised technique that trained on PubMed abstracts called PMC vectors (PMCVec) embedding. Also, we made another experiment on the same dataset using the recurrent neural network (RNN) algorithm with two different word embeddings Google news and PMCVec which achieving F-score equal to 74.9% and 76.26%, respectively.</span></span>


2020 ◽  
pp. 487-501
Author(s):  
Steven Walczak ◽  
Senanu R. Okuboyejo

This study investigates the use of artificial neural networks (ANNs) to classify reasons for medication nonadherence. A survey method is used to collect individual reasons for nonadherence to treatment plans. Seven reasons for nonadherence are identified from the survey. ANNs using backpropagation learning are trained and validated to produce a nonadherence classification model. Most patients identified multiple reasons for nonadherence. The ANN models were able to accurately predict almost 63 percent of the reasons identified for each patient. After removal of two highly common nonadherence reasons, new ANN models are able to identify 73 percent of the remaining nonadherence reasons. ANN models of nonadherence are validated as a reliable medical informatics tool for assisting healthcare providers in identifying the most likely reasons for treatment nonadherence. Physicians may use the identified nonadherence reasons to help overcome the causes of nonadherence for each patient.


2019 ◽  
Vol 8 (4) ◽  
pp. 160 ◽  
Author(s):  
Bingxin Liu ◽  
Ying Li ◽  
Guannan Li ◽  
Anling Liu

Spectral characteristics play an important role in the classification of oil film, but the presence of too many bands can lead to information redundancy and reduced classification accuracy. In this study, a classification model that combines spectral indices-based band selection (SIs) and one-dimensional convolutional neural networks was proposed to realize automatic oil films classification using hyperspectral remote sensing images. Additionally, for comparison, the minimum Redundancy Maximum Relevance (mRMR) was tested for reducing the number of bands. The support vector machine (SVM), random forest (RF), and Hu’s convolutional neural networks (CNN) were trained and tested. The results show that the accuracy of classifications through the one dimensional convolutional neural network (1D CNN) models surpassed the accuracy of other machine learning algorithms such as SVM and RF. The model of SIs+1D CNN could produce a relatively higher accuracy oil film distribution map within less time than other models.


2020 ◽  
Vol 13 (1-2) ◽  
pp. 43-52
Author(s):  
Boudewijn van Leeuwen ◽  
Zalán Tobak ◽  
Ferenc Kovács

AbstractClassification of multispectral optical satellite data using machine learning techniques to derive land use/land cover thematic data is important for many applications. Comparing the latest algorithms, our research aims to determine the best option to classify land use/land cover with special focus on temporary inundated land in a flat area in the south of Hungary. These inundations disrupt agricultural practices and can cause large financial loss. Sentinel 2 data with a high temporal and medium spatial resolution is classified using open source implementations of a random forest, support vector machine and an artificial neural network. Each classification model is applied to the same data set and the results are compared qualitatively and quantitatively. The accuracy of the results is high for all methods and does not show large overall differences. A quantitative spatial comparison demonstrates that the neural network gives the best results, but that all models are strongly influenced by atmospheric disturbances in the image.


2017 ◽  
Vol 12 (1) ◽  
pp. 43-57
Author(s):  
Aneke Rintiasti ◽  
Ikhwan Krisnadi

Various cigars, which are present in the community among the elite and prestigious venues, the raw material is a Java Tabak cigars, tobacco from Java, especially Klaten and Jember. Recent years, the availability of labor more difficult with increasing costs skyrocketing, so it must start leading to mechanization. The purpose of this research was to Generate Design of Tobacco Leaf Analysis Applications, Getting Segmentation Model for pixel readout from tobacco leaves, Generate classification models that can be used for the separation of tobacco leaves which is expected to ease the process of evaluation and classification of color in the first sorting Tobacco leaves. Tobacco Leaf used is The Under Shade Tobacco leaf (TBN) consisted of five classes, namely the color Blue / Green (B), Yellow (K), Yellow Sprayed (KV), Red (M), Red Sprayed (MV). Before analyzed the leaves image photographed using a cabinet that unaffected the outside light. TBN leaf image is then analyzed using the RGB model and models HSV, RGB image of the model  is  analyzed using the characteristic leaf color values, The image of leaf TBN that meets the characteristics become an input of Bakcpropagation Neural Networks with the target are 5 color grade which converted into a binary form. The research resulted Segmentation Model for pixel readout TBN tobacco leaves using RGB models, classification model that can be used for the classification of TBN leaves use Neural Network Back Training RGB with an error value = 8.7%.”keywords : besuki tobacco, shaded tobacco, image processingABSTRAK Aneka cerutu, yang hadir di kalangan komunitas elit dan tempat-tempat yang prestisius, bahan bakunya adalah Java Tabak Cerutu, tembakau asal Jawa, khususnya Klaten dan Jember. Beberapa tahun belakangan ini, ketersediaan tenaga kerja semakin sulit den gan biaya yang semakin meroket, sehingga harus mulai mengarah ke mekanisasi. Tujuan Penelitian ini adalah menghasilkan Rancang Bangun Aplikasi Analisa Daun Tembakau, mendapatkan Model Segmentasi untuk pembacaan piksel daun tembakau, menghasilkan Model Klasifikasi yang dapat digunakan untuk Pemisahan daun tembakau,sehingga diharapkan dapat mempermudah proses evaluasi dan klasifikasi warna pada Sortasi I daun Tembakau. Daun Tembakau yang digunakan adalah Daun Tembakau Bawah Naungan (TBN) jenis besuki terdiri dari 5 kelas warna yaitu Biru / Hijau (B), Kuning (K), Kuning Tidak Merata (KV), Merah (M), Merah Tidak Merata (MV). Sebelum dianalisa citra daun difoto menggunakan cabinet yang tidak terpengaruh cahaya luar. Citra daun TBN tersebut kemudian dianalisa menggunakan model RGB, dari model RGB citra daun dianalisa menggunakan karakteristik nilai warna, citra daun TBN yang memenuhi karakteristik menjadi masukan Jaringan Saraf Tiruan Bakcpropagation dengan target 5 kelas warna yang sudah diubah menjadi bentuk biner. Penelitian menghasilkan Model Segmentasi untuk pembacaan piksel daun tembakau TBN menggunakan model RGB, Model Klasifikasi yang dapat digunakan untuk klasifikasi daun TBN menggunakan Neural Network Back PropagationTraining RGB dengan nilai error = 8.7%.Kata Kunci : tembakau besuki, tembakau bawah naungan, pengolahan citra 


Sign in / Sign up

Export Citation Format

Share Document