scholarly journals Fusing Logical Relationship Information of Text in Neural Network for Text Classification

2020 ◽  
Vol 2020 ◽  
pp. 1-16 ◽  
Author(s):  
Heyong Wang ◽  
Dehang Zeng

With the development of computer science and information science, text classification technology has been greatly developed and its application scenarios have been widened. In traditional process of text classification, the existing method will lose much logical relationship information of text. The logical relationship information of a text refers to the relationship information among different logical parts of the text, such as title, abstract, and body. When human beings are reading, they will take title as an important part to remind the central idea of the article, abstract as a brief summary of the content of the article, and body as a detailed description of the article. In most of the text classification studies, researchers concern more about the relationship among words (word frequency, semantics, etc.) and neglect the logical relationship information of text. It will lose information about the relationship among different parts (title, body, etc.) and have an influence on the performance of text classification. Therefore, we propose a text classification algorithm—fusing the logical relationship information of text in neural network (FLRIOTINN), which complements the logical relationship information into text classification algorithms. Experiments show that the effect of FLRIOTINN is better than the conventional backpropagation neural networks which does not consider the logical relationship information of text.

Endoscopy ◽  
2019 ◽  
Vol 51 (06) ◽  
pp. 522-531 ◽  
Author(s):  
Lianlian Wu ◽  
Wei Zhou ◽  
Xinyue Wan ◽  
Jun Zhang ◽  
Lei Shen ◽  
...  

Abstract Background Gastric cancer is the third most lethal malignancy worldwide. A novel deep convolution neural network (DCNN) to perform visual tasks has been recently developed. The aim of this study was to build a system using the DCNN to detect early gastric cancer (EGC) without blind spots during esophagogastroduodenoscopy (EGD). Methods 3170 gastric cancer and 5981 benign images were collected to train the DCNN to detect EGC. A total of 24549 images from different parts of stomach were collected to train the DCNN to monitor blind spots. Class activation maps were developed to automatically cover suspicious cancerous regions. A grid model for the stomach was used to indicate the existence of blind spots in unprocessed EGD videos. Results The DCNN identified EGC from non-malignancy with an accuracy of 92.5 %, a sensitivity of 94.0 %, a specificity of 91.0 %, a positive predictive value of 91.3 %, and a negative predictive value of 93.8 %, outperforming all levels of endoscopists. In the task of classifying gastric locations into 10 or 26 parts, the DCNN achieved an accuracy of 90 % or 65.9 %, on a par with the performance of experts. In real-time unprocessed EGD videos, the DCNN achieved automated performance for detecting EGC and monitoring blind spots. Conclusions We developed a system based on a DCNN to accurately detect EGC and recognize gastric locations better than endoscopists, and proactively track suspicious cancerous lesions and monitor blind spots during EGD.


2019 ◽  
Vol 9 (11) ◽  
pp. 2347 ◽  
Author(s):  
Hannah Kim ◽  
Young-Seob Jeong

As the number of textual data is exponentially increasing, it becomes more important to develop models to analyze the text data automatically. The texts may contain various labels such as gender, age, country, sentiment, and so forth. Using such labels may bring benefits to some industrial fields, so many studies of text classification have appeared. Recently, the Convolutional Neural Network (CNN) has been adopted for the task of text classification and has shown quite successful results. In this paper, we propose convolutional neural networks for the task of sentiment classification. Through experiments with three well-known datasets, we show that employing consecutive convolutional layers is effective for relatively longer texts, and our networks are better than other state-of-the-art deep learning models.


2021 ◽  
Vol 16 (1) ◽  
pp. 1-23
Author(s):  
Keyu Yang ◽  
Yunjun Gao ◽  
Lei Liang ◽  
Song Bian ◽  
Lu Chen ◽  
...  

Text classification is a fundamental task in content analysis. Nowadays, deep learning has demonstrated promising performance in text classification compared with shallow models. However, almost all the existing models do not take advantage of the wisdom of human beings to help text classification. Human beings are more intelligent and capable than machine learning models in terms of understanding and capturing the implicit semantic information from text. In this article, we try to take guidance from human beings to classify text. We propose Crowd-powered learning for Text Classification (CrowdTC for short). We design and post the questions on a crowdsourcing platform to extract keywords in text. Sampling and clustering techniques are utilized to reduce the cost of crowdsourcing. Also, we present an attention-based neural network and a hybrid neural network to incorporate the extracted keywords as human guidance into deep neural networks. Extensive experiments on public datasets confirm that CrowdTC improves the text classification accuracy of neural networks by using the crowd-powered keyword guidance.


2019 ◽  
Vol 18 (03) ◽  
pp. 1950033
Author(s):  
Madan Lal Yadav ◽  
Basav Roychoudhury

One can either use machine learning techniques or lexicons to undertake sentiment analysis. Machine learning techniques include text classification algorithms like SVM, naive Bayes, decision tree or logistic regression, whereas lexicon-based sentiment analysis uses either general or domain-based lexicons. In this paper, we investigate the effectiveness of domain lexicons vis-à-vis general lexicon, wherein we have performed aspect-level sentiment analysis on data from three different domains, viz. car, guitar and book. While it is intuitive that domain lexicons will always perform better than general lexicons, the actual performance however may depend on the richness of the concerned domain lexicon as well as the text analysed. We used the general lexicon SentiWordNet and the corresponding domain lexicons in the aforesaid domains to compare their relative performances. The results indicate that domain lexicon used along with general lexicon performs better as compared to general lexicon or domain lexicon, when used alone. They also suggest that the performance of domain lexicons depends on the text content; and also on whether the language involves technical or non-technical words in the concerned domain. This paper makes a case for development of domain lexicons across various domains for improved performance, while gathering that they might not always perform better. It further highlights that the importance of general lexicons cannot be underestimated — the best results for aspect-level sentiment analysis are obtained, as per this paper, when both the domain and general lexicons are used side by side.


Phronesis ◽  
2014 ◽  
Vol 59 (2) ◽  
pp. 113-142 ◽  
Author(s):  
Emily Fletcher

Abstract In the Philebus, Socrates maintains two theses about the relationship between pleasure and the good life: (1) the mixed life of pleasure and intelligence is better than the unmixed life of intelligence, and: (2) the unmixed life of intelligence is the most divine. Taken together, these two claims lead to the paradoxical conclusion that the best human life is better than the life of a god. A popular strategy for avoiding this conclusion is to distinguish human from divine goods; on such a reading, pleasure has merely instrumental value, and it benefits human beings only as a result of their imperfect nature. I argue that certain ‘pure’ pleasures are full-fledged, intrinsic goods in the Philebus, which are even worthy of the gods (thus Socrates ultimately rejects thesis 2). This positive evaluation of pure pleasure results from a detailed examination of pleasure, which reveals that different types of pleasures have fundamentally different natures.


2019 ◽  
Vol 13 (4) ◽  
pp. 348-355
Author(s):  
Shao Hong ◽  
Xu Rongze ◽  
Guo Xiaopeng ◽  
Cui Wencheng

Background: The unique individual biological characteristics is used for identification in biometrics, which is safe and difficult to forge. Therefore, it can help to enhance the safety of access control system. Since the developing of modern information science and technology, computerbased signature verification system enables signature verification more efficiently and automatically in comparison with traditional human identification method. Methods: In order to improve the accuracy of Chinese signature verification, an off-line Chinese signature verification method based on deep convolutional neural network is proposed. First, the machine learning library Tensorflow is build, and the volunteers are invited to establish the offline Chinese signature dataset. Second, the dataset is pre-processed, including denoising, binarization and size normalization. Finally, three different CNN architectures (AlexNet, GoogleNet, VGGNet) are adopted to implement the signature verification. Results: Experimental results show that the performance of AlexNet is better than that of the other two convolutional neural network architectures, the accuracy of classification has been up to 99.77%, and verification rate is 87.5%. Conclusion: Compared with the traditional offline Chinese signature recognition method, the method based on the convolutional neural network Alex Net-f is better than other methods to some extent, and avoids the complicated feature engineering.


2014 ◽  
Vol 687-691 ◽  
pp. 1030-1033
Author(s):  
Yan Ming Wei ◽  
Xu Sheng Gan ◽  
Jie Yang

BP neural network has a good nonlinear mapping ability, and can describe the relationship between frequency characteristics and fault. However, the multi-resolution wavelet neural network has the simple learning rules, fast training speed with the avoidance of local minima. So a multi-resolution wavelet neural network based on UKF is proposed to solve the problem of fault diagnosis for rotating machinery. The simulation result shows that the proposed multi-resolution wavelet neural network based on UKF value has a good diagnosis capability, and is better than that of traditional BP neural network and wavelet neural network.


Author(s):  
Somayeh Noruzi

The design of residential complexes, taking into account neighborhoods in different parts of the building, can improve thequality of life of the users, human beings are created socially and physically and mentally requires communication and interaction oftheir own kind. In fact, communication is one of the most important and tangible elements of social life. In many residential complexes,due to the lack of desirable neighborhoods, the relationship between residents does not occur. In this case, many residents donot find this sense of satisfaction and belonging to the environment in which they live. In this research, the research method used isdescriptive-analytic. Using library documents, surveying methods and direct observation of space, as well as field observationsand questions from space users, to collect and analyze the data needed for research. Is. The results of the analysis of texture andneighborhoods of the Kandlos village as a case study show that the traditional neighborhood and Iranian neighborhoods are moreflexible, more diverse, more complete and comprehensive, and using the principles of location Traditional Iranian can reproducethe concept of interaction and neighborhoods and used in the design of contemporary Iranian residential complexes.


2021 ◽  
Vol 8 (5) ◽  
pp. 1067
Author(s):  
Yuliska Yuliska ◽  
Dini Hidayatul Qudsi ◽  
Juanda Hakim Lubis ◽  
Khairul Umum Syaliman ◽  
Nina Fadilah Najwa

<p class="Abstrak"><em>Review</em> atau saran dari <em>customer</em> dapat menjadi sangat penting bagi penyedia layanan, begitu pula saran dari mahasiswa mengenai layanan sebuah unit kerja di perguruan tinggi. <em>Review</em> menjadi penting karena dapat menjadi indikator kinerja penyedia layanan. Pengolahan review juga sangat penting karena dapat menjadi referensi untuk pengambilan keputusan dan peningkatan layanan yang lebih baik ke depannya. Penelitian ini menerapkan analisis sentimen pada data saran atau <em>review</em> mahasiswa terhadap kinerja unit kerja atau departemen di perguruan tinggi, yaitu Politeknik Caltex Riau. Analisis sentimen dilakukan dengan menggunakan <em>Convolutional Neural Network (CNN)</em> dan <em>word embedding</em> <em>Word2vec</em> sebagai representasi kata. <em>CNN</em> merupakan metode yang memiliki performa yang baik dalam mengklasifikasi teks, yaitu dengan teknik <em>convolutional</em> yang menggabungkan beberapa <em>window</em> kata pada kalimat dan mengambil <em>window</em> yang paling <em>representative</em>. <em>Word2Vec</em> digunakan sebagai representasi data saran dan inputan awal pada <em>CNN</em>, dimana <em>Word2Vec</em> merupakan <em>dense vectors</em> yang dapat merepresentasikan hubungan antar kata pada data saran dengan baik. Saran mahasiswa dapat mengandung kalimat yang sangat panjang, karena itu perpaduan <em>Word2Vec</em> sebagai representasi kata dan <em>CNN</em> dengan teknik <em>convolutional</em>, dapat menghasilkan representasi yang <em>representative</em> dari kalimat panjang tersebut. Penelitian ini menggunakan dua arsitektur <em>CNN</em>, yaitu <em>Simple</em> <em>CNN</em> dan <em>DoubleMax CNN</em> untuk mengidentifikasi pengaruh kompleksitas arsitektur terhadap hasil klasifikasi sentimen.  Berdasarkan hasil pengujian, <em>DoubleMax CNN</em> dapat mengklasifikasi sentimen pada saran mahasiswa dengan sangat baik, yaitu mencapai Akurasi tertinggi sebesar 98%, <em>Recall</em> 97%, <em>Precision</em> 98% dan <em>F1-Score</em> 98%.</p><p class="Abstrak"> </p><p class="Abstrak"><em><strong>Abstract</strong></em></p><p class="Abstract"><em>Student’s reviews about department performance can be essential for a college for it can be used to evaluate the department performance and to take an immediate action to improve its performance. This research applies sentiment analysis in the student’s reviews of college department in Politeknik Caltex Riau. Convolutional Neural Network and Word2Vec are employed to analyze the sentiment. CNN is known for its good performance in text classification by applying a convolutional technique to the input sentences. Word2Vec is used as word representation and as an input to the CNN. Word2Vec are dense vectors which can represent the relationship between words excellently. Student’s reviews can be a long sentence; hence the combination of Word2Vec as word representation and CNN with convolutional technique can produce a representative fiture from that long sentence. This research utilizes two CNN architectures, which are Simple CNN dan DoubleMax CNN to identify the effect of the complexity of CNN architecture to final result. Our experiments show that DoubleMax CNN has a great performance in classifying sentiment in the student’s reviews with the best Accuracy value of 98%, Recall 97%, Precision 98% and F1-Score value of 98%.<strong> </strong></em></p><p class="Abstrak"><em><strong><br /></strong></em></p>


2019 ◽  
Vol 10 (1) ◽  
pp. 74-87
Author(s):  
Arbana Kadriu ◽  
Lejla Abazi ◽  
Hyrije Abazi

Abstract Background: Text classification is a very important task in information retrieval. Its objective is to classify new text documents in a set of predefined classes, using different supervised algorithms. Objectives: We focus on the text classification for Albanian news articles using two approaches. Methods/Approach: In the first approach, the words in a collection are considered as independent components, allocating to each of them a conforming vector in the vector’s space. Here we utilized nine classifiers from the scikit-learn package, training the classifiers with part of news articles (80%) and testing the accuracy with the remaining part of these articles. In the second approach, the text classification treats words based on their semantic and syntactic word similarities, supposing a word is formed by n-grams of characters. In this case, we have used the fastText, a hierarchical classifier, that considers local word order, as well as sub-word information. We have measured the accuracy for each classifier separately. We have also analyzed the training and testing time. Results: Our results show that the bag of words model does better than fastText when testing the classification process for not a large dataset of text. FastText shows better performance when classifying multi-label text. Conclusions: News articles can serve to create a benchmark for testing classification algorithms of Albanian texts. The best results are achieved with a bag of words model, with an accuracy of 94%.


Sign in / Sign up

Export Citation Format

Share Document