scholarly journals Distributional Semantic Model Based on Convolutional Neural Network for Arabic Textual Similarity

Author(s):  
Adnen Mahmoud ◽  
Mounir Zrigui

The problem addressed is to develop a model that can reliably identify whether a previously unseen document pair is paraphrased or not. Its detection in Arabic documents is a challenge because of its variability in features and the lack of publicly available corpora. Faced with these problems, the authors propose a semantic approach. At the feature extraction level, the authors use global vectors representation combining global co-occurrence counting and a contextual skip gram model. At the paraphrase identification level, the authors apply a convolutional neural network model to learn more contextual and semantic information between documents. For experiments, the authors use Open Source Arabic Corpora as a source corpus. Then the authors collect different datasets to create a vocabulary model. For the paraphrased corpus construction, the authors replace each word from the source corpus by its most similar one which has the same grammatical class applying the word2vec algorithm and the part-of-speech annotation. Experiments show that the model achieves promising results in terms of precision and recall compared to existing approaches in the literature.

Connectivity ◽  
2020 ◽  
Vol 148 (6) ◽  
Author(s):  
B. V. Shefkin ◽  
◽  
I. V. Krasyuk ◽  
V. O. Khomenchuk ◽  
K. P. Storchak ◽  
...  

TensorFlow is Google’s open-source machine learning and deep learning framework, which is convenient and flexible to build the current mainstream deep learning model. Convolutional neural network is a classical model of deep learning, the advantage lies in its powerful feature extraction capabilities of convolutional blocks. A neural network in the simplest case is a mathematical model consisting of several layers of elements that perform parallel calculations. Initially, such an architecture was created by analogy with the small computing elements of the human brain — neurons. The minimal computing elements of an artificial neural network are also called neurons. Neural networks typically consist of three or more layers: an input layer, a hidden layer (or layers), and an output layer. An important feature of the neural network is its ability to learn by example, this is called learning with a teacher. The neural network is trained on a large number of examples consisting of input-output pairs (corresponding to each other input and output). In object recognition problems, such a pair will be the input image and the corresponding label — the name of the object. Neural network learning is an iterative process that reduces the deviation of the network output from a given «teacher response» — a label that corresponds to a given image. This process consists of steps called epochs of learning (they are usually calculated in thousands), each of which is the adjustment of the «weights» of the neural network — the parameters of the hidden layers of the network. Upon completion of the learning process, the quality of the neural network is usually good enough to perform the task for which it was trained, although the optimal set of parameters that perfectly recognizes all the images, it is often impossible to choose. Based on the TensorFlow platform, a convolutional neural network model with two-convolution-layers was built. The model was trained and tested with the MNIST data set. The test accuracy rate could reach 99,15%, and compared with the rate of 98,69% with only one-convolution-layer model, which shows that the two-convolution-layers convolutional neural network model has a better ability of feature extraction and classification decision-making.


2019 ◽  
Vol 24 (3) ◽  
pp. 220-228
Author(s):  
Gusti Alfahmi Anwar ◽  
Desti Riminarsih

Panthera merupakan genus dari keluarga kucing yang memiliki empat spesies popular yaitu, harimau, jaguar, macan tutul, singa. Singa memiliki warna keemasan dan tidak memilki motif, harimau memiliki motif loreng dengan garis-garis panjang, jaguar memiliki tubuh yang lebih besar dari pada macan tutul serta memiliki motif tutul yang lebih lebar, sedangkan macan tutul memiliki tubuh yang sedikit lebih ramping dari pada jaguar dan memiliki tutul yang tidak terlalu lebar. Pada penelitian ini dilakukan klasifikasi genus panther yaitu harimau, jaguar, macan tutul, dan singa menggunakan metode Convolutional Neural Network. Model Convolutional Neural Network yang digunakan memiliki 1 input layer, 5 convolution layer, dan 2 fully connected layer. Dataset yang digunakan berupa citra harimau, jaguar, macan tutul, dan singa. Data training terdiri dari 3840 citra, data validasi sebanyak 960 citra, dan data testing sebanyak 800 citra. Hasil akurasi dari pelatihan model untuk training yaitu 92,31% dan validasi yaitu 81,88%, pengujian model menggunakan dataset testing mendapatan hasil 68%. Hasil akurasi prediksi didapatkan dari nilai F1-Score pada pengujian didapatkan sebesar 78% untuk harimau, 70% untuk jaguar, 37% untuk macan tutul, 74% untuk singa. Macan tutul mendapatkan akurasi terendah dibandingkan 3 hewan lainnya tetapi lebih baik dibandingkan hasil penelitian sebelumnya.


2021 ◽  
Vol 1099 (1) ◽  
pp. 012001
Author(s):  
Srishti Garg ◽  
Tanishq Sehga ◽  
Aakriti Jain ◽  
Yash Garg ◽  
Preeti Nagrath ◽  
...  

2021 ◽  
pp. 1-10
Author(s):  
Chien-Cheng Leea ◽  
Zhongjian Gao ◽  
Xiu-Chi Huanga

This paper proposes a Wi-Fi-based indoor human detection system using a deep convolutional neural network. The system detects different human states in various situations, including different environments and propagation paths. The main improvements proposed by the system is that there is no cameras overhead and no sensors are mounted. This system captures useful amplitude information from the channel state information and converts this information into an image-like two-dimensional matrix. Next, the two-dimensional matrix is used as an input to a deep convolutional neural network (CNN) to distinguish human states. In this work, a deep residual network (ResNet) architecture is used to perform human state classification with hierarchical topological feature extraction. Several combinations of datasets for different environments and propagation paths are used in this study. ResNet’s powerful inference simplifies feature extraction and improves the accuracy of human state classification. The experimental results show that the fine-tuned ResNet-18 model has good performance in indoor human detection, including people not present, people still, and people moving. Compared with traditional machine learning using handcrafted features, this method is simple and effective.


Sign in / Sign up

Export Citation Format

Share Document