Agricultural Product Recommendation Model based on BMF

Building on collaborative deep learning (CDL) and convolutional matrix factorisation (ConvMF), this article replaces the traditional word vector construction method with the language model BERT and uses a bidirectional long short-term memory network (Bi-LSTM) to construct an improved collaborative filtering model, BMF. The model not only addresses polysemy but also alleviates the sparsity of the scoring matrix. Experiments show that the proposed model is effective and superior to CDL and ConvMF. The trained model reaches an MSE of 1.031, which is 9.7% lower than that of ConvMF.
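To make the reported figures concrete, here is a minimal sketch of how an MSE of this kind is typically computed for a matrix-factorisation recommender: only the observed entries of the sparse scoring matrix are scored, and a relative reduction is taken against a baseline. The function names, latent factors, and ratings are hypothetical and are not taken from the article.

```python
def observed_mse(ratings, user_factors, item_factors):
    """Mean squared error over the observed (user, item) -> rating entries only."""
    total = 0.0
    for (user, item), rating in ratings.items():
        # Predicted rating is the dot product of the two latent vectors.
        predicted = sum(u * v for u, v in zip(user_factors[user], item_factors[item]))
        total += (rating - predicted) ** 2
    return total / len(ratings)


def relative_reduction(baseline, value):
    """Fraction by which `value` is lower than `baseline`."""
    return (baseline - value) / baseline


# Toy data (hypothetical): three observed ratings in a 2 x 2 scoring matrix.
ratings = {(0, 0): 5.0, (0, 1): 3.0, (1, 1): 4.0}
user_factors = [[1.0, 2.0], [2.0, 0.0]]
item_factors = [[1.0, 2.0], [2.0, 1.0]]

mse = observed_mse(ratings, user_factors, item_factors)
```

Unobserved entries never enter the sum, which is why the metric stays well defined even when the scoring matrix is very sparse.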

A Hybrid Deep Learning Model for Predicting Protein Hydroxylation Sites

International Journal of Molecular Sciences ◽

10.3390/ijms19092817 ◽

2018 ◽

Vol 19 (9) ◽

pp. 2817 ◽

Cited By ~ 9

Author(s):

Haixia Long ◽

Bo Liao ◽

Xingyu Xu ◽

Jialiang Yang

Keyword(s):

Deep Learning ◽

Short Term Memory ◽

Learning Model ◽

New Drugs ◽

Post Translational Modifications ◽

Novel Approach ◽

Benchmark Datasets ◽

Memory Network ◽

Scoring Matrix ◽

Deep Learning Model

Protein hydroxylation is a type of post-translational modification (PTM) that plays critical roles in human diseases. Protein sequences are known to contain many uncharacterized proline and lysine residues. The question that needs to be answered is which residues can be hydroxylated and which cannot. The answer will not only help in understanding the mechanism of hydroxylation but can also benefit the development of new drugs. In this paper, we propose a novel approach for predicting hydroxylation using a hybrid deep learning model that integrates a convolutional neural network (CNN) and a long short-term memory network (LSTM). We employed a pseudo amino acid composition (PseAAC) method to construct valid benchmark datasets based on a sliding window strategy and used the position-specific scoring matrix (PSSM) to represent samples as inputs to the deep learning model. In addition, we compared our method with popular predictors including CNN, iHyd-PseAAC, and iHyd-PseCp. The results of 5-fold cross-validation all demonstrated that our method significantly outperforms the other methods in prediction accuracy.
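The sliding window strategy mentioned above can be sketched as follows: each candidate proline (P) or lysine (K) becomes the centre of a fixed-length peptide window. The flank length, the padding character, and the function name are assumptions made for illustration; the abstract does not state the values the authors used.

```python
def residue_windows(sequence, targets=("P", "K"), flank=6, pad="X"):
    """Return (position, window) pairs centred on each target residue.

    `flank` residues are taken on each side; sequence ends are padded with
    `pad` so that every window has length 2 * flank + 1.
    """
    padded = pad * flank + sequence + pad * flank
    windows = []
    for position, residue in enumerate(sequence):
        if residue in targets:
            # In the padded string the centre sits at position + flank.
            windows.append((position, padded[position : position + 2 * flank + 1]))
    return windows


# Hypothetical toy sequence with one lysine and one proline.
example = residue_windows("AKP", flank=1)
```

Each window would then be encoded (for example, by the corresponding PSSM rows) before being passed to a model.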

Deep Learning based Semantic Similarity Detection using Text Data

Information Technology And Control ◽

10.5755/j01.itc.49.4.27118 ◽

2020 ◽

Vol 49 (4) ◽

pp. 495-510

Author(s):

Muhammad Mansoor ◽

Zahoor ur Rehman ◽

Muhammad Shaheen ◽

Muhammad Attique Khan ◽

Mohamed Habib

Keyword(s):

Deep Learning ◽

Language Processing ◽

Short Term Memory ◽

Main Task ◽

Detection Algorithms ◽

Similarity Detection ◽

Novel Approach ◽

Proposed Model ◽

Memory Network ◽

Numeric Data

Similarity detection in text is the main task for a number of Natural Language Processing (NLP) applications. Because textual data is far larger in quantity and volume than numeric data, measuring textual similarity is an important problem. Most similarity detection algorithms are based on word-to-word matching, sentence/paragraph matching, or matching of the whole document. In this research, a novel approach is proposed using deep learning models, combining a Long Short Term Memory network (LSTM) with a Convolutional Neural Network (CNN) to measure semantic similarity between two questions. The proposed model takes sentence pairs as input and measures the similarity between them. The model is tested on the publicly available Quora dataset. It achieved 87.50% accuracy, which is better than the previous approaches.

Deep-Kcr: accurate detection of lysine crotonylation sites using deep learning method

Briefings in Bioinformatics ◽

10.1093/bib/bbaa255 ◽

2020 ◽

Author(s):

Hao Lv ◽

Fu-Ying Dao ◽

Zheng-Xing Guan ◽

Hui Yang ◽

Yan-Wen Li ◽

...

Keyword(s):

Deep Learning ◽

Large Scale ◽

Short Term Memory ◽

Information Gain ◽

Independent Set ◽

Cost Effective ◽

Cellular Regulation ◽

Proposed Model ◽

Experimental Approaches ◽

Memory Network

As a newly discovered protein posttranslational modification, histone lysine crotonylation (Kcr) is involved in cellular regulation and human diseases. Various proteomics technologies have been developed to detect Kcr sites. However, experimental approaches for identifying Kcr sites are often time-consuming and labor-intensive, which makes them difficult to apply widely across large numbers of species. Computational approaches are cost-effective and can be used in a high-throughput manner to generate relatively precise identification. In this study, we develop a deep learning-based method termed Deep-Kcr for Kcr site prediction by combining sequence-based features, physicochemical property-based features and numerical space-derived information with information gain feature selection. We investigate the performance of a convolutional neural network (CNN) and five commonly used classifiers (long short-term memory network, random forest, LogitBoost, naive Bayes and logistic regression) using 10-fold cross-validation and an independent set test. Results show that the CNN consistently displays the best performance with high computational efficiency on large datasets. We also compare Deep-Kcr with other existing tools to demonstrate the excellent predictive power and robustness of our method. Based on the proposed model, a webserver called Deep-Kcr was established and is freely accessible at http://lin-group.cn/server/Deep-Kcr.

KM-BERT: A Pre-trained BERT for Korean Medical Natural Language Processing (Preprint)

10.2196/preprints.31223 ◽

2021 ◽

Author(s):

Yoojoong Kim ◽

Jeong Moon Lee ◽

Moon Joung Jang ◽

Yun Jin Yum ◽

Jong-Ho Kim ◽

...

Keyword(s):

Deep Learning ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Pearson Correlation ◽

Language Model ◽

Language Models ◽

Korean Language ◽

Medical Texts ◽

Proposed Model

Background: With advances in deep learning and natural language processing, analyzing medical texts is becoming increasingly important. Nonetheless, despite the importance of medical texts, a study on medical-specific language models has not yet been conducted. Objective: Korean medical text is highly difficult to analyze because of the agglutinative characteristics of the language as well as the complex terminologies in the medical domain. To solve this problem, we collected a Korean medical corpus and used it to train language models. Methods: In this paper, we present a Korean medical language model based on deep learning natural language processing. The proposed model was trained using the pre-training framework of BERT for the medical context based on a state-of-the-art Korean language model. Results: After pre-training, the proposed method showed increased accuracies of 0.147 and 0.148 for the masked language model with next sentence prediction. In the intrinsic evaluation, the next sentence prediction accuracy improved by 0.258, which is a remarkable enhancement. In addition, the extrinsic evaluation of Korean medical semantic textual similarity data showed a 0.046 increase in the Pearson correlation. Conclusions: The results demonstrated the superiority of the proposed model for Korean medical natural language processing. We expect that our proposed model can be extended for application to various languages and domains.

Collaborative Filtering Recommendation Algorithm for MOOC Resources Based on Deep Learning

Complexity ◽

10.1155/2021/5555226 ◽

2021 ◽

Vol 2021 ◽

pp. 1-11

Author(s):

Lili Wu

Keyword(s):

Deep Learning ◽

Collaborative Filtering ◽

Laplacian Matrix ◽

Network Information ◽

Recommendation Algorithms ◽

Text Word ◽

Scoring Matrix ◽

Accuracy Of Prediction ◽

The Impact ◽

Relational Network

In view of the poor recommendation performance of traditional resource collaborative filtering recommendation algorithms, this article proposes a collaborative filtering recommendation model based on deep learning for art and MOOC resources. This model first uses embedding vectors based on the context of metapaths for learning. Embedding vectors based on the context of metapaths aggregate different metapath information and different MOOCs may have different preferences for different metapaths. Secondly, to capture this preference drift, the model introduces an attention mechanism, which can improve the interpretability of the recommendation results. Then, by introducing the Laplacian matrix into the prior distribution of the hidden factor feature matrix, the relational network information is effectively integrated into the model. Finally, compared with the traditional model using the scoring matrix, the model in this article using text word vectors effectively alleviates the impact of data sparsity and greatly improves the accuracy of prediction. After analyzing the experimental results, compared with other algorithms, the resource collaborative filtering recommendation model proposed in this article has achieved better recommendation results, with good stability and scalability.
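For readers unfamiliar with the Laplacian matrix referred to above, the sketch below builds the unnormalised graph Laplacian L = D - A of a small relational network and evaluates the quadratic form x^T L x, which is small when connected nodes have similar values. The abstract does not say which Laplacian variant or prior the author uses, so the unnormalised form, the function names, and the toy graph are assumptions for illustration only.

```python
def graph_laplacian(adjacency):
    """Unnormalised Laplacian L = D - A for a symmetric adjacency matrix
    without self-loops (D is the diagonal matrix of node degrees)."""
    size = len(adjacency)
    return [
        [(sum(adjacency[i]) if i == j else 0) - adjacency[i][j] for j in range(size)]
        for i in range(size)
    ]


def smoothness(laplacian, values):
    """Quadratic form x^T L x; equals the sum of (x_i - x_j)^2 over edges."""
    size = len(values)
    return sum(
        values[i] * laplacian[i][j] * values[j]
        for i in range(size)
        for j in range(size)
    )


# Hypothetical relational network: a path 0 - 1 - 2.
adjacency = [[0, 1, 0], [1, 0, 1], [0, 1, 0]]
laplacian = graph_laplacian(adjacency)
penalty = smoothness(laplacian, [1, 2, 4])
```

Used as a penalty (or, equivalently, inside a Gaussian prior) on latent factors, this term encourages related items in the network to receive similar factors.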

A Deep Bi-directional Attention Network for Human Motion Recovery

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2019/99 ◽

2019 ◽

Cited By ~ 4

Author(s):

Qiongjie Cui ◽

Huaijiang Sun ◽

Yupeng Li ◽

Yue Kong

Keyword(s):

Relevant Information ◽

Human Motion ◽

Sequential Data ◽

Time Step ◽

Attention Network ◽

Motion Data ◽

Proposed Model ◽

Memory Network ◽

Short Time

Human motion capture (mocap) data, which records the movement of markers attached to specific joints, has gradually become the most popular solution for animation production. However, raw motion data are often corrupted by joint occlusion, marker shedding and limited equipment precision, which severely limits performance in real-world applications. Since human motion is essentially sequential data, the latest methods resort to variants of the long short-term memory network (LSTM) to solve related problems, but most of them tend to produce visually unreasonable results. This is mainly because these methods struggle to capture long-term dependencies and cannot explicitly utilize relevant context, especially in long sequences. To address these issues, we propose a deep bi-directional attention network (BAN) that can not only capture long-term dependencies but also adaptively extract relevant information at each time step. Moreover, the proposed model, which embeds an attention mechanism in the bi-directional LSTM (BLSTM) structure at the encoding and decoding stages, can decide where to borrow information and use it to recover corrupted frames effectively. Extensive experiments on the CMU database demonstrate that the proposed model consistently outperforms other state-of-the-art methods in terms of recovery accuracy and visualization.

Deep Learning and Collaborative Filtering-Based Methods for Students’ Performance Prediction and Course Recommendation

Wireless Communications and Mobile Computing ◽

10.1155/2021/2157343 ◽

2021 ◽

Vol 2021 ◽

pp. 1-13 ◽

Cited By ~ 1

Author(s):

Jinyang Liu ◽

Chuantao Yin ◽

Yuhang Li ◽

Honglu Sun ◽

Hong Zhou

Keyword(s):

Deep Learning ◽

Prediction Model ◽

Collaborative Filtering ◽

Performance Prediction ◽

Good Effect ◽

Course Selection ◽

Hybrid Prediction ◽

Proposed Model ◽

Student’s Performance ◽

Vocational Learning

At the beginning of a new semester, students have a limited understanding of the new courses, which makes it difficult for them to make informed choices about the courses of the current semester. To help students solve this problem, this paper proposes a hybrid prediction model based on deep learning and collaborative filtering. The proposed model can automatically generate personalized suggestions about courses in the next semester to assist students in course selection. The two important tasks of this study are course recommendation and student ranking prediction. First, we use a user-based collaborative filtering model to give a list of recommended courses by calculating the similarity between users. Then, for the courses in the list, we use a hybrid prediction model to predict the student’s performance in each course, that is, ranking prediction. Finally, we give a list of courses that the student is good at or not good at according to the predicted ranking of the courses. Our method is evaluated on students’ data from two departments of our university. Through experiments, we compared the hybrid prediction model with other nonhybrid models and confirmed the good effect of our model. By using our model, students can refer to the different recommendation lists given and choose courses that they may be interested in and good at. The proposed method can be widely applied in Internet of Things and industrial vocational learning systems.
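The first step described above, user-based collaborative filtering driven by similarity between users, can be sketched roughly as follows. Cosine similarity over grade records and similarity-weighted scoring are assumptions chosen for illustration; the abstract does not specify the similarity measure, and the student names, courses, and grades are hypothetical.

```python
from math import sqrt


def cosine_similarity(a, b):
    """Cosine similarity between two {course: grade} records."""
    shared = set(a) & set(b)
    dot = sum(a[course] * b[course] for course in shared)
    norm_a = sqrt(sum(grade ** 2 for grade in a.values()))
    norm_b = sqrt(sum(grade ** 2 for grade in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def recommend(target, records, k=2):
    """Rank courses the target has not taken, using the k most similar students."""
    neighbours = sorted(
        (
            (cosine_similarity(records[target], grades), user)
            for user, grades in records.items()
            if user != target
        ),
        reverse=True,
    )
    scores = {}
    for similarity, user in neighbours[:k]:
        if similarity <= 0:
            continue  # students with nothing in common contribute nothing
        for course, grade in records[user].items():
            if course not in records[target]:
                scores[course] = scores.get(course, 0.0) + similarity * grade
    return sorted(scores, key=scores.get, reverse=True)


# Hypothetical grade records.
records = {
    "alice": {"math": 90, "physics": 80},
    "bob": {"math": 85, "physics": 75, "chemistry": 88},
    "carol": {"history": 70, "art": 95},
}
suggestions = recommend("alice", records)
```

In the pipeline the abstract describes, a list like `suggestions` would then be passed to a separate model that predicts the student's performance in each recommended course.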

Predicting subcellular location of protein with evolution information and sequence-based deep learning

BMC Bioinformatics ◽

10.1186/s12859-021-04404-0 ◽

2021 ◽

Vol 22 (S10) ◽

Author(s):

Zhijun Liao ◽

Gaofeng Pan ◽

Chao Sun ◽

Jijun Tang

Keyword(s):

Deep Learning ◽

Short Term Memory ◽

Protein Sequences ◽

Subcellular Location ◽

Protein Subcellular Location ◽

Benchmark Datasets ◽

Memory Network ◽

Scoring Matrix ◽

Protein Subcellular Locations ◽

Protein Subcellular Localization Prediction

Background: Protein subcellular localization prediction plays an important role in biology research. Since traditional methods are laborious and time-consuming, many machine learning-based prediction methods have been proposed. However, most of the proposed methods ignore the evolution information of proteins. In order to improve the prediction accuracy, we present a deep learning-based method to predict protein subcellular locations. Results: Our method utilizes not only amino acid composition sequences but also evolution matrices of proteins. Our method uses a bidirectional long short-term memory network that processes the entire protein sequence and a convolutional neural network that extracts features from protein sequences. The position-specific scoring matrix is used as a supplement to protein sequences. Our method was trained and tested on two benchmark datasets. The experimental results show that our method yields accurate results on the two datasets, with an average precision of 0.7901, ranking loss of 0.0758 and coverage of 1.2848. Conclusion: The experimental results show that our method outperforms five currently available methods. According to those experiments, our method is an acceptable alternative for predicting protein subcellular location.

A Joint Summarization and Pre-Trained Model for Review-Based Recommendation

Information ◽

10.3390/info12060223 ◽

2021 ◽

Vol 12 (6) ◽

pp. 223

Author(s):

Yi Bai ◽

Yang Li ◽

Letian Wang

Keyword(s):

Collaborative Filtering ◽

Language Model ◽

Recommendation Systems ◽

The Internet ◽

Automatic Summarization ◽

Potential Approach ◽

Fine Grained ◽

Proposed Model ◽

Summary Representation ◽

Better Than

Currently, reviews on the Internet contain abundant information about users and products, and this information is of great value to recommendation systems. As a result, review-based recommendations have begun to show their effectiveness and research value. Due to the accumulation of a large number of reviews, it has become very important to extract useful information from reviews. Automatic summarization can capture important information from a set of documents and present it in the form of a brief summary. Therefore, integrating automatic summarization into recommendation systems is a potential approach for solving this problem. Based on this idea, we propose a joint summarization and pre-trained recommendation model for review-based rating prediction. Through automatic summarization and a pre-trained language model, the overall recommendation model learns a fine-grained summary representation of the key content as well as the relationships between words and sentences in each review. The review summary representations of users and items are finally incorporated into a neural collaborative filtering (CF) framework with interactive attention mechanisms to predict the rating scores. We perform experiments on the Amazon dataset and compare our method with several competitive baselines. Experimental results show that the proposed model performs clearly better than the baselines. Relative to the current best results, the average improvement obtained on four sub-datasets randomly selected from the Amazon dataset is approximately 3.29%.

Human Activity Recognition using Fourier Transform Inspired Deep Learning Combination Model

International Journal of Sensors Wireless Communications and Control ◽

10.2174/2210327908666180727123657 ◽

2019 ◽

Vol 9 (1) ◽

pp. 16-31

Author(s):

Kyungkoo Jun

Keyword(s):

Fourier Transform ◽

Deep Learning ◽

Short Term Memory ◽

Window Size ◽

Sensor Data ◽

Data Sets ◽

Data Set ◽

Proposed Model ◽

Testing Data ◽

Labeling Scheme

Background & Objective: This paper proposes a Fourier transform inspired method to classify human activities from time series sensor data. Methods: Our method begins by decomposing a 1D input signal into 2D patterns, which is motivated by the Fourier conversion. The decomposition is aided by a Long Short-Term Memory (LSTM) network, which captures the temporal dependency in the signal and then produces encoded sequences. The sequences, once arranged into a 2D array, can represent the fingerprints of the signals. The benefit of such a transformation is that we can exploit recent advances in deep learning models for image classification, such as the Convolutional Neural Network (CNN). Results: The proposed model, as a result, is a combination of LSTM and CNN. We evaluate the model on two data sets. For the first data set, which is more standardized than the other, our model outperforms or at least equals previous work. In the case of the second data set, we devise schemes to generate training and testing data by changing the window size, the sliding size, and the labeling scheme. Conclusion: The evaluation results show that the accuracy is over 95% in some cases. We also analyze the effect of the parameters on the performance.
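The three parameters named above (window size, sliding size, and labeling scheme) can be illustrated with a short sketch that cuts a labelled sensor stream into training windows. The two labeling schemes shown ("majority" and "last"), the function name, and the toy data are assumptions for illustration; the abstract does not describe the schemes the author actually devised.

```python
from collections import Counter


def make_windows(samples, labels, window_size, step, scheme="majority"):
    """Cut a labelled time series into (window, label) pairs.

    `step` is the sliding size; `scheme` decides how one label is chosen
    for a window whose samples may carry different activity labels.
    """
    windows = []
    for start in range(0, len(samples) - window_size + 1, step):
        end = start + window_size
        window_labels = labels[start:end]
        if scheme == "majority":
            label = Counter(window_labels).most_common(1)[0][0]
        elif scheme == "last":
            label = window_labels[-1]
        else:
            raise ValueError(f"unknown labeling scheme: {scheme}")
        windows.append((samples[start:end], label))
    return windows


# Hypothetical 1D sensor readings with per-sample activity labels.
samples = [0, 1, 2, 3, 4, 5]
labels = ["walk", "walk", "walk", "run", "run", "run"]
by_majority = make_windows(samples, labels, window_size=4, step=2)
by_last = make_windows(samples, labels, window_size=4, step=2, scheme="last")
```

A smaller `step` yields more, heavily overlapping windows, while the labeling scheme matters mainly for windows that straddle a transition between activities.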
