similarity algorithm
Recently Published Documents


TOTAL DOCUMENTS

209
(FIVE YEARS 61)

H-INDEX

10
(FIVE YEARS 2)

2022 ◽  
Vol 2146 (1) ◽  
pp. 012041
Author(s):  
Yiming Niu ◽  
Wenyong Du ◽  
Zhenying Tang

Abstract With the rapid development of the Internet industry, hundreds of millions of online resources are also booming. In the information space with huge and complex resources, it is necessary to quickly help users find the resources they are interested in and save users time. At this stage, the content industry’s application of the recommendation model in the content distribution process has become the mainstream. The content recommendation model provides users with a highly efficient and highly satisfying reading experience, and solves the problem of information redundancy to a certain extent. Knowledge tag personalized dynamic recommendation technology is currently widely used in the field of e-commerce. The purpose of this article is to study the optimization of the knowledge tag personalized dynamic recommendation system based on artificial intelligence algorithms. This article first proposes a hybrid recommendation algorithm based on the comparison between content-based filtering and collaborative filtering algorithms. It mainly introduces user browsing behavior analysis and design, KNN-based item similarity algorithm design, and hybrid recommendation algorithm implementation. Finally, through algorithm simulation experiments, the effectiveness of the algorithm in this paper is verified, and the accuracy of the recommendation has been improved.


2021 ◽  
Vol 5 (4) ◽  
pp. 529
Author(s):  
Fajar Agung Nugroho ◽  
Fajar Septian ◽  
Dimas Abisono Pungkastyo ◽  
Joko Riyanto

Research and community service activities are the obligations of a lecturer that must be carried out from part of the Tri Dharma of Higher Education in addition to teaching, where research activities should have a level of innovation in the form of development or discovery of something new, but with a large number of lecturers, this results in research activities. and community service has many similarities with previous activities. At the Lembaga Penelitian dan Pengabdian kepada Masyarakat (LPPM), Pamulang University, experiencing several problems in the management of research and community service activities, namely the absence of a system used to manage research and community service activities and data related to the track record of research and community service activities that have been carried out by 2,613 lecturers who impact on the difficulty in finding data, efficiency of storage space and more importantly is the number of similar proposals in the research itself. The research carried out aims to develop an information system that can process research and community service activities and detect similarities in content by applying the Cosine Similarity algorithm, so that it can overcome existing problems. The system development method uses a waterfall. From the results of making the system that has been carried out, it shows that the system is capable of processing activities in the field of research and community service carried out by lecturers, supporting storage, and facilitating screening of proposals for research and community service activities that will be approved.


2021 ◽  
Vol 5 (2) ◽  
pp. 106-114
Author(s):  
Muhamad Aldi Rifai ◽  
Indra Gita Anugrah

The activity of writing scientific articles by academics at universities is one of the activities that is often carried out, but when writing scientific articles problems arise regarding the difficulty of finding ideas, literature studies, and reference sources that you want to use as references when writing. Sometimes when searching on a search engine, we have trouble finding the right document, because usually, the keywords we are looking for are not in the title section but another part of the structure. Since most search engines only match titles, other structures are usually excluded from matching. So that the search results that we do sometimes don't match what we want. In addition, usually, each scientific article has many language differences in its structure as found in the abstract section. To detect similarities through the structure of scientific articles, an algorithm is used, namely weighted tree similarity, and to detect language using the N-gram algorithm, then the cosine similarity algorithm can be used to check the level of similarity in keyword text with text in scientific articles.


2021 ◽  
Vol 9 (02) ◽  
pp. 60-67
Author(s):  
Nur Kharisa Umami ◽  
Setyawan Wibisono

There are still many parents who do not have sufficient understanding in terms of toddler disease. One way to provide education is the availability of a system that can be used for consultation based on the symptoms of illness experienced by toddlers and the actions needed to overcome them. The system that will be built is an expert system that can relatively provide suggestions for solutions to children's health problems using the Case Based Reasoning (CBR) method. namely an expert system that uses case-based reasoning methods, namely looking for similarities of a disease compared to a disease that has existed before. In this study, the CBR method was combined with a weighting process using the pairwise comparison method which was within the scope of the AHP (Analytic Hierarchy Process) method. In comparing consultations with old diseases that already exist in the system, and looking for similarities from the comparison results, the Sorensen similarity algorithm is used. This study resulted in weights with 3 symptom categories, namely mild symptoms with a weight of 0.09, moderate symptoms with a weight of 0.24 and severe symptoms with a weight of 0.67 and will recommend several diseases with a similarity above 0.5 and diseases with a similarity below 0.5 will be entered into the revise table to find a solution.


Processes ◽  
2021 ◽  
Vol 9 (12) ◽  
pp. 2115
Author(s):  
Yujie Bai ◽  
Dong Gao ◽  
Lanfei Peng

Hazard and operability (HAZOP) is an important safety analysis method, which is widely used in the safety evaluation of petrochemical industry. The HAZOP analysis report contains a large amount of expert knowledge and experience. In order to realize the effective expression and reuse of knowledge, the knowledge ontology is constructed to store the risk propagation path and realize the standardization of knowledge expression. On this basis, a comprehensive algorithm of ontology semantic similarity based on the ant clony optimization generalized neural network (ACO-GRNN) model is proposed to improve the accuracy of semantic comparison. This method combines the concept name, semantic distance, and improved attribute coincidence calculation method, and ACO-GRNN is used to train the weights of each part, avoiding the influence of manual weighting. The results show that the Pearson coefficient of this method reaches 0.9819, which is 45.83% higher than the traditional method. It could solve the problems of semantic comparison and matching, and lays a good foundation for subsequent knowledge retrieval and reuse.


2021 ◽  
Vol 22 (22) ◽  
pp. 12542
Author(s):  
Oxana Kazakova ◽  
Roxana Racoviceanu ◽  
Anastasiya Petrova ◽  
Marius Mioc ◽  
Adrian Militaru ◽  
...  

Twenty lupane type A-ring azepano-triterpenoids were synthesized from betulin and its related derivatives and their antitubercular activity against Mycobacterium tuberculosis, mono-resistant MTB strains, and nontuberculous strains Mycobacterium abscessus and Mycobacterium avium were investigated in the framework of AToMIc (Anti-mycobacterial Target or Mechanism Identification Contract) realized by the Division of Microbiology and Infectious Diseases, NIAID, National Institute of Health. Of all the tested triterpenoids, 17 compounds showed antitubercular activity and 6 compounds were highly active on the H37Rv wild strain (with MIC 0.5 µM for compound 7), out of which 4 derivatives also emerged as highly active compounds on the three mono-resistant MTB strains. Molecular docking corroborated with a machine learning drug-drug similarity algorithm revealed that azepano-triterpenoids have a rifampicin-like antitubercular activity, with compound 7 scoring the highest as a potential M. tuberculosis RNAP potential inhibitor. FIC testing demonstrated an additive effect of compound 7 when combined with rifampin, isoniazid and ethambutol. Most compounds were highly active against M. avium with compound 14 recording the same MIC value as the control rifampicin (0.0625 µM). The antitubercular ex vivo effectiveness of the tested compounds on THP-1 infected macrophages is correlated with their increased cell permeability. The tested triterpenoids also exhibit low cytotoxicity and do not induce antibacterial resistance in MTB strains.


2021 ◽  
Vol 21 (S9) ◽  
Author(s):  
Yani Chen ◽  
Shan Nan ◽  
Qi Tian ◽  
Hailing Cai ◽  
Huilong Duan ◽  
...  

Abstract Background Standardized coding of plays an important role in radiology reports’ secondary use such as data analytics, data-driven decision support, and personalized medicine. RadLex, a standard radiological lexicon, can reduce subjective variability and improve clarity in radiology reports. RadLex coding of radiology reports is widely used in many countries, but translation and localization of RadLex in China are far from being established. Although automatic RadLex coding is a common way for non-standard radiology reports, the high-accuracy cross-language RadLex coding is hardly achieved due to the limitation of up-to-date auto-translation and text similarity algorithms and still requires further research. Methods We present an effective approach that combines a hybrid translation and a Multilayer Perceptron weighting text similarity ensemble algorithm for automatic RadLex coding of Chinese structured radiology reports. Firstly, a hybrid way to integrate Google neural machine translation and dictionary translation helps to optimize the translation of Chinese radiology phrases to English. The dictionary is made up of 21,863 Chinese–English radiological term pairs extracted from several free medical dictionaries. Secondly, four typical text similarity algorithms are introduced, which are Levenshtein distance, Jaccard similarity coefficient, Word2vec Continuous bag-of-words model, and WordNet Wup similarity algorithms. Lastly, the Multilayer Perceptron model has been used to synthesize the contextual, lexical, character and syntactical information of four text similarity algorithms to promote precision, in which four similarity scores of two terms are taken as input and the output presents whether the two terms are synonyms. Results The results show the effectiveness of the approach with an F1-score of 90.15%, a precision of 91.78% and a recall of 88.59%. The hybrid translation algorithm has no negative effect on the final coding, F1-score has increased by 21.44% and 8.12% compared with the GNMT algorithm and dictionary translation. Compared with the single similarity, the result of the MLP weighting similarity algorithm is satisfactory that has a 4.48% increase compared with the best single similarity algorithm, WordNet Wup. Conclusions The paper proposed an innovative automatic cross-language RadLex coding approach to solve the standardization of Chinese structured radiology reports, that can be taken as a reference to automatic cross-language coding.


2021 ◽  
Vol 12 ◽  
Author(s):  
Kun Sun ◽  
Xiaofei Lu

Previous studies of the lexical psycholinguistic properties (LPPs) in second language (L2) production have assessed the degree of an LPP dimension of an L2 corpus by computing the mean ratings of unique content words in the corpus for that dimension, without considering the possibility that learners at different proficiency levels may perceive the degree of that dimension of the same words differently. This study extended a dynamic semantic similarity algorithm to estimate the degree of five different LPP dimensions of several sub-corpora of the Education First-Cambridge Open Language Database representing L2 English learners at different proficiency levels. Our findings provide initial evidence for the validity of the algorithm for assessing the LPPs in L2 production and contribute useful insights into between-proficiency relationships and cross-proficiency differences in the LPPs in L2 production as well as the relationships among different LPP dimensions.


Author(s):  
Celestine Iwendi ◽  
Ebuka Ibeke ◽  
Harshini Eggoni ◽  
Sreerajavenkatareddy Velagala ◽  
Gautam Srivastava

The creation of digital marketing has enabled companies to adopt personalized item recommendations for their customers. This process keeps them ahead of the competition. One of the techniques used in item recommendation is known as item-based recommendation system or item–item collaborative filtering. Presently, item recommendation is based completely on ratings like 1–5, which is not included in the comment section. In this context, users or customers express their feelings and thoughts about products or services. This paper proposes a machine learning model system where 0, 2, 4 are used to rate products. 0 is negative, 2 is neutral, 4 is positive. This will be in addition to the existing review system that takes care of the users’ reviews and comments, without disrupting it. We have implemented this model by using Keras, Pandas and Sci-kit Learning libraries to run the internal work. The proposed approach improved prediction with [Formula: see text] accuracy for Yelp datasets of businesses across 11 metropolitan areas in four countries, along with a mean absolute error (MAE) of [Formula: see text], precision at [Formula: see text], recall at [Formula: see text] and F1-Score at [Formula: see text]. Our model shows scalability advantage and how organizations can revolutionize their recommender systems to attract possible customers and increase patronage. Also, the proposed similarity algorithm was compared to conventional algorithms to estimate its performance and accuracy in terms of its root mean square error (RMSE), precision and recall. Results of this experiment indicate that the similarity recommendation algorithm performs better than the conventional algorithm and enhances recommendation accuracy.


2021 ◽  
Vol 14 (1) ◽  
pp. 93-100
Author(s):  
Addini Yusmar ◽  
Luh Kesuma Wardhani ◽  
Hendra Bayu Suseno

In 2018, the Ministry of Industry (Kemenperin) stated that the food and beverage sector contributed 6.34% of the national gross domestic product (GDP). Currently, culinary information can be easily found, both in print and online. The amount of information available sometimes makes people over-informed, making it difficult to choose a restaurant based on their preferences. To assist consumers in selecting a restaurant, we need a system that can provide several recommendations. This study aims to implement the item-based Collaborative Filtering method using the Adjusted Cosine Similarity algorithm on a restaurant recommendation system. The test was carried out with 40 samples from UIN Syarif Hidayatullah Jakarta using purposive sampling because the sample was selected based on specific criteria, and 40 respondents can be said to be correct because of the minimum number of respondents is 30. The accuracy test uses precision, and the determination of the error value uses MAE. The analysis of the research results used three scenarios, which are 5, 20, and 40 users. The third scenario produces the best precision and MAE values. Precision is better if the precision value is close to 1, and MAE is getting better if the MAE value is getting closer to 0. So it can be concluded that the Item-Based method with the Adjusted Cosine algorithm has the best accuracy and error values when the number of users grows.


Sign in / Sign up

Export Citation Format

Share Document