average similarity
Recently Published Documents


TOTAL DOCUMENTS

87
(FIVE YEARS 32)

H-INDEX

14
(FIVE YEARS 1)

2022 ◽  
Vol 24 (1) ◽  
pp. 139-140
Author(s):  
Dr.S. Dhanabal ◽  
◽  
Dr.K. Baskar ◽  
R. Premkumar ◽  
◽  
...  

Collaborative filtering (CF) and mass diffusion (MD) algorithms have been applied successfully to recommender systems for years and help address information overload. However, both suffer from data sparsity, and both tend to recommend popular products, so their recommendations lack diversity and fit real-world use poorly. In this paper, we propose a user internal similarity-based recommendation algorithm (UISRC). UISRC first computes the item-item similarity matrix, then takes the average similarity between the items each user has purchased as that user’s internal similarity. This internal similarity is then used to adjust the recommendation scores from which predictions and recommendations are made. Simulation experiments on the RYM and Last.FM datasets show that UISRC achieves better recommendation accuracy and diversity than traditional CF and MD algorithms.
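As a rough illustration of the "internal similarity" idea in this abstract, the sketch below averages pairwise item-item similarities over the items a user has bought. The abstract does not say which similarity measure UISRC uses; cosine similarity over sets of buyers is an assumption here, and the function names and toy data are hypothetical.

```python
from itertools import combinations
from math import sqrt


def item_similarity(purchases):
    """Cosine similarity between items, given a dict user -> set of items.

    Assumption: each item is a binary vector over users (bought / not bought).
    """
    buyers = {}
    for user, items in purchases.items():
        for item in items:
            buyers.setdefault(item, set()).add(user)
    sim = {}
    # Keys are (a, b) pairs with a < b, so each pair is stored once.
    for a, b in combinations(sorted(buyers), 2):
        common = len(buyers[a] & buyers[b])
        sim[(a, b)] = common / sqrt(len(buyers[a]) * len(buyers[b]))
    return sim


def internal_similarity(items, sim):
    """Average pairwise similarity among one user's purchased items."""
    pairs = list(combinations(sorted(items), 2))
    if not pairs:
        return 0.0  # a single item has no pairs to average
    return sum(sim[p] for p in pairs) / len(pairs)


purchases = {"u1": {"a", "b"}, "u2": {"a", "b"}, "u3": {"a", "c"}}
sim = item_similarity(purchases)
for user, items in purchases.items():
    print(user, round(internal_similarity(items, sim), 3))
```

A user whose items are often bought together gets a high internal similarity; how that value then adjusts the recommendation score is not specified in the abstract and is left out.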


Author(s):  
M. S. Ranjani ◽  
P. Jayamani

The research entails morphological characterization of pigeonpea genotypes based on qualitative traits, which aids varietal description and helps ensure genetic purity. The study was conducted at the Department of Pulses, Tamil Nadu Agricultural University, during kharif 2019 and rabi 2019-2020. The experiment was laid out in a randomized complete block design with two replications. The 68 short-duration pigeonpea genotypes were characterized and grouped based on 17 qualitative traits. The morphological characterization revealed more variation in the following traits: pattern of streaks on the standard petal, plant height, seed colour, seed shape and seed size. The 68 genotypes were subjected to cluster analysis and grouped into four major clusters with an average similarity of 80%. The similarity coefficient ranged from 0.65 to 1.00. Cluster I consisted of 60 genotypes and formed two sub-groups: sub-group I had 59 genotypes at 84 per cent similarity, whereas sub-group II consisted of the single genotype ICPL19050. Cluster II was made up of six genotypes and was divided into two sub-groups at 84 per cent similarity. Clusters III and IV were solitary clusters, each with a single genotype. Characterizing genotypes by specific traits could be used to identify genotypes, maintain genetic purity and support future breeding programmes.


2021 ◽  
Vol 49 (1) ◽  
Author(s):  
Souad Larabi-Marie-Sainte ◽  
◽  
Betool S. Alnamlah ◽  
Norah F. Alkassim ◽  
Sara Y. Alshathry ◽  
...  

Automated recitation plays an important role in improving self-learning. It is based on speech/text recognition. Research in Arabic speech recognition is very limited, and the few existing applications are based only on the Holy Qur’an. This article proposes a new system (Samee’a) to facilitate memorizing any kind of text, such as poems, speeches and the Holy Qur’an. The Samee’a system uses the Google Cloud Speech Recognition API to convert Arabic speech to text and the Jaro-Winkler distance algorithm to determine the similarity between the original and converted texts. The system was tested using 70 collected files ranging from 12 to 400 words and some chapters from the Holy Qur’an. The average similarity reached 83.33% for the 70 files and 69% for the selected chapters of the Holy Qur’an. These results improved to 91.33% and 95.66% after applying preprocessing operations to the text files and the Holy Qur’an, respectively. To validate the results, two comparison studies were performed. The Jaro-Winkler distance was successfully compared with the cosine and Euclidean distances. In addition, the proposed system outperformed the related work, with a similarity improvement of up to 5% on section 30 of the Holy Qur’an. Finally, user experience testing was carried out by 10 users of different ages (between 5 and 50 years old) using short texts and some short chapters of the Holy Qur’an. The proposed system proved its efficiency.
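To make the text-comparison step concrete, here is a minimal sketch of the standard Jaro-Winkler similarity in plain Python. It is not the Samee’a implementation; the scaling factor of 0.1 and the 4-character prefix cap are the values conventionally used for this measure and are assumed here, as are the function names.

```python
def jaro(s, t):
    """Jaro similarity between two strings, in [0, 1]."""
    if s == t:
        return 1.0
    if not s or not t:
        return 0.0
    # Characters match only if they are equal and close enough in position.
    window = max(max(len(s), len(t)) // 2 - 1, 0)
    s_hit = [False] * len(s)
    t_hit = [False] * len(t)
    matches = 0
    for i, ch in enumerate(s):
        lo, hi = max(0, i - window), min(len(t), i + window + 1)
        for j in range(lo, hi):
            if not t_hit[j] and t[j] == ch:
                s_hit[i] = t_hit[j] = True
                matches += 1
                break
    if matches == 0:
        return 0.0
    # Transpositions: matched characters that appear in a different order.
    s_matched = [c for c, hit in zip(s, s_hit) if hit]
    t_matched = [c for c, hit in zip(t, t_hit) if hit]
    transpositions = sum(a != b for a, b in zip(s_matched, t_matched)) // 2
    return (matches / len(s) + matches / len(t)
            + (matches - transpositions) / matches) / 3


def jaro_winkler(s, t, p=0.1, max_prefix=4):
    """Jaro similarity boosted for strings that share a common prefix."""
    j = jaro(s, t)
    prefix = 0
    for a, b in zip(s, t):
        if a != b or prefix == max_prefix:
            break
        prefix += 1
    return j + prefix * p * (1 - j)


print(round(jaro_winkler("MARTHA", "MARHTA"), 4))  # 0.9611
```

In a recitation setting the two arguments would be the reference text and the recognized transcript; the preprocessing the abstract mentions would be applied to both strings before the comparison.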


2021 ◽  
Author(s):  
Michael Doane ◽  
Colton Johnson ◽  
Shaili Johri ◽  
Emma N. Kerr ◽  
Megan M. Morris ◽  
...  

Abstract Background: The epidermis of Chondrichthyan fishes consists of dermal denticles with production of minimal but protein-rich mucus that influences the attachment and biofilm development of microbes, facilitating a unique epidermal microbiome. Here, we use metagenomics to provide the taxonomic and functional characterization of the epidermal microbiome of Triakis semifasciata (leopard shark) across three time-points to identify links between microbial groups and host metabolism. Our aims include 1) describing the variation of microbiome taxa over time and identifying those members which are recurrent (present across all time-points); 2) investigating the relationship between the recurrent and flexible taxa (those which are not found consistently across time-points); 3) describing the functional compositions of the microbiome which may suggest links with the host metabolism; and 4) identifying whether the metabolisms are shared across microbial genera or found in specific taxa. Results: Microbial members of the microbiome showed high similarity between all individuals (average similarity: 82.74), with the relative abundance of those members varying across years, suggesting flexibility of taxa in the microbiome. One hundred and eighty-eight genera were identified as recurrent, with Pseudomonas, Erythrobacter, Alcanivorax, Marinobacter and Sphingopyxis consistently abundant across time-points, while Limnobacter and Xylella exhibited switching patterns with high relative abundance in 2013, Sphingobium and Sphingomonas in 2015, and Alteromonas, Leeuwenhoekiella, Gramella and Maribacter in 2017. Of the 188 genera identified as recurrent, the 19 with the highest relative abundance formed three recurrent groups. The microbiome also displayed high functional similarity between individuals (average similarity: 97.65), with gene function composition being consistent across time-points.
Conclusion: These results show that while the presence of microbial genera is consistent across time-points, their abundances do fluctuate. Functions, however, remain stable across time-points; thus, we suggest the leopard shark microbiomes exhibit functional redundancy. We hypothesize this may be the result of the host’s epidermal attributes structuring the microbiome. In addition, we show the co-existence of many microbial genera that carry genes which may enable the microbes to use the nutrients provided by the elasmobranch’s metabolism.


2021 ◽  
Author(s):  
Zulfaqar Sa’adi ◽  
Ahmad Zuhdi Ismail ◽  
Zulkifli Yusop ◽  
Zainab Mohamad Yusof

Abstract This study aimed at mapping the Kota Tinggi flood event of 2006/07, which caused massive damage to properties and the environment. The flood was associated with unusually intense and continuous rainfall. A reliable floodplain mapping technique is therefore crucial for improving flood control strategies and preparing an evacuation plan. The main objective of this study is to assess the effect of tide on flood modeling. The inundated areas were mapped for various annual recurrence intervals (ARIs) using peak flow data from 1965 to 2010. The study used Light Detection and Ranging (LiDAR) data for flood modeling. HEC-HMS, HEC-RAS, and HEC-GeoRAS were used to develop the flood model. The results reaffirm that the GEV model is the best for fitting the annual flood. The HEC-HMS hydrologic model was calibrated and validated using observed hydrographs from Sep 2002 and Jan 2003, respectively. After successful calibration and validation, the model was used to simulate the flood hydrograph of Jan 2007. The modeling took the tidal effect into account. When the tidal effect was not considered, the simulated flood depth was 43% lower than the observed flood. Including the tidal effect reduced the simulation error, giving an average similarity of 91.4%. The simulation results show that the river starts to overtop its banks for ARIs exceeding 25 years.


Author(s):  
Annisa Fitriani ◽  
Herman Hamdani ◽  
. Zahidah ◽  
Heti Herawati

This research aims to determine the structure of the macrozoobenthos community, including variety, diversity, uniformity, the coefficient of similarity and the Family Biotic Index (FBI), together with the physical and chemical parameters of the waters, and to determine the pollution status of Situ Cileunca. The research was conducted in Situ Cileunca, Bandung Regency, West Java, from January to February 2021 using a survey method. Data were collected by purposive sampling at 5 stations on 4 sampling occasions and analysed descriptively. The lowest average Biochemical Oxygen Demand (BOD5) value was found at station 4 (8.90 ± 4.92 mg/L) and the highest at station 5 (12.99 ± 6.60 mg/L). Three species of macrozoobenthos live in the waters of Situ Cileunca: Pomacea canaliculata, Filopaludina javanica and Anodonta woodiana. Macrozoobenthos abundance ranged from 400 to 704 ind/m². The diversity index of macrozoobenthos fell in the low category, ranging from 0.17 to 0.28. The uniformity value ranged from 0.36 to 0.59, classified as moderate at all stations except station 4, where it was classified as low. The lowest average similarity coefficient, 0.13, was found between stations 3 and 5, and the highest, 0.30, between stations 4 and 5. The FBI value for macrozoobenthos ranged from 6.05 to 6.18, in the rather poor category. Situ Cileunca is moderately polluted overall: the BOD5 value is categorized as moderately polluted, the diversity (H') as heavily polluted, and the FBI value as rather poor.
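The diversity (H') and uniformity figures in this abstract can be illustrated with a short sketch of the Shannon index and Pielou's evenness. The abstract does not state its formulas or log base, so both are assumptions here; the base-10 default was chosen because the reported uniformity range (0.36 to 0.59) equals the reported diversity range (0.17 to 0.28) divided by log10(3), which is consistent with, but does not prove, a base-10 index over the three species found. The counts below are invented for illustration only.

```python
from math import log


def shannon(counts, base=10):
    """Shannon diversity H' = -sum(p_i * log(p_i)) over species proportions."""
    total = sum(counts)
    props = [c / total for c in counts if c > 0]
    return -sum(p * log(p, base) for p in props)


def evenness(counts, base=10):
    """Pielou's evenness: H' divided by its maximum, log(number of species)."""
    species = sum(1 for c in counts if c > 0)
    if species < 2:
        return 0.0  # evenness is undefined for a single species; report 0
    return shannon(counts, base) / log(species, base)


# Hypothetical counts for three species at one station (not data from the study).
station = [120, 6, 4]
print(round(shannon(station), 2), round(evenness(station), 2))
```

A community dominated by one species, as in the toy counts, gives a low H' and low evenness, which is the pattern the abstract reports.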


2021 ◽  
Vol 5 (1) ◽  
pp. 82-94
Author(s):  
Ramadani Ramadani ◽  
Ramon Muhandaz

Students need the ability to think creatively in mathematics learning activities, because creativity generates ideas for solving mathematical problems and so helps achieve the expected learning objectives. In addition, mathematical resilience is needed to make students more persistent, unwilling to give up and happy to be challenged, so that they no longer see mathematics as something to be afraid of. This study aims to determine the effect of an open-ended approach on mathematical creative thinking ability, based on the mathematical resilience of Madrasah Tsanawiyah (MTs) students. The research is experimental and uses a factorial design. A one-way ANOVA test on the selected population showed that the classes had similar averages, so the sample was drawn by cluster random sampling: class VIII-1 was selected as the experimental class and class VIII-2 as the control class. The results showed: 1) there were differences in mathematical creative thinking ability between students taught with open-ended learning and students taught with conventional learning; 2) there were differences in mathematical creative thinking ability between students with high, medium and low mathematical resilience; 3) there was no interaction between the open-ended learning approach and students' mathematical resilience with respect to mathematical creative thinking ability. The results of this study can therefore be used to improve mathematics learning in class.


Author(s):  
Ravisha Raina ◽  
Anita Grewal

Rose is a woody perennial of the genus Rosa within the family Rosaceae. In this study, the phylogenetic relationship among seven different genotypes of rose (Rosa) was determined using ISSR primers and morphological parameters. DNA was isolated using the CTAB method with a CTAB concentration of 0.5%. Out of 50 ISSR primers, 16 gave a reproducible banding pattern with all seven genotypes, while the others failed to produce polymorphism. A total of 108 bands were obtained, with an average polymorphism of 54.69% and an average of 9.06 bands per primer. The average similarity among the seven genotypes was 0.69 using Jaccard’s similarity coefficient. On the basis of polymorphism, cluster analysis divided the genotypes into two major groups, separating one genotype (light pink) from the other six, and principal coordinate analysis confirmed the results. A parallel correlation was found between the morphological parameter (flower diameter) and the results of the ISSR analysis, as both divided the seven genotypes into two main groups.
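For readers unfamiliar with how an average Jaccard similarity is obtained from marker data, the sketch below scores each genotype as the set of bands it shows and averages the coefficient over all genotype pairs. The genotype names and band sets are hypothetical and are not taken from the study.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard coefficient: shared bands divided by bands present in either."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def average_similarity(profiles):
    """Mean Jaccard coefficient over all pairs of genotypes."""
    pairs = list(combinations(profiles.values(), 2))
    return sum(jaccard(a, b) for a, b in pairs) / len(pairs)


# Hypothetical band presence per genotype (band IDs are arbitrary integers).
profiles = {
    "genotype_1": {1, 2, 3},
    "genotype_2": {2, 3, 4},
    "genotype_3": {1, 2, 3},
}
print(round(average_similarity(profiles), 3))
```

The same pairwise matrix is what a cluster analysis would take as input; only the averaging step is shown here.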


2021 ◽  
Vol 7 (2) ◽  
pp. 119
Author(s):  
Wahyu Hidayat ◽  
Ema Utami ◽  
Anggit Dwi Hartanto

Plagiarism in Indonesian-language documents has been addressed in related research on detecting the degree of similarity between documents. That research used document similarity detection algorithms based on fingerprinting, such as the Winnowing algorithm. Implementations of the Winnowing algorithm differ in their parameters: some use k-grams and others use n-grams. This study compares the performance of these different parameters in the string-cutting stage of the Winnowing algorithm to find out which parameter performs best. With k-grams, similarity values are high, but they fall as k increases, with average results of 0.5299 at k = 2, 0.1689 at k = 3, 0.0283 at k = 5 and 0.0095 at k = 7. With n-gram string cutting, the average similarity is 0.0683 for unigrams, 0.003 for bigrams, and 0.000 for trigrams and four-grams. The processing times of k-grams and n-grams did not differ significantly; both were predominantly around 6 seconds.
Keywords: Winnowing algorithm, Jaccard Similarity, Fingerprint, K-gram, N-gram
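The pipeline named in the keywords (k-grams, Winnowing fingerprints, Jaccard similarity) can be sketched as follows. This is a simplified illustration, not the authors' implementation: it keeps only the minimum hash of each window and drops the positions that full Winnowing records, uses CRC32 as a stand-in hash, and the default k and window size are arbitrary assumptions.

```python
import zlib


def kgrams(text, k):
    """Character k-grams of the text after removing case and non-alphanumerics."""
    cleaned = "".join(ch.lower() for ch in text if ch.isalnum())
    return [cleaned[i:i + k] for i in range(len(cleaned) - k + 1)]


def winnow(text, k=5, window=4):
    """Fingerprint set: the minimum k-gram hash in each sliding window."""
    hashes = [zlib.crc32(g.encode("utf-8")) for g in kgrams(text, k)]
    if not hashes:
        return set()
    if len(hashes) <= window:
        return {min(hashes)}
    return {min(hashes[i:i + window]) for i in range(len(hashes) - window + 1)}


def similarity(text_a, text_b, k=5, window=4):
    """Jaccard similarity between the two documents' fingerprint sets."""
    a, b = winnow(text_a, k, window), winnow(text_b, k, window)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


doc_a = "Winnowing selects a subset of k-gram hashes as the document fingerprint."
doc_b = "A subset of k-gram hashes is selected by winnowing as the fingerprint."
for k in (2, 3, 5, 7):
    print(k, round(similarity(doc_a, doc_b, k=k), 4))
```

Larger k makes each gram more specific, so fewer fingerprints coincide between two texts, which matches the direction of the trend the abstract reports for k = 2 through 7.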

