The Hybrid of Jaro-Winkler and Rabin-Karp Algorithm in Detecting Indonesian Text Similarity

2021 ◽  
Vol 6 (1) ◽  
pp. 88
Author(s):  
Muhamad Arief Yulianto ◽  
Nurhasanah Nurhasanah

String matching is one family of similarity techniques; it can measure how similar two texts are. Rabin-Karp is a string-matching algorithm that is well suited to multiple-pattern searching but performs less well on single-pattern matching. The Jaro-Winkler distance algorithm performs approximate string matching and gives its best results when comparing two short strings. This study aims to overcome the shortcomings of Rabin-Karp in single-pattern search by combining the Jaro-Winkler and Rabin-Karp algorithms. The combined process starts with pre-processing and forming the K-Gram data, followed by calculating a hash value for each K-Gram with the Rabin-Karp algorithm. Finding identical hash values and calculating the percentage of similarity is then done with the Jaro-Winkler algorithm. Testing compared words, sentences, and journal abstracts that had been rearranged. With the combined algorithm, the average similarity percentage increased for words, whereas it decreased for sentences and journal abstracts. The experimental results showed that combining the Jaro-Winkler algorithm with the Rabin-Karp algorithm can improve the accuracy of text similarity detection.
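The building blocks named in this abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not specify how the two algorithms are merged, so the sketch only shows pre-processing, K-Gram hashing with a Rabin-Karp rolling hash, and a Jaro-Winkler score side by side. The function names, the `base` and `mod` constants, and the Dice-style `hash_overlap` measure are all assumptions made for illustration.

```python
import re


def preprocess(text):
    """Lowercase and drop everything except letters and digits (assumed pre-processing)."""
    return re.sub(r"[^a-z0-9]", "", text.lower())


def kgram_hashes(text, k, base=256, mod=1_000_003):
    """Rabin-Karp rolling hash of every K-Gram (length-k substring) of text."""
    if k <= 0 or len(text) < k:
        return []
    high = pow(base, k - 1, mod)  # weight of the character leaving the window
    h = 0
    for ch in text[:k]:
        h = (h * base + ord(ch)) % mod
    hashes = [h]
    for i in range(k, len(text)):
        # Remove the oldest character, shift, then add the new character.
        h = ((h - ord(text[i - k]) * high) * base + ord(text[i])) % mod
        hashes.append(h)
    return hashes


def hash_overlap(a, b, k=3):
    """Dice coefficient over the two K-Gram hash sets (hypothetical similarity measure)."""
    ha, hb = set(kgram_hashes(a, k)), set(kgram_hashes(b, k))
    if not ha and not hb:
        return 0.0
    return 2 * len(ha & hb) / (len(ha) + len(hb))


def jaro(s1, s2):
    """Jaro similarity in [0, 1]."""
    if s1 == s2:
        return 1.0
    len1, len2 = len(s1), len(s2)
    if len1 == 0 or len2 == 0:
        return 0.0
    window = max(max(len1, len2) // 2 - 1, 0)
    matched1, matched2 = [False] * len1, [False] * len2
    matches = 0
    for i, ch in enumerate(s1):
        for j in range(max(0, i - window), min(len2, i + window + 1)):
            if not matched2[j] and s2[j] == ch:
                matched1[i] = matched2[j] = True
                matches += 1
                break
    if matches == 0:
        return 0.0
    # Count matched characters that appear in a different order.
    k = out_of_order = 0
    for i in range(len1):
        if matched1[i]:
            while not matched2[k]:
                k += 1
            if s1[i] != s2[k]:
                out_of_order += 1
            k += 1
    t = out_of_order / 2
    return (matches / len1 + matches / len2 + (matches - t) / matches) / 3


def jaro_winkler(s1, s2, p=0.1, max_prefix=4):
    """Jaro similarity boosted by the length of the common prefix (up to max_prefix)."""
    j = jaro(s1, s2)
    prefix = 0
    for a, b in zip(s1, s2):
        if a != b or prefix == max_prefix:
            break
        prefix += 1
    return j + prefix * p * (1 - j)


print(round(jaro_winkler("MARTHA", "MARHTA"), 4))  # 0.9611
```

The prefix bonus in `jaro_winkler` is what makes the measure favour short strings that agree at the start, which is consistent with the abstract's remark that it works best on two short strings.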

Author(s):  
Bobby Aris Sandy ◽  
Paska Marto Hasugian

Searching is the process of selecting the information needed from an existing collection of data; data searching is often also called table look-up or information storage and retrieval. Technology is now developing very rapidly, and smartphone applications are one example. Demand for smartphone applications, particularly on Android, is increasing sharply. Even though presentation is limited to the screen, smartphones are quite efficient for users with high mobility and are affordable across all levels of society. The method used is string searching, often also called string matching. Among string-matching algorithms is the Crochemore-Perrin algorithm, which factors a pattern into two parts, a left pattern and a right pattern. This method is well suited to dictionary search applications for Latin terms for flora and fauna.


2021 ◽  
Vol 1 (2) ◽  
pp. 54-60
Author(s):  
Candra Irawan ◽  
Mudafiq Riyan Pratama

String matching is the process of matching one text against another, also known as text search. Several algorithms can be used for string matching, including the Boyer-Moore algorithm and the Brute Force algorithm. The Boyer-Moore algorithm is a string-matching algorithm published by Robert S. Boyer and J. Strother Moore in 1977 and is considered the most efficient algorithm in general applications. It starts matching characters from the right end of the pattern. The Brute Force algorithm, by contrast, tries the pattern at every text position from 0 to n-m, where n is the text length and m the pattern length, to find an occurrence of the pattern in the text. The two algorithms thus follow different procedures in the search process. This article presents a comparative analysis of the performance of the Boyer-Moore and Brute Force algorithms in a case study of searching the Android-based Big Indonesian Dictionary (KBBI). Searches are carried out on words and on word descriptions. The results indicate that, on the running-time criterion, the Brute Force algorithm is faster than the Boyer-Moore algorithm: total running time for Brute Force was 168.3 ms on words and 6994.16 ms on word descriptions, while Boyer-Moore took 304.7 ms on words and 8654.77 ms on word descriptions. On the related-keywords criterion, the two algorithms display the same list of related keywords.
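The two search procedures described above can be sketched as follows. This is an illustrative sketch rather than the code used in the study: the Boyer-Moore variant shown uses only the bad-character rule (the full algorithm also has a good-suffix rule), and the function names and sample strings are assumptions.

```python
def brute_force_search(text, pattern):
    """Try the pattern at every shift from 0 to n-m; return the first match index or -1."""
    n, m = len(text), len(pattern)
    for shift in range(n - m + 1):
        if text[shift:shift + m] == pattern:
            return shift
    return -1


def boyer_moore_search(text, pattern):
    """Simplified Boyer-Moore (bad-character rule only); return first match index or -1."""
    n, m = len(text), len(pattern)
    if m == 0:
        return 0
    # Rightmost position of each character in the pattern.
    last = {ch: i for i, ch in enumerate(pattern)}
    shift = 0
    while shift <= n - m:
        j = m - 1
        # Compare from the right end of the pattern towards the left.
        while j >= 0 and pattern[j] == text[shift + j]:
            j -= 1
        if j < 0:
            return shift
        # Align the mismatched text character with its last occurrence in the pattern.
        shift += max(1, j - last.get(text[shift + j], -1))
    return -1


sample = "aplikasi kamus besar"
print(brute_force_search(sample, "kamus"), boyer_moore_search(sample, "kamus"))
```

Both functions return the same index; they differ only in how many character comparisons they make. On short dictionary entries the preprocessing and bookkeeping of Boyer-Moore may outweigh the comparisons it saves, which is one possible reading of the running times reported above.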


2021 ◽  
Vol 3 (4) ◽  
pp. 385-394
Author(s):  
Usman Nurhasan ◽  
Erninda Ristiani ◽  
Samsul Islam Baddrisshofa

The School Literacy Movement (GLS) aims to foster youth character through a culture of literacy (reading and writing). However, with the Covid-19 outbreak, Indonesian education needs to use online media to keep learning going. Many types of platforms are used as online learning media, but none of them supports school literacy activities, so these activities do not run as usual. To address this problem, an application was created that makes it easy for literacy activities to take place online. Students can access the application to do online literacy via a laptop or smartphone, and it makes it easier for teachers to monitor the course of online literacy programs. Functional testing of all features gave a 100% valid result, and user tests gave an average percentage of more than 80%. The test results indicate that this application is accepted by students, teachers and admins at State High School 1 Geger Madiun as a way to make literacy activities more effective and efficient.


Author(s):  
I Made Ardwi Pradnyana ◽  
I Ketut Resika Arthana ◽  
I Gusti Bagus Hari Sastrawan

Delivering learning materials with animal themes, especially wild animals, to early childhood learners is a challenge for teachers. Two-dimensional media in the form of monotonous images can reduce children's interest in learning. Bringing wild animals into class or taking the children to the zoo requires considerable cost and time and carries risk. To address these problems, the authors developed an Android-based application that presents fourteen species of wild animals in 3D using Virtual Reality (VR) technology. The application was developed using a development research method with the ADDIE model. The VR application displays wild animal animations complete with the sounds and environment of their habitat, as well as narrated descriptions and food, all of which can be viewed in 3D and VR modes. The test results showed that the application received a positive response from users, especially children at TK Negeri Pembina Singaraja. The average percentage for the user response test was 88.50%, rated very good: children could recognise the types, movements, sounds and habitats of wild animals and could use the application easily.


2018 ◽  
Vol 2 (1) ◽  
Author(s):  
Rini Susilowati

This study is classroom action research. Its purpose is to enhance students' critical thinking ability by applying Problem Based Learning with audio-visual media. Data were collected using observation sheets and evaluation tests. On the observation sheets, the overall average percentage of students' critical thinking ability was 13.8% in the pre-cycle, rose to 69% in cycle I, and rose again to 96.5% in cycle II. On the evaluation tests, the overall percentage rose from 44.8% in cycle I to 96.6% in cycle II. The results thus showed that applying Problem Based Learning with audio-visual media can enhance students' critical thinking ability.


2019 ◽  
Vol 178 (3) ◽  
pp. 71-75
Author(s):  
Piotr ORLIŃSKI ◽  
Marcin WOJS ◽  
Mateusz BEDNARSKI ◽  
Mieczysław SIKORA

The article presents the results of empirical research, and their analysis, on the impact of diesel oil and of diesel oil mixed with bioethanol on coking of the test injector nozzles of the PSA XUD9 engine. The research covered three fuels: diesel fuel as the base fuel, and diesel oil mixed with bioethanol as ONE10 (10% bioethanol plus diesel oil (V/V)) and ONE20 (20% bioethanol plus diesel oil (V/V)). The tests were conducted according to CEC PF-023, developed by the CEC (Coordinating European Council). Each fuel was tested using a new set of injectors. The propensity of the fuel to coke the injector tips was expressed as the percentage reduction in air flow through the nozzles of each injector at the given needle lift increments. The test result was the average percentage airflow reduction for all nozzles at 0.1 mm needle lift increments, measured according to ISO 4010 "Diesel engines. Calibrating nozzle, delay pintle type". The results for the individual injectors of the test engine, in terms of deposit formation from the flowing fuel, showed a lower tendency to coke the injectors with the diesel fuel-bioethanol blends than with pure diesel oil. Based on the CEC PF-023 test, the level of contamination of the tested injectors is about 3% lower for ONE10 fuel and about 4% lower for ONE20 fuel than for diesel fuel.


2012 ◽  
Vol 8 (1) ◽  
Author(s):  
Ed O. Omictin III ◽  
Rodrigo Gante Jr ◽  
Robby Rosa P. Villaflores ◽  
Ma. Bryne Catherine M. Marchan ◽  
Rodolfo T. Noblefranca Jr

The Aho-Corasick Algorithm (ACA) is a dictionary-matching algorithm that locates elements of a finite set of strings within an input text. It matches all patterns "at once", so the complexity of the algorithm is linear in the length of the patterns plus the length of the searched text plus the number of output matches. This paper discusses the applicability of the Aho-Corasick algorithm in identifying test validity using the standard Guidelines in Evaluating Tests. A proposed Quiz-Zone system was developed to evaluate and test the applicability of the algorithm. Quiz-Zone allows the user to create an exam and checks the test's validity. It also allows the user to choose among five types of exam: Matching Type, Multiple Choice, Essay, True or False and Short-Answer. The researchers found that there are some rules for identifying test validity to which ACA cannot be applied. Keywords: Aho-Corasick algorithm, string-matching algorithm, test validity
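The "all patterns at once" behaviour can be sketched with a small automaton. This is a minimal illustration of the general technique, not the Quiz-Zone implementation; the function names and the sample patterns are assumptions.

```python
from collections import deque


def build_automaton(patterns):
    """Build the trie, failure links and output lists for a set of patterns."""
    goto = [{}]   # goto[state][char] -> next state
    fail = [0]    # fail[state] -> state for the longest proper suffix that is a trie prefix
    out = [[]]    # out[state] -> patterns that end at this state
    for pat in patterns:
        state = 0
        for ch in pat:
            if ch not in goto[state]:
                goto.append({})
                fail.append(0)
                out.append([])
                goto[state][ch] = len(goto) - 1
            state = goto[state][ch]
        out[state].append(pat)
    # Breadth-first traversal so that shallower states are finished first.
    queue = deque(goto[0].values())
    while queue:
        state = queue.popleft()
        for ch, nxt in goto[state].items():
            queue.append(nxt)
            f = fail[state]
            while f and ch not in goto[f]:
                f = fail[f]
            fail[nxt] = goto[f].get(ch, 0)
            # Inherit patterns that end at the failure state.
            out[nxt] = out[nxt] + out[fail[nxt]]
    return goto, fail, out


def find_all(text, patterns):
    """Return (start_index, pattern) for every occurrence of every pattern in text."""
    goto, fail, out = build_automaton(patterns)
    state = 0
    hits = []
    for i, ch in enumerate(text):
        while state and ch not in goto[state]:
            state = fail[state]
        state = goto[state].get(ch, 0)
        for pat in out[state]:
            hits.append((i - len(pat) + 1, pat))
    return hits


print(sorted(find_all("ushers", ["he", "she", "his", "hers"])))
```

The text is scanned once from left to right; the failure links replace the restarts that a separate search per pattern would need, which is where the linear bound quoted above comes from.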


Academia Open ◽  
2021 ◽  
Vol 5 ◽  
Author(s):  
Aditya Kurniawan ◽  
Yulian Findawati

Voting is an important means of choosing leaders in a democratic country. Village head elections in Indonesia still use conventional voting with paper ballots. Conventional voting has several drawbacks, including the lack of any guarantee of the authenticity of votes, so that people often believe the results are manipulated. In addition, conventional elections are considered inaccurate, time-consuming and costly. This research aims to design an e-voting information system for the election of the Village Head of Cemandi, Sedati, Sidoarjo, East Java; with this system the election process becomes easier and the accuracy of the vote count is ensured. The system was developed using the waterfall software engineering model. Black-box test results show that the system's features function well. Based on the UAT test, the system received an average percentage rate of 86%.

