Methodology of Selecting the Hadoop Ecosystem Configuration in Order to Improve the Performance of a Plagiarism Detection System

Analysis of Stylometric Features and Segmentation Strategies in Intrinsic Plagiarism Detection System

Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) ◽

10.29207/resti.v4i5.2486 ◽

2020 ◽

Vol 4 (5) ◽

pp. 988-997

Author(s):

Sylvia Putri Gunawan ◽

Lucia Dwi Krisnawati ◽

Antonius Rachmat Chrismanto

Keyword(s):

Detection System ◽

Plagiarism Detection ◽

Development System ◽

Intrinsic Plagiarism Detection

Two different paradigms in the field of plagiarism detection have resulted in External Plagiarism Detection (EPD) and Intrinsic Plagiarism Detection (IPD) systems. The most commonly applied system is EPD, which requires its algorithm to make a heuristic comparison between a suspicious document and the documents in a corpus. In contrast, given only a suspicious document, an IPD algorithm should be able to find the plagiarized sections by looking for text segments written in a different style. Previous research on Indonesian texts has addressed only the development of EPD systems. This research therefore focuses on experimenting with and analyzing stylometric features and segmentation strategies to build an IPD system for Indonesian texts. The experimental results show that the paragraph segment performs better, scoring 0.92 for macro-averaged accuracy and 0.54 for macro-averaged F1. The stylometric features achieving the highest F1 and accuracy scores are the frequency of punctuation, the average paragraph length, and the type-token ratio.
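
The three stylometric features named in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the blank-line paragraph splitting, the whitespace tokenization, and the use of word count as the length measure are all assumptions made for illustration.

```python
import string


def split_paragraphs(text: str) -> list[str]:
    """Split a document into paragraph segments (assumed: blank-line separated)."""
    return [p.strip() for p in text.split("\n\n") if p.strip()]


def stylometric_features(segment: str) -> dict:
    """Compute three simple style features for one text segment.

    Assumptions: tokens are whitespace-separated, and ASCII punctuation
    is stripped from token edges before counting words.
    """
    tokens = segment.lower().split()
    words = [t.strip(string.punctuation) for t in tokens]
    words = [w for w in words if w]
    n_chars = len(segment)
    n_punct = sum(1 for ch in segment if ch in string.punctuation)
    return {
        # share of characters that are punctuation marks
        "punct_freq": n_punct / n_chars if n_chars else 0.0,
        # segment length measured in words
        "length_in_words": len(words),
        # distinct words divided by total words
        "type_token_ratio": len(set(words)) / len(words) if words else 0.0,
    }


def average_paragraph_length(text: str) -> float:
    """Mean paragraph length in words over the whole document."""
    lengths = [stylometric_features(p)["length_in_words"] for p in split_paragraphs(text)]
    return sum(lengths) / len(lengths) if lengths else 0.0
```

An intrinsic detector would compare these per-segment values against the rest of the document and flag segments that deviate strongly; how that comparison is done is not specified in the abstract.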

Java Source Code Plagiarism Detection System

International Journal for Research in Applied Science and Engineering Technology ◽

10.22214/ijraset.2018.3748 ◽

2018 ◽

Vol 6 (3) ◽

pp. 3596-3600

Author(s):

Mrs. Ghuge Madhuri Laxman

Keyword(s):

Detection System ◽

Source Code ◽

Plagiarism Detection

A Two Phases Plagiarism Detection System for the Newspaper Articles by using a Web Search and a Document Similarity Estimation

The KIPS Transactions PartB ◽

10.3745/kipstb.2009.16-b.2.181 ◽

2009 ◽

Vol 16B (2) ◽

pp. 181-194 ◽

Cited By ~ 1

Author(s):

Jung-Hyun Cho ◽

Hyun-Ki Jung ◽

Yu-Seop Kim

Keyword(s):

Web Search ◽

Detection System ◽

Plagiarism Detection ◽

Document Similarity ◽

Similarity Estimation ◽

Two Phases ◽

Newspaper Articles

PlagZap: A Textual Plagiarism Detection System for Student Assignments Built with Open-Source Software

Advances in Intelligent Systems and Computing - International Joint Conference SOCO’18-CISIS’18-ICEUTE’18 ◽

10.1007/978-3-319-94120-2_48 ◽

2018 ◽

pp. 500-508

Author(s):

Elena Băutu ◽

Andrei Băutu

Keyword(s):

Open Source ◽

Open Source Software ◽

Detection System ◽

Plagiarism Detection

Design and Implementation of Arabic Plagiarism Detection System

Intelligent Systems Reference Library - Further Advances in Internet of Things in Biomedical and Cyber Physical Systems ◽

10.1007/978-3-030-57835-0_25 ◽

2021 ◽

pp. 347-358

Author(s):

Zahraa Jasim Jaber ◽

Ahmed H. Aliwy

Keyword(s):

Detection System ◽

Plagiarism Detection ◽

Design And Implementation

Fast Plagiarism Detection System

String Processing and Information Retrieval - Lecture Notes in Computer Science ◽

10.1007/11575832_30 ◽

2005 ◽

pp. 267-270 ◽

Cited By ~ 21

Author(s):

Maxim Mozgovoy ◽

Kimmo Fredriksson ◽

Daniel White ◽

Mike Joy ◽

Erkki Sutinen

Keyword(s):

Detection System ◽

Plagiarism Detection

Text Documents Plagiarism Detection using Rabin-Karp and Jaro-Winkler Distance Algorithms

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v5.i2.pp462-471 ◽

2017 ◽

Vol 5 (2) ◽

pp. 462-471 ◽

Cited By ~ 3

Author(s):

Brinardi Leonardo ◽

Seng Hansun

Keyword(s):

Detection System ◽

String Matching ◽

Experimental Results ◽

Plagiarism Detection ◽

Text Documents ◽

Matching Algorithm ◽

Text Document ◽

Different Types ◽

The University

Plagiarism is an act that the university considers fraud: taking someone else's ideas or writings without citing the source and claiming them as one's own. Plagiarism detection systems generally implement a string matching algorithm to search for common words between text documents. Several algorithms are used for string matching; two of them are the Rabin-Karp and Jaro-Winkler Distance algorithms. The Rabin-Karp algorithm is well suited to the problem of matching multiple string patterns, while the Jaro-Winkler Distance algorithm has advantages in terms of time. A plagiarism detection application was developed and tested on different types of documents, i.e. doc, docx, pdf and txt. The experimental results show that both algorithms can be used to detect plagiarism in those documents, but the Rabin-Karp algorithm is much more effective and faster when processing documents larger than 1000 KB.
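
For orientation, the two algorithms compared in this abstract can be sketched in their textbook form. This is not the authors' application code: the function names, the hash base and modulus, and the prefix scaling factor `p=0.1` are conventional choices assumed here. Rabin-Karp locates exact occurrences of a pattern using a rolling hash, while Jaro-Winkler returns a similarity score between 0 and 1 for two strings.

```python
def rabin_karp(text: str, pattern: str, base: int = 256, mod: int = 1_000_000_007) -> list[int]:
    """Return the start index of every exact occurrence of pattern in text."""
    n, m = len(text), len(pattern)
    if m == 0 or m > n:
        return []
    high = pow(base, m - 1, mod)  # weight of the leading character in the window
    p_hash = t_hash = 0
    for i in range(m):
        p_hash = (p_hash * base + ord(pattern[i])) % mod
        t_hash = (t_hash * base + ord(text[i])) % mod
    hits = []
    for i in range(n - m + 1):
        # compare the strings only when the hashes agree (guards against collisions)
        if p_hash == t_hash and text[i:i + m] == pattern:
            hits.append(i)
        if i < n - m:
            # roll the window: drop text[i], append text[i + m]
            t_hash = ((t_hash - ord(text[i]) * high) * base + ord(text[i + m])) % mod
    return hits


def jaro(s1: str, s2: str) -> float:
    """Jaro similarity in [0, 1]."""
    if s1 == s2:
        return 1.0
    len1, len2 = len(s1), len(s2)
    if len1 == 0 or len2 == 0:
        return 0.0
    window = max(max(len1, len2) // 2 - 1, 0)
    match1, match2 = [False] * len1, [False] * len2
    matches = 0
    for i, ch in enumerate(s1):
        for j in range(max(0, i - window), min(len2, i + window + 1)):
            if not match2[j] and s2[j] == ch:
                match1[i] = match2[j] = True
                matches += 1
                break
    if matches == 0:
        return 0.0
    # count matched characters that appear in a different order
    k = out_of_order = 0
    for i in range(len1):
        if match1[i]:
            while not match2[k]:
                k += 1
            if s1[i] != s2[k]:
                out_of_order += 1
            k += 1
    t = out_of_order // 2
    return (matches / len1 + matches / len2 + (matches - t) / matches) / 3


def jaro_winkler(s1: str, s2: str, p: float = 0.1) -> float:
    """Jaro similarity boosted for a shared prefix of up to four characters."""
    j = jaro(s1, s2)
    prefix = 0
    for a, b in zip(s1[:4], s2[:4]):
        if a != b:
            break
        prefix += 1
    return j + prefix * p * (1 - j)
```

For example, `rabin_karp("abracadabra", "abra")` finds the pattern at indices 0 and 7, and `jaro_winkler("MARTHA", "MARHTA")` is approximately 0.961.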

A plagiarism detection system

ACM SIGCSE Bulletin ◽

10.1145/953049.800955 ◽

1981 ◽

Vol 13 (1) ◽

pp. 21-25 ◽

Cited By ~ 20

Author(s):

John L. Donaldson ◽

Ann-Marie Lancaster ◽

Paula H. Sposato

Keyword(s):

Detection System ◽

Plagiarism Detection

EPlag: A two layer source code plagiarism detection system

Eighth International Conference on Digital Information Management (ICDIM 2013) ◽

10.1109/icdim.2013.6693984 ◽

2013 ◽

Cited By ~ 2

Author(s):

Omer Ajmal ◽

M. M. Saad Missen ◽

Tazeen Hashmat ◽

M. Moosa ◽

Tenvir Ali

Keyword(s):

Detection System ◽

Source Code ◽

Plagiarism Detection

Online Multilingual Plagiarism Detection System Using Multi Search Engines

Journal of Southwest Jiaotong University ◽

10.35741/issn.0258-2724.54.6.30 ◽

2019 ◽

Vol 54 (6) ◽

Author(s):

Maytham Alabbas ◽

Raidah S. Khudeyer ◽

Mustafa Radif ◽

Hassan Khalid Hameed

Keyword(s):

Search Engines ◽

Web Application ◽

Detection System ◽

Current System ◽

Special Focus ◽

Plagiarism Detection ◽

Ethical Misconduct ◽

The One ◽

Multilingual Text ◽

Innovative Techniques

Using someone else's work or ideas without attribution is plagiarism, whether you meant to do it or not. Unintended plagiarism of a snippet of text can have serious consequences and can be a serious form of ethical misconduct. The current system is a web application that enables you to check a multilingual text, with special focus on Arabic, for duplicate content on the World Wide Web. You input or paste your text into the online system, and each sentence in the text is queried against the results pages (SERPs) of three popular search engines: Google, Bing, and Yandex. For each search engine, the system tries to find the top three results on the first page where duplicate content already exists. The system obtains its data from the custom search APIs of the three search engines. It then applies a text similarity technique between the suspicious sentence and the retrieved text snippet for all nine results. The reported result is the one that gives the highest similarity rate. The results were encouraging and will open doors for new and innovative techniques for researchers in this field.
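
The final selection step described in this abstract, scoring each retrieved snippet against the suspicious sentence and keeping the highest, can be sketched as follows. The abstract does not name the similarity technique, so word-level Jaccard similarity is used here purely as a stand-in. The function names and the `(engine, snippet)` input format are hypothetical, and no search engine API is called: the snippets are assumed to have been retrieved already.

```python
def jaccard(a: str, b: str) -> float:
    """Word-set Jaccard similarity (a stand-in for the unspecified technique)."""
    sa, sb = set(a.lower().split()), set(b.lower().split())
    if not sa or not sb:
        return 0.0
    return len(sa & sb) / len(sa | sb)


def best_match(sentence: str, snippets: list[tuple[str, str]]):
    """Return (score, engine, snippet) for the most similar snippet, or None.

    snippets: (engine_name, snippet_text) pairs, e.g. three per search engine.
    """
    scored = [(jaccard(sentence, text), engine, text) for engine, text in snippets]
    return max(scored, key=lambda item: item[0], default=None)
```

With three engines and three results each, `snippets` would hold nine pairs per sentence, matching the nine results mentioned in the abstract.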
