Hybrid Segmentation Prototype for Arabic Text-Based Documents
Keyword(s):
The contribution of this work relates to the field of Arabic text-based document analysis for the detection of plagiarism. This analysis will be carried out according to the triadic computation model of document similarity. The authors propose a hybrid segmentation prototype for Arabic text-based documents that links different processing steps in order to generate the similarity rate between the documents of an Arabic corpus. It involves two segmentation systems and a morphological analysis in order to obtain a matrix representation adapted to the triadic similarity computation according to three abstraction levels: documents, sentences and words.
2015 ◽
Vol 6
(1)
◽
pp. 63-74
◽
Keyword(s):
Keyword(s):
2010 ◽
Vol 44-47
◽
pp. 3965-3969
Keyword(s):
2022 ◽
Vol 21
(1)
◽
pp. 1-25