scholarly journals KEBI 1.0: Indonesian Spelling Error Detection System for Scientific Papers using Dictionary Lookup and Peter Norvig Spelling Corrector

2021 ◽  
Vol 12 (2) ◽  
pp. 78
Author(s):  
Tresna Maulana Fahrudin ◽  
Ilmatus Sa’diyah ◽  
Latipah Latipah ◽  
Ibnu Zahy’ Atha Illah ◽  
Cagiva Chaedar Bey Lirna ◽  
...  

Many Indonesian spelling errors occur in research papers published to the public, closely related to academics in all institutions such as research institutions, government, schools, and universities. The spelling errors usually writing punctuation, writing letters, writing words, writing words originating from foreign or regional languages (uptake words), using affixed words, and writing ineffective sentences. The mistakes made by the academics then become a cycle in the academic environment. They usually provide guidance for writing an undergraduate thesis, thesis, dissertations to students, or the other forms of documents and scientific papers. Therefore, the research proposed the application to facilitate all authors of scientific papers in producing quality scientific works based on the General Guidelines for Indonesian Spelling published by the Agency for Development and Language Development. The application is named KEBI 1.0 Checker (Indonesian Spelling Error 1.0 Checker), a web-based application with a built-in algorithm to detect and correct Indonesian Spelling in scientific papers. The experiment result shows that the application has given the best accuracy performance to correct the non-standard words, and typographical errors reached 100% and 55,52%, respectively. The application also has been detected 209 meaningless words. The application processing time is relatively low, the average time needed to correct non-standard words is 0.016 seconds, and typo words are 14.58 seconds. KEBI 1.0 Checker is helpful for the end-user in academics but needs to improve the vocabulary of the large corpus in various fields of science for correcting typo words.  

Author(s):  
Tresna Maulana Fahrudin ◽  
Ilmatus Sa'diyah ◽  
Latipah ◽  
Ibnu Zahy Atha Illah ◽  
Cagiva Chaedar Beylirna ◽  
...  

At educational institutions, especially at University, writing scientific papers is a skill that must be possessed by academics such as educators and students. However, writing scientific papers is not easy, there are many provisions and rules that need to be fulfilled. Several studies show that there are still many academics who make mistakes in writing their scientific papers. Some of the mistakes made include punctuation errors, typographic writing errors and the use of non-standard words in Indonesian. Researchers in Indonesia have developed various spelling error detection applications in Indonesian-language scientific papers. This study tries to analyze the development of an application framework for detecting Indonesian spelling errors from various assessment indicators. This study tries to compare the application framework for detecting spelling errors between other studies with proposed application that named KEBI 1.0 Checker. KEBI 1.0 Checker as a spelling error detection application has 3 main features, namely detecting errors in the use of punctuation marks, writing typography, and using non-standard words in accordance with the standards of the Big Indonesian Dictionary and the General Guidelines for Indonesian Spelling. In addition, this study tries to objectively examine the complexity of the features, advantages and disadvantages, methods and the level of accuracy of each application. The results of the analysis show that KEBI 1.0 Checker has the completeness of features, fast computation time, easy application access, and an attractive user interface. However, it is still necessary to improve the precision in correcting spelling errors in typographic words.


CoDAS ◽  
2015 ◽  
Vol 27 (2) ◽  
pp. 113-118 ◽  
Author(s):  
Nathane Sanches Marques Silva ◽  
Patrícia Abreu Pinheiro Crenitte

Objective: To compare the spelling ability of schoolchildren from the fourth to sixth grades of the elementary schools in the private and public schools of Bauru, São Paulo, and to verify whether errors are overcome as studies progress and the hierarchy of errors as to how often they occur. Methods : A dictation was applied to 384 schoolchildren: 206 from the private schools: 74 were at the fourth grade, 65 at the fifth grade, and 67 at the sixth grade; and 178 from the public schools; 56 at the fourth grade, 63 at the fifth grade, and 59 at the sixth grade of elementary school. Student's t test was used. Results: In comparison of total spelling errors score, difference was found among the fourth and sixth grades of the private and public schools. Spelling errors decreased as education progressed, and those related to language irregularities were more common. Conclusion: Spelling ability and performance of students from the private and public schools are not similar in the fourth and sixth grades, but it is in the fifth grade. Spelling errors are gradually overcome as education progresses; however, this overcome rate was considerable between the fourth and fifth grades in the public schools. Decrease in the types of spelling errors follows a hierarchy of categories: phoneme/grapheme conversion, simple contextual rules, complex contextual rules, and language irregularities. Finally, the most common type of spelling error found was that related to language irregularities.


2020 ◽  
Author(s):  
Radana Fauziana Rahmah

Language is a media of communication. The aim of this research ia to know about (1) Indonesian spelling errors in Arkhais and LPP Mandala's journal, (2) Indonesian diction errors in Arkhais and LPP Mandala's journal, and (3) Indonesian structure errors in Arkhais and LPP Mandala's journal. The subject of this research is the essay Made by Arkhais and LPP Mandala's journal. The object of this research is the errors in the use of Indonesian language in Arkhais and LPP Mandala's journal included some aspects such as spelling, diction, and structure. The conclution on this study are as follows. Firstly, the linguistic elements of language errors that often occur in the student text ia divided into four errors : spelling error and morphology. Secondly, the most error in the text of foreign student s is the spelling errors.


2019 ◽  
Vol 1 (1) ◽  
pp. 33-45
Author(s):  
Rosdiana Rosdiana ◽  
Padeli Padeli ◽  
Revi Sajidah Sri Handayani ◽  
Rifky Alfian

The public service administration system at the government offices of the Kemiri village office in the Kemiri District currently does not have a computerized and integrated system. Because the system runs, residents who submit letters for administrative completeness, still use the Ms.Word / Ms.Excel application. of course has many weaknesses including human error, not neat in file storage, resulting in the lengthy process of searching and making a cover letter and required reports. The analytical method used in this study is to use PIECES (Performance, Informance, Economy, Control, Efficiency, Service) analysis, the design of the model uses UML (Unified Modeling Language). The results of this study are web-based letter information systems at the Office of the Village Chief of Kemiri that can accessed using a local computer browser. Thus the information letter needed by the Kemiri Village community and more effective and efficient in making the letter.


Epidemiologia ◽  
2021 ◽  
Vol 2 (1) ◽  
pp. 84-94
Author(s):  
Mst. Marium Begum ◽  
Osman Ulvi ◽  
Ajlina Karamehic-Muratovic ◽  
Mallory R. Walsh ◽  
Hasan Tarek ◽  
...  

Background: Chikungunya is a vector-borne disease, mostly present in tropical and subtropical regions. The virus is spread by Ae. aegypti and Ae. albopictus mosquitos and symptoms include high fever to severe joint pain. Dhaka, Bangladesh, suffered an outbreak of chikungunya in 2017 lasting from April to September. With the goal of reducing cases, social media was at the forefront during this outbreak and educated the public about symptoms, prevention, and control of the virus. Popular web-based sources such as the top dailies in Bangladesh, local news outlets, and Facebook spread awareness of the outbreak. Objective: This study sought to investigate the role of social and mainstream media during the chikungunya epidemic. The study objective was to determine if social media can improve awareness of and practice associated with reducing cases of chikungunya. Methods: We collected chikungunya-related information circulated from the top nine television channels in Dhaka, Bangladesh, airing from 1st April–20th August 2017. All the news published in the top six dailies in Bangladesh were also compiled. The 50 most viewed chikungunya-related Bengali videos were manually coded and analyzed. Other social media outlets, such as Facebook, were also analyzed to determine the number of chikungunya-related posts and responses to these posts. Results: Our study showed that media outlets were associated with reducing cases of chikungunya, indicating that media has the potential to impact future outbreaks of these alpha viruses. Each media outlet (e.g., web, television) had an impact on the human response to an individual’s healthcare during this outbreak. Conclusions: To prevent future outbreaks of chikungunya, media outlets and social media can be used to educate the public regarding prevention strategies such as encouraging safe travel, removing stagnant water sources, and assisting with tracking cases globally to determine where future outbreaks may occur.


Robotics ◽  
2021 ◽  
Vol 10 (2) ◽  
pp. 53
Author(s):  
Luiz F. P. Oliveira ◽  
António P. Moreira ◽  
Manuel F. Silva

The development of robotic systems to operate in forest environments is of great relevance for the public and private sectors. In this sense, this article reviews several scientific papers, research projects and commercial products related to robotic applications for environmental preservation, monitoring, wildfire firefighting, inventory operations, planting, pruning and harvesting. After conducting critical analysis, the main characteristics observed were: (a) the locomotion system is directly affected by the type of environmental monitoring to be performed; (b) different reasons for pruning result in different locomotion and cutting systems; (c) each type of forest, in each season and each type of soil can directly interfere with the navigation technique used; and (d) the integration of the concept of swarm of robots with robots of different types of locomotion systems (land, air or sea) can compensate for the time of executing tasks in unstructured environments. Two major areas are proposed for future research works: Internet of Things (IoT)-based smart forest and navigation systems. It is expected that, with the various characteristics exposed in this paper, the current robotic forest systems will be improved, so that forest exploitation becomes more efficient and sustainable.


2021 ◽  
pp. 105566562199530
Author(s):  
İlkem Kara ◽  
Aydan Baştuğ Dumbak ◽  
Maviş Emel Kulak Kayıkcı

Introduction: Factors such as teachers’ appropriate support and social interactions have an impact on the academic performance of children with cleft lip and/or palate (CL/P). This study was designed to investigate the perceptions of the teachers and the general public about the academic and cognitive performance of individuals with CL/P. Methods: This study was included 360 (male/female = 102/258) teachers and 640 (male/female = 259/381) participants that represent the general public. Anonymized web-based and paper-and-pencil self-administered questionnaire that included multiple-choice and yes/no questions were administered. Within-group differences and intergroup differences were analyzed in terms of academic and cognitive performance. Results: Most of the teachers and the general public indicated that the academic and cognitive performance of individuals with CL/P is the same as their unaffected peers. A significantly higher proportion of the teachers indicated that the academic performance of children with CL/P is the same as their unaffected peers than the general public. Conclusion: Considering that the general public’s attitudes and appropriate teacher support are crucial to prevent adverse impacts on the lives of individuals with CL/P, it is important to support teachers with the appropriate information and to encourage the public to recognize that everybody with a facial difference should be treated as an individual rather than a disability.


Electronics ◽  
2020 ◽  
Vol 9 (10) ◽  
pp. 1686 ◽  
Author(s):  
Nancy Agarwal ◽  
Mudasir Ahmad Wani ◽  
Patrick Bours

This work focuses on designing a grammar detection system that understands both structural and contextual information of sentences for validating whether the English sentences are grammatically correct. Most existing systems model a grammar detector by translating the sentences into sequences of either words appearing in the sentences or syntactic tags holding the grammar knowledge of the sentences. In this paper, we show that both these sequencing approaches have limitations. The former model is over specific, whereas the latter model is over generalized, which in turn affects the performance of the grammar classifier. Therefore, the paper proposes a new sequencing approach that contains both information, linguistic as well as syntactic, of a sentence. We call this sequence a Lex-Pos sequence. The main objective of the paper is to demonstrate that the proposed Lex-Pos sequence has the potential to imbibe the specific nature of the linguistic words (i.e., lexicals) and generic structural characteristics of a sentence via Part-Of-Speech (POS) tags, and so, can lead to a significant improvement in detecting grammar errors. Furthermore, the paper proposes a new vector representation technique, Word Embedding One-Hot Encoding (WEOE) to transform this Lex-Pos into mathematical values. The paper also introduces a new error induction technique to artificially generate the POS tag specific incorrect sentences for training. The classifier is trained using two corpora of incorrect sentences, one with general errors and another with POS tag specific errors. Long Short-Term Memory (LSTM) neural network architecture has been employed to build the grammar classifier. The study conducts nine experiments to validate the strength of the Lex-Pos sequences. The Lex-Pos -based models are observed as superior in two ways: (1) they give more accurate predictions; and (2) they are more stable as lesser accuracy drops have been recorded from training to testing. To further prove the potential of the proposed Lex-Pos -based model, we compare it with some well known existing studies.


Sensors ◽  
2021 ◽  
Vol 21 (5) ◽  
pp. 1820
Author(s):  
Xiaotao Shao ◽  
Qing Wang ◽  
Wei Yang ◽  
Yun Chen ◽  
Yi Xie ◽  
...  

The existing pedestrian detection algorithms cannot effectively extract features of heavily occluded targets which results in lower detection accuracy. To solve the heavy occlusion in crowds, we propose a multi-scale feature pyramid network based on ResNet (MFPN) to enhance the features of occluded targets and improve the detection accuracy. MFPN includes two modules, namely double feature pyramid network (FPN) integrated with ResNet (DFR) and repulsion loss of minimum (RLM). We propose the double FPN which improves the architecture to further enhance the semantic information and contours of occluded pedestrians, and provide a new way for feature extraction of occluded targets. The features extracted by our network can be more separated and clearer, especially those heavily occluded pedestrians. Repulsion loss is introduced to improve the loss function which can keep predicted boxes away from the ground truths of the unrelated targets. Experiments carried out on the public CrowdHuman dataset, we obtain 90.96% AP which yields the best performance, 5.16% AP gains compared to the FPN-ResNet50 baseline. Compared with the state-of-the-art works, the performance of the pedestrian detection system has been boosted with our method.


Sign in / Sign up

Export Citation Format

Share Document