scholarly journals A Higher-Order N-gram Model to Enhance Automatic Word Prediction for Assamese Sentences Containing Ambiguous Words

Word prediction is a technique which tries to suggest the users’ words after knowing the few input letters of the user. This predictive model also tries to generate the future words or next words of a sentence by observing earlier words of the sentence. In this research, two problems are combined, one is word prediction and the next is handling of ambiguous words. A word prediction model predicts the future words of a sentence by using n-gram based model. In general, predictive models use unigram, bigram or trigram models to predict the next words. In case of sentences consisting of ambiguous words, the predictive model by using only bigram or trigram cannot perform well to predict the next words. To enhance this prediction for ambiguous words, maximum of six previous input words are observed and try to predict almost the exact words after the ambiguous words in those particular contexts. Different level of experiments are done and the results are compared for modified or enhanced prediction model with the traditional prediction model, improvement on accuracy and failure rate are found in the enhanced model. The accuracy of the Traditional Model is 60.68% on the hand the accuracy of the Enhanced Model is 66.88%. The failure rate of the Traditional Model is 32.35% and the Enhanced Model is 29.17%

2019 ◽  
Vol 8 (1) ◽  
Author(s):  
Komang Agus Rudi Indra Laksmana ◽  
Ayu Darmawati

This study aimed at analyzing how the results of the Grover, Springate and Zmijewski models predict the bankruptcy of PT Citra Maharlika Nusantara Corpora Tbk for the period of June 2013 - September 2016. This study also aimed at measuring the accuracy of the bankruptcy prediction model and determined which predictive model of the three models was the most accurate. From the data analysis, it was found that Springate model was the most accurate prediction model with 100% accuracy rate to predict the bankruptcy of PT Citra Maharlika Nusantara Corpora Tbk compared to the Grover model with an accuracy rate of 71.48% and Zmijewski model with the lowest accuracy rate of 21.48%. The limitations of this study was this study only carried out in one company, thus in the future it is expected that the model will be tested in more than one company and type of business sector.Keywords: Financial Distress, Grover, Springate, Zmijewski ModelsPenelitian ini bertujuan untuk menganalisis bagaimana hasil dari model Grover, Springate dan Zmijewski dalam memprediksi kebangkrutan PT Citra Maharlika Nusantara Corpora Tbk periode Juni 2013 – September 2016 serta mengukur tingkat akurasi model prediksi kebangkrutan tersebut dan menentukan model prediksi manakah diantara ketiga model tersebut yang paling akurat. Model Springate menjadi model prediksi paling akurat dengan tingkat akurasi 100% untuk memprediksi kebangkrutan PT Citra Maharlika Nusantara Corpora Tbk dibandingakan dengan model Grover dengan tingkat akurasi 71,48% dan model Zmijewski dengan tingkat akurasi paling rendah sebesar 21,48%.Keterbatasan penelitian ini terletak pada pengujian model pada satu perusahaan di satu unit sektor usaha, kedepan bisa dilakukan pengujian pada berbagai jenis sektor usaha.Kata kunci: Financial Distress, Model Grover, Springate, Zmijewski


The system of route correction of an unmanned aerial vehicle (UAV) is considered. For the route correction the on-board radar complex is used. In conditions of active interference, it is impossible to use radar images for the route correction so it is proposed to use the on-board navigation system with algorithmic correction. An error compensation scheme of the navigation system in the output signal using the algorithm for constructing a predictive model of the system errors is applied. The predictive model is building using the genetic algorithm and the method of group accounting of arguments. The quality comparison of the algorithms for constructing predictive models is carried out using mathematical modeling.


2021 ◽  
Vol 205 ◽  
pp. 108787
Author(s):  
Lei Chen ◽  
Ziyun Yuan ◽  
JianXin Xu ◽  
Jingyang Gao ◽  
Yuhan Zhang ◽  
...  

2015 ◽  
Vol 789-790 ◽  
pp. 263-267
Author(s):  
Yan Lei Li ◽  
Ming Yan Wang ◽  
You Min Hu ◽  
Bo Wu

This paper proposes a new method to predict the spindle deformation based on temperature data. The method introduces ANFIS (adaptive neuro-fuzzy inference system). For building the predictive model, we first extract temperature data from sensors in the spindle, and then they are used as the inputs to train ANFIS. To evaluate the performance of the prediction, an experiment is implemented. Three Pt-100 thermal resistances is used to monitor the spindle temperature, and an inductive current sensor is used to obtain the spindle deformation. The experimental results display that our prediction model can better predict the spindle deformation and improve the performance of the spindle.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Na Zhao ◽  
Jian Wang ◽  
Yong Yu ◽  
Jun-Yan Zhao ◽  
Duan-Bing Chen

AbstractMany state-of-the-art researches focus on predicting infection scale or threshold in infectious diseases or rumor and give the vaccination strategies correspondingly. In these works, most of them assume that the infection probability and initially infected individuals are known at the very beginning. Generally, infectious diseases or rumor has been spreading for some time when it is noticed. How to predict which individuals will be infected in the future only by knowing the current snapshot becomes a key issue in infectious diseases or rumor control. In this report, a prediction model based on snapshot is presented to predict the potentially infected individuals in the future, not just the macro scale of infection. Experimental results on synthetic and real networks demonstrate that the infected individuals predicted by the model have good consistency with the actual infected ones based on simulations.


2021 ◽  
Vol 36 (Supplement_1) ◽  
Author(s):  
J A Ortiz ◽  
R Morales ◽  
B Lledo ◽  
E Garcia-Hernandez ◽  
A Cascales ◽  
...  

Abstract Study question Is it possible to predict the likelihood of an IVF embryo being aneuploid and/or mosaic using a machine learning algorithm? Summary answer There are paternal, maternal, embryonic and IVF-cycle factors that are associated with embryonic chromosomal status that can be used as predictors in machine learning models. What is known already The factors associated with embryonic aneuploidy have been extensively studied. Mostly maternal age and to a lesser extent male factor and ovarian stimulation have been related to the occurrence of chromosomal alterations in the embryo. On the other hand, the main factors that may increase the incidence of embryo mosaicism have not yet been established. The models obtained using classical statistical methods to predict embryonic aneuploidy and mosaicism are not of high reliability. As an alternative to traditional methods, different machine and deep learning algorithms are being used to generate predictive models in different areas of medicine, including human reproduction. Study design, size, duration The study design is observational and retrospective. A total of 4654 embryos from 1558 PGT-A cycles were included (January-2017 to December-2020). The trophoectoderm biopsies on D5, D6 or D7 blastocysts were analysed by NGS. Embryos with ≤25% aneuploid cells were considered euploid, between 25-50% were classified as mosaic and aneuploid with >50%. The variables of the PGT-A were recorded in a database from which predictive models of embryonic aneuploidy and mosaicism were developed. Participants/materials, setting, methods The main indications for PGT-A were advanced maternal age, abnormal sperm FISH and recurrent miscarriage or implantation failure. Embryo analysis were performed using Veriseq-NGS (Illumina). The software used to carry out all the analysis was R (RStudio). The library used to implement the different algorithms was caret. In the machine learning models, 22 predictor variables were introduced, which can be classified into 4 categories: maternal, paternal, embryonic and those specific to the IVF cycle. Main results and the role of chance The different couple, embryo and stimulation cycle variables were recorded in a database (22 predictor variables). Two different predictive models were performed, one for aneuploidy and the other for mosaicism. The predictor variable was of multi-class type since it included the segmental and whole chromosome alteration categories. The dataframe were first preprocessed and the different classes to be predicted were balanced. A 80% of the data were used for training the model and 20% were reserved for further testing. The classification algorithms applied include multinomial regression, neural networks, support vector machines, neighborhood-based methods, classification trees, gradient boosting, ensemble methods, Bayesian and discriminant analysis-based methods. The algorithms were optimized by minimizing the Log_Loss that measures accuracy but penalizing misclassifications. The best predictive models were achieved with the XG-Boost and random forest algorithms. The AUC of the predictive model for aneuploidy was 80.8% (Log_Loss 1.028) and for mosaicism 84.1% (Log_Loss: 0.929). The best predictor variables of the models were maternal age, embryo quality, day of biopsy and whether or not the couple had a history of pregnancies with chromosomopathies. The male factor only played a relevant role in the mosaicism model but not in the aneuploidy model. Limitations, reasons for caution Although the predictive models obtained can be very useful to know the probabilities of achieving euploid embryos in an IVF cycle, increasing the sample size and including additional variables could improve the models and thus increase their predictive capacity. Wider implications of the findings Machine learning can be a very useful tool in reproductive medicine since it can allow the determination of factors associated with embryonic aneuploidies and mosaicism in order to establish a predictive model for both. To identify couples at risk of embryo aneuploidy/mosaicism could benefit them of the use of PGT-A. Trial registration number Not Applicable


PEDIATRICS ◽  
1983 ◽  
Vol 71 (3) ◽  
pp. 465-465
Author(s):  
C. E. HEALY ◽  
WELBORN CLINIC

To the Editor.— I am concerned about measles immunization in that most studies show up to a 5% failure rate with the current immunization practice, at least as measured by the hemagglutination-inhibition (HI) technique. Some of these children might be protected by neutralizing antibodies or cellular (lymphocytic)-mediated immunity, but the extent and duration of this immunity, if any, is unknown. HI immunity is apparently permanent. The end result of the current immunization practice will be a large pool of hundreds of thousands and eventually, millions of adults, who will be susceptible to measles.


2018 ◽  
Vol 204 ◽  
pp. 02018
Author(s):  
Aisyah Larasati ◽  
Anik Dwiastutik ◽  
Darin Ramadhanti ◽  
Aal Mahardika

This study aims to explore the effect of kurtosis level of the data in the output layer on the accuracy of artificial neural network predictive models. The artificial neural network predictive models are comprised of one node in the output layer and six nodes in the input layer. The number of hidden layer is automatically built by the program. Data are generated using simulation approach. The results show that the kurtosis level of the node in the output layer is significantly affect the accuracy of the artificial neural network predictive model. Platycurtic and leptocurtic data has significantly higher misclassification rates than mesocurtic data. However, the misclassification rates between platycurtic and leptocurtic is not significantly different. Thus, data distribution with kurtosis nearly to zero results in a better ANN predictive model.


Sign in / Sign up

Export Citation Format

Share Document