An accurate estimation of the Levenshtein distance using metric trees and Manhattan distance

Author(s):  
Thierry Lavoie ◽  
Ettore Merlo
Author(s):  
W. R. Schucany ◽  
G. H. Kelsoe ◽  
V. F. Allison

Accurate estimation of the size of spheroid organelles from thin sectioned material is often necessary, as uniquely homogenous populations of organelles such as vessicles, granules, or nuclei often are critically important in the morphological identification of similar cell types. However, the difficulty in obtaining accurate diameter measurements of thin sectioned organelles is well known. This difficulty is due to the extreme tenuity of the sectioned material as compared to the size of the intact organelle. In populations where low variance is suspected the traditional method of diameter estimation has been to measure literally hundreds of profiles and to describe the “largest” as representative of the “approximate maximal diameter”.


Author(s):  
Virginie Crollen ◽  
Julie Castronovo ◽  
Xavier Seron

Over the last 30 years, numerical estimation has been largely studied. Recently, Castronovo and Seron (2007) proposed the bi-directional mapping hypothesis in order to account for the finding that dependent on the type of estimation task (perception vs. production of numerosities), reverse patterns of performance are found (i.e., under- and over-estimation, respectively). Here, we further investigated this hypothesis by submitting adult participants to three types of numerical estimation task: (1) a perception task, in which participants had to estimate the numerosity of a non-symbolic collection; (2) a production task, in which participants had to approximately produce the numerosity of a symbolic numerical input; and (3) a reproduction task, in which participants had to reproduce the numerosity of a non-symbolic numerical input. Our results gave further support to the finding that different patterns of performance are found according to the type of estimation task: (1) under-estimation in the perception task; (2) over-estimation in the production task; and (3) accurate estimation in the reproduction task. Moreover, correlation analyses revealed that the more a participant under-estimated in the perception task, the more he/she over-estimated in the production task. We discussed these empirical data by showing how they can be accounted by the bi-directional mapping hypothesis ( Castronovo & Seron, 2007 ).


1969 ◽  
Vol 62 (4_Suppla) ◽  
pp. S23-S35
Author(s):  
B.-A. Lamberg ◽  
O. P. Heinonen ◽  
K. Liewendahl ◽  
G. Kvist ◽  
M. Viherkoski ◽  
...  

ABSTRACT The distributions of 13 variables based on 10 laboratory tests measuring thyroid function were studied in euthyroid controls and in patients with toxic diffuse or toxic multinodular goitre. Density functions were fitted to the empirical data and the goodness of fit was evaluated by the use of the χ2-test. In a few instances there was a significant difference but the material available was in some respects too small to allow a very accurate estimation. The normal limits for each variable was defined by the 2.5 and 97.5 percentiles. It appears that in some instances these limits are too rigorous from the practical point of view. It is emphasized that the crossing point of the functions for euthyroid controls and hyperthyroid patients may be a better limit to use. In a preliminary analysis of the diagnostic efficiency the variables of total or free hormone concentration in the blood proved clearily superior to all other variables.


2017 ◽  
Vol 3 (2) ◽  
pp. 1-6
Author(s):  
Ferly Gunawan ◽  
M. Ali Fauzi ◽  
Putra Pandu Adikara

Perkembangan aplikasi mobile yang pesat membuat banyak aplikasi diciptakan dengan berbagai kegunaan untuk memenuhi kebutuhan pengguna. Setiap aplikasi memungkinkan pengguna untuk memberi ulasan tentang aplikasi tersebut. Tujuan dari ulasan adalah untuk mengevaluasi dan meningkatkan kualitas produk ke depannya. Untuk mengetahui hal tersebut, analisis sentimen dapat digunakan untuk mengklasifikasikan ulasan ke dalam sentimen positif atau negatif. Pada ulasan aplikasi biasanya terdapat salah eja sehingga sulit dipahami. Kata yang mengalami salah eja perlu dilakukan normalisasi kata untuk diubah menjadi kata standar. Karena itu, normalisasi kata dibutuhkan untuk menyelesaikan masalah salah eja. Penelitian ini menggunakan normalisasi kata berbasis Levenshtein distance. Berdasarkan pengujian, nilai akurasi tertinggi terdapat pada perbandingan data latih 70% dan data uji 30%. Hasil akurasi tertinggi dari pengujian menggunakan nilai edit <=2 adalah 100%, nilai edit tertinggi kedua didapat pada nilai edit <=1 dengan akurasi 96,4%, sedangkan nilai edit dengan akurasi terendah diperoleh pada nilai edit <=4 dan <=5 dengan akurasi 66,6%. Hasil dari pengujian Naive Bayes-Levenshtein Distance memiliki nilai akurasi tertinggi yaitu 96,9% dibandingkan dengan pengujian Naive Bayes tanpa Levenshtein Distance dengan nilai akurasi 94,4%.  


2015 ◽  
Vol 63 (1) ◽  

The aim of this study was to investigate differences in course times of a mountainmarathon (Napfmarathon) versus a city Marathon. Therefore all participants of Napfmarathon were screened concerning a double participation on a city marathon (Zürich, Winterthur, Lausanne, Luzern) and the course time were compared. Of key interest was the influence of ascents and descents which were quantified according to ­guidelines of Youth & Sport (Jugend + Sport / Jeunesse et Sport), whereby in first approximation 100 meter of ascent, 150 meter of descent (more than 20%) and 1 km of horizontal distance were taken as a simallar performance correlat. For the identified double starter different average times per km resulted. For the city marathon with an average time of 4 min 52 sec and for the Napfmarathon with 4 min 28 sec. If speed per km was calculated only with ascent and horizontal distances having performance relevance an average time of 4 min 56 sec per km was identified. This effect seems to be independet from distance absolved, resulting for Halbmarathon on an average time of distance of 4 min 13 sec, for Napfmarathon of 4 min 4 sec and for the performance concept only with ascent an average time per km of 4 min 16 sec. These analysis reveal, that if only ascent is taxed average course times differ less than 5 sec for both distances. For these particular reasons we recommend for running events to calculate only based on ascent and horizontal distances making necessary adjustments based on length of course, steepness of ascent and descent, character of terain (middle-country, pre-alps, alpes) for accurate estimation of course times.


Author(s):  
Sangita Solanki ◽  
Raksha Upadhyay ◽  
Uma Rathore Bhatt

Cloud-integrated wireless optical broadband (CIW) access networks inheriting advantages of cloud computing, wireless and optical access networks have a broad prospect in the future. Due to failure of components like OLT level, ONU level, link or path failure and cloud component level in CIW, survivability is becoming one of the important issues. In this paper, we have presented cloud-integrated wireless-optical broadband access network with survivability using integer linear programming (ILP) model, to minimize the number of cloud components while providing maximum backup paths. Hence, we have proposed protection through cloud-integrated wireless router to available ONUs (PCIWRAO). So, evaluated the backup path computation. We have considered ONU level failure in which the affected traffic is transferred through wireless routers and cloud component to the available ONUs using Manhattan distance algorithm. Simulation results show different configurations for different number of routers and cloud components illustrating available backup path when ONU fails.


Author(s):  
Seema Rani ◽  
Avadhesh Kumar ◽  
Naresh Kumar

Background: Duplicate content often corrupts the filtering mechanism in online question answering. Moreover, as users are usually more comfortable conversing in their native language questions, transliteration adds to the challenges in detecting duplicate questions. This compromises with the response time and increases the answer overload. Thus, it has now become crucial to build clever, intelligent and semantic filters which semantically match linguistically disparate questions. Objective: Most of the research on duplicate question detection has been done on mono-lingual, majorly English Q&A platforms. The aim is to build a model which extends the cognitive capabilities of machines to interpret, comprehend and learn features for semantic matching in transliterated bi-lingual Hinglish (Hindi + English) data acquired from different Q&A platforms. Method: In the proposed DQDHinglish (Duplicate Question Detection) Model, firstly language transformation (transliteration & translation) is done to convert the bi-lingual transliterated question into a mono-lingual English only text. Next a hybrid of Siamese neural network containing two identical Long-term-Short-memory (LSTM) models and Multi-layer perceptron network is proposed to detect semantically similar question pairs. Manhattan distance function is used as the similarity measure. Result: A dataset was prepared by scrapping 100 question pairs from various social media platforms, such as Quora and TripAdvisor. The performance of the proposed model on the basis of accuracy and F-score. The proposed DQDHinglish achieves a validation accuracy of 82.40%. Conclusion: A deep neural model was introduced to find semantic match between English question and a Hinglish (Hindi + English) question such that similar intent questions can be combined to enable fast and efficient information processing and delivery. A dataset was created and the proposed model was evaluated on the basis of performance accuracy. To the best of our knowledge, this work is the first reported study on transliterated Hinglish semantic question matching.


2021 ◽  
Vol 11 (1) ◽  
pp. 126
Author(s):  
Enrique Noé ◽  
Joan Ferri ◽  
José Olaya ◽  
María Dolores Navarro ◽  
Myrtha O’Valle ◽  
...  

Accurate estimation of the neurobehavioral progress of patients with unresponsive wakefulness syndrome (UWS) is essential to anticipate their most likely clinical course and guide clinical decision making. Although different studies have described this progress and possible predictors of neurobehavioral improvement in these patients, they have methodological limitations that could restrict the validity and generalization of the results. This study investigates the neurobehavioral progress of 100 patients with UWS consecutively admitted to a neurorehabilitation center using systematic weekly assessments based on standardized measures, and the prognostic factors of changes in their neurobehavioral condition. Our results showed that, during the analyzed period, 34% of the patients were able to progress from UWS to minimally conscious state (MCS), 12% of the total sample (near one third from those who progressed to MCS) were able to emerge from MCS, and 10% of the patients died. Transition to MCS was mostly denoted by visual signs, which appeared either alone or in combination with motor signs, and was predicted by etiology and the score on the Coma Recovery Scale-Revised at admission with an accuracy of 75%. Emergence from MCS was denoted in the same proportion by functional communication and object use. Predictive models of emergence from MCS and mortality were not valid and the identified predictors could not be accounted for.


Sign in / Sign up

Export Citation Format

Share Document