scholarly journals Empirical Evaluation of the Difficulty of Finding a Good Value of k for the Nearest Neighbor

Author(s):  
Francisco J. Ferrer-Troyano ◽  
Jesús S. Aguilar-Ruiz ◽  
José C. Riquelme
1996 ◽  
Vol 13 (1) ◽  
pp. 15-30 ◽  
Author(s):  
J. E. Cook

AbstractMosaics of neurons are usually quantified by methods based on nearest-neighbor distance (NND). The commonest indicator of regularity has been the ratio of the mean NND to the standard deviation, here termed the ‘conformity ratio.’ However, an accurate baseline value of this ratio for bounded random samples has never been determined; nor was its sampling distribution known, making it impossible to test its significance. Instead, significance was assessed from goodness-of-fit to a Rayleigh distribution, or from another ratio, that of the observed mean NND to an expected mean predicted by theory, termed the dispersion index. Neither approach allows for boundary effects that are common in experimental mosaics. Equally common are ‘missing’ neurons, whose effects on the statistics have not been studied. To address these deficiencies, random patterns and real neuronal mosaics were analyzed statistically. Ns independent random-point samples of size Np were generated for 13 Np values between 25 and 6400, where Ns × Np ≥ 144,000. Samples were generated with rectangular boundaries of aspect ratio 1:1, 1:5, and 1:10 to examine the influence of sample geometry. NND distributions, conformity ratios, and dispersion indices were computed for the resulting 45,997 independent random patterns. From these, empirical sampling distributions and critical values were determined. NND distributions for small-to-medium, bounded, random populations were shown to differ significantly from Rayleigh distributions. Thus, goodness-of-fit tests are invalid for most experimental mosaics. Charts are presented from which the significance of conformity ratios or dispersion indices can be read directly. The conformity ratio reacts conservatively to extremes of sample geometry, and provides a useful and safe test. The dispersion index is nonconservative, making its use problematic. Tests based on the theoretical distribution of the dispersion index are unreliable for all but the largest samples. Random deletions were also made from 33 real retinal ganglion cell mosaics. The mean NND, conformity ratio, and dispersion index were determined for each original mosaic and 36 independent samples at each of nine sampling levels, retaining between 90% and 10% of the original population. An exclusion radius, based on a spatial autocorrelogram, was also calculated for each of these 10,725 mosaic samples. The mean NND was moderately insensitive to undersampling, rising smoothly. The exclusion radius was remarkably insensitive. The conformity ratio and dispersion index fell steeply, sometimes failing to reach significance while half of the cells still remained. For the same 33 original mosaics, linear regression showed the exclusion radius to be 62 ± 3% of the mean NND.


2020 ◽  
Vol 20 (02) ◽  
pp. 2050015
Author(s):  
A. Sinduja ◽  
A. Suruliandi ◽  
S. P. Raja

The liver cancer is one of the most common fatal diseases worldwide, and its early detection through medical imaging is a major contributor to the reduction in mortality from certain cancer. This paves the way to work on diagnosing liver diseases effectively. An accurate diagnosis of liver disease in CT image requires an efficient description of textures and classification methods. This paper performs comparative analysis on proposed texture feature descriptor with the different existing texture features with various classifiers to classify six types of diffused and focal liver diseases. The classification of liver diseases is done in two stages. In first stage, features like segmentation based fractal texture analysis, counting label occurrence matrix, local configuration pattern, eXtended center-symmetric local binary pattern and the proposed local symmetric tetra pattern are used for extracting information from the CT liver structure and classifiers like support vector machine, [Formula: see text]-nearest neighbor, and naive Bayes are used for classifying the pathologic liver. When pathologic conditions are detected, the best feature descriptors and classifiers are used to classify the results into any of six exclusive pathologic liver diseases, in second stage. The experiments are carried out in medically validated liver datasets containing normal and six-disease category of liver. The first experiment is analyzed using sensitivity, specificity, and accuracy. The second experiment is evaluated using precision, recall, BCR, and F-measure. The results demonstrate that the local symmetric tetra pattern with [Formula: see text]-nearest neighbor classifier culminates in a state-of-the-art performance for diagnosing liver diseases.


Author(s):  
J. M. Oblak ◽  
W. H. Rand

The energy of an a/2 <110> shear antiphase. boundary in the Ll2 expected to be at a minimum on {100} cube planes because here strue ture is there is no violation of nearest-neighbor order. The latter however does involve the disruption of second nearest neighbors. It has been suggested that cross slip of paired a/2 <110> dislocations from octahedral onto cube planes is an important dislocation trapping mechanism in Ni3Al; furthermore, slip traces consistent with cube slip are observed above 920°K.Due to the high energy of the {111} antiphase boundary (> 200 mJ/m2), paired a/2 <110> dislocations are tightly constricted on the octahedral plane and cannot be individually resolved.


Author(s):  
S. R. Herd ◽  
P. Chaudhari

Electron diffraction and direct transmission have been used extensively to study the local atomic arrangement in amorphous solids and in particular Ge. Nearest neighbor distances had been calculated from E.D. profiles and the results have been interpreted in terms of the microcrystalline or the random network models. Direct transmission electron microscopy appears the most direct and accurate method to resolve this issue since the spacial resolution of the better instruments are of the order of 3Å. In particular the tilted beam interference method is used regularly to show fringes corresponding to 1.5 to 3Å lattice planes in crystals as resolution tests.


Sign in / Sign up

Export Citation Format

Share Document