An Optimal and Stable Algorithm for Clustering Numerical Data

In the conventional k-means framework, seeding is the first step toward optimization before the objects are clustered. In random seeding, two main issues arise: the clustering results may be less than optimal and different clustering results may be obtained for every run. In real-world applications, optimal and stable clustering is highly desirable. This report introduces a new clustering algorithm called the zero k-approximate modal haplotype (Zk-AMH) algorithm that uses a simple and novel seeding mechanism known as zero-point multidimensional spaces. The Zk-AMH provides cluster optimality and stability, therefore resolving the aforementioned issues. Notably, the Zk-AMH algorithm yielded identical mean scores to maximum, and minimum scores in 100 runs, producing zero standard deviation to show its stability. Additionally, when the Zk-AMH algorithm was applied to eight datasets, it achieved the highest mean scores for four datasets, produced an approximately equal score for one dataset, and yielded marginally lower scores for the other three datasets. With its optimality and stability, the Zk-AMH algorithm could be a suitable alternative for developing future clustering tools.

Download Full-text

Zero Knowledge Proofs

Algorithmic Strategies for Solving Complex Problems in Cryptography - Advances in Information Security, Privacy, and Ethics ◽

10.4018/978-1-5225-2915-6.ch009 ◽

2018 ◽

pp. 111-123 ◽

Cited By ~ 1

Author(s):

Kannan Balasubramanian ◽

Mala K.

Keyword(s):

Real World ◽

The Other ◽

Interactive Proof ◽

Zero Knowledge ◽

Practical Applications ◽

Proof Of Knowledge ◽

Real World Applications ◽

Special Case ◽

Zero Knowledge Proof

Zero knowledge protocols provide a way of proving that a statement is true without revealing anything other than the correctness of the claim. Zero knowledge protocols have practical applications in cryptography and are used in many applications. While some applications only exist on a specification level, a direction of research has produced real-world applications. Zero knowledge protocols, also referred to as zero knowledge proofs, are a type of protocol in which one party, called the prover, tries to convince the other party, called the verifier, that a given statement is true. Sometimes the statement is that the prover possesses a particular piece of information. This is a special case of zero knowledge protocol called a zero-knowledge proof of knowledge. Formally, a zero-knowledge proof is a type of interactive proof.

Download Full-text

The Effect of Proposal Appearance on the Technical Evaluation Scoring of Government Proposals

Journal of Technical Writing and Communication ◽

10.2190/fgvj-ykk8-84x2-f8y7 ◽

1977 ◽

Vol 7 (4) ◽

pp. 285-293 ◽

Cited By ~ 1

Author(s):

Robert D. Dycus

Keyword(s):

Factorial Design ◽

Real World ◽

General Model ◽

The Other ◽

Design Experiment ◽

Transparent Plastic ◽

Real World Applications ◽

Color Printing ◽

Technical Evaluation ◽

Factorial Design Experiment

The effect of proposal appearance on technical evaluation scoring was examined experimentally. Two mock proposals were prepared—one from the A Corporation and the other from the B Corporation. Each proposal was prepared in two versions—a “nice” appearing version (stylized “logoed” pages, offset two-color printing, heavy paper stock, plastic 19-ring spiral binding), and a “poor” appearing version (single-spaced typed pages, xerox reproduction, cheap transparent plastic cover, staple binding.) The proposals were scored against a set of eight evaluation questions by twenty-eight experienced government evaluators in a 2 × 2 factorial design experiment. No statistically significant effects of appearance on evaluation scoring were detected. A general model is presented that describes impression in terms of proposal appearance versus proposal thought content. The experiment is interpreted in terms of this model, and “real-world” applications of the model are discussed.

Download Full-text

Semi-Supervised Outlier Detection with Only Positive and Unlabeled Data Based on Fuzzy Clustering

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213015500037 ◽

2015 ◽

Vol 24 (03) ◽

pp. 1550003 ◽

Cited By ~ 1

Author(s):

Armin Daneshpazhouh ◽

Ashkan Sami

Keyword(s):

Intrusion Detection ◽

Outlier Detection ◽

Fuzzy Clustering ◽

Real World ◽

State Of The Art ◽

Real Data ◽

Experimental Results ◽

The Other ◽

Data Sets ◽

Real World Applications

The task of semi-supervised outlier detection is to find the instances that are exceptional from other data, using some labeled examples. In many applications such as fraud detection and intrusion detection, this issue becomes more important. Most existing techniques are unsupervised. On the other hand, semi-supervised approaches use both negative and positive instances to detect outliers. However, in many real world applications, very few positive labeled examples are available. This paper proposes an innovative approach to address this problem. The proposed method works as follows. First, some reliable negative instances are extracted by a kNN-based algorithm. Afterwards, fuzzy clustering using both negative and positive examples is utilized to detect outliers. Experimental results on real data sets demonstrate that the proposed approach outperforms the previous unsupervised state-of-the-art methods in detecting outliers.

Download Full-text

A hierarchical ant based clustering algorithm and its use in three real-world applications

European Journal of Operational Research ◽

10.1016/j.ejor.2005.03.062 ◽

2007 ◽

Vol 179 (3) ◽

pp. 906-922 ◽

Cited By ~ 54

Author(s):

Hanene Azzag ◽

Gilles Venturini ◽

Antoine Oliver ◽

Christiane Guinot

Keyword(s):

Real World ◽

Clustering Algorithm ◽

Real World Applications

Download Full-text

Terahertz Multilayer Thickness Measurements: Comparison of Optoelectronic Time and Frequency Domain Systems

Journal of Infrared Millimeter and Terahertz Waves ◽

10.1007/s10762-021-00831-5 ◽

2021 ◽

Author(s):

Lars Liebermeister ◽

Simon Nellen ◽

Robert B. Kohlhaas ◽

Sebastian Lauck ◽

Milan Deumer ◽

...

Keyword(s):

Standard Deviation ◽

Frequency Domain ◽

Time Domain ◽

Real World ◽

State Of The Art ◽

Single Layer ◽

Industrial Applications ◽

Starting Point ◽

Real World Applications ◽

Thickness Measurements

AbstractWe compare a state-of-the-art terahertz (THz) time domain spectroscopy (TDS) system and a novel optoelectronic frequency domain spectroscopy (FDS) system with respect to their performance in layer thickness measurements. We use equal sample sets, THz optics, and data evaluation methods for both spectrometers. On single-layer and multi-layer dielectric samples, we found a standard deviation of thickness measurements below 0.2 µm for TDS and below 0.5 µm for FDS. This factor of approx. two between the accuracy of both systems reproduces well for all samples. Although the TDS system achieves higher accuracy, FDS systems can be a competitive alternative for two reasons. First, the architecture of an FDS system is essentially simpler, and thus the price can be much lower compared to TDS. Second, an accuracy below 1 µm is sufficient for many real-world applications. Thus, this work may be a starting point for a comprehensive cross comparison of different terahertz systems developed for specific industrial applications.

Download Full-text

Planning with Preferences

AI Magazine ◽

10.1609/aimag.v29i4.2204 ◽

2008 ◽

Vol 29 (4) ◽

pp. 25 ◽

Cited By ~ 27

Author(s):

Jorge A, Baier ◽

Sheila A. McIlraith

Keyword(s):

Real World ◽

Automated Planning ◽

The Other ◽

Planning Systems ◽

High Quality ◽

Planning Techniques ◽

Preference Representation ◽

Real World Applications ◽

Initial States ◽

Set Of Initial States

Automated Planning is an old area of AI that focuses on the development of techniques for finding a plan that achieves a given goal from a given set of initial states as quickly as possible. In most real-world applications, users of planning systems have preferences over the multitude of plans that achieve a given goal. These preferences allow to distinguish plans that are more desirable from those that are less desirable. Planning systems should therefore be able to construct high-quality plans, or at the very least they should be able to build plans that have a reasonably good quality given the resources available.In the last few years we have seen a significant amount of research that has focused on developing rich and compelling languages for expressing preferences over plans. On the other hand, we have seen the development of planning techniques that aim at finding high-quality plans quickly, exploiting some of the ideas developed for classical planning. In this paper we review the latest developments in automated preference-based planning. We also review various approaches for preference representation, and the main practical approaches developed so far.

Download Full-text

A Graphical Diagnostic Test for Two-Way Contingency Tables

Revista Colombiana de Estadística ◽

10.15446/rce.v39n1.55142 ◽

2016 ◽

Vol 39 (1) ◽

pp. 97-108 ◽

Cited By ~ 1

Author(s):

Jorge Iván Vélez ◽

Fernando Marmolejo-Ramos ◽

Juan Carlos Correa

Keyword(s):

Diagnostic Test ◽

Real World ◽

Graphical Method ◽

Contingency Tables ◽

The Other ◽

Test Statistic ◽

Real World Applications

We propose and illustrate a new graphical method to perform diagnostic analyses in two-way contingency tables. In this method, one observation is added or removed from each cell at a time, whilst the other cells are held constant, and the change in a test statistic of interest is graphically represented. The method provides a very simple way of determining how robust our model is (and hence our conclusions) to small changes introduced to the data. We illustrate via four examples, three of them from real-world applications, how this method works.

Download Full-text

Deconstructing multivariate decoding for the study of brain function

10.1101/158493 ◽

2017 ◽

Cited By ~ 2

Author(s):

Martin N. Hebart ◽

Chris I. Baker

Keyword(s):

Data Analysis ◽

Real World ◽

Brain Function ◽

Univariate Analysis ◽

The Other ◽

List Type ◽

Dual Use ◽

Real World Applications ◽

Neuroimaging Data ◽

The Common

AbstractMultivariate decoding methods were developed originally as tools to enable accurate predictions in real-world applications. The realization that these methods can also be employed to study brain function has led to their widespread adoption in the neurosciences. However, prior to the rise of multivariate decoding, the study of brain function was firmly embedded in a statistical philosophy grounded on univariate methods of data analysis. In this way, multivariate decoding for brain interpretation grew out of two established frameworks: multivariate decoding for predictions in real-world applications, and classical univariate analysis based on the study and interpretation of brain activation. We argue that this led to two confusions, one reflecting a mixture of multivariate decoding for prediction or interpretation, and the other a mixture of the conceptual and statistical philosophies underlying multivariate decoding and classical univariate analysis. Here we attempt to systematically disambiguate multivariate decoding for the study of brain function from the frameworks it grew out of. After elaborating these confusions and their consequences, we describe six, often unappreciated, differences between classical univariate analysis and multivariate decoding. We then focus on how the common interpretation of what is signal and noise changes in multivariate decoding. Finally, we use four examples to illustrate where these confusions may impact the interpretation of neuroimaging data. We conclude with a discussion of potential strategies to help resolve these confusions in interpreting multivariate decoding results, including the potential departure from multivariate decoding methods for the study of brain function.HighlightsWe highlight two sources of confusion that affect the interpretation of multivariate decoding resultsOne confusion arises from the dual use of multivariate decoding for predictions in real-world applications and for interpretation in terms of brain functionThe other confusion arises from the different statistical and conceptual frameworks underlying classical univariate analysis to multivariate decodingWe highlight six differences between classical univariate analysis and multivariate decoding and differences in the interpretation of signal and noiseThese confusions are illustrated in four examples revealing assumptions and limitations of multivariate decoding for interpretation

Download Full-text

Unravelling pre-eruptive P-T conditions by machine learning

10.5194/egusphere-egu2020-19028 ◽

2020 ◽

Author(s):

Maurizio Petrelli ◽

Luca Caricchi ◽

Diego Perugini

Keyword(s):

Machine Learning ◽

Real World ◽

The Other ◽

Calibration Data ◽

Wide Range ◽

Volcanic Processes ◽

Real World Applications ◽

Complex Models ◽

Temperature And Pressure ◽

Use Of Models

Clinopyroxene based thermometers and barometers are widely used tools for estimating temperature and pressure conditions under which magmas are stored before eruptions.Several studies reported the development and the application of Clinopyroxene&#8211;liquid geothermobarometers in many different volcanic environments, also warning on the potential pitfall in using overly complex models [e.g., 1 and references therein]. The main drawback in the use of models with a large number of parameters is the potential overfitting of the calibration data, yielding a poor accuracy in real-world applications. On the other hand, simpler models cannot account for the complexity of natural magmatic systems, requiring different calibrations for different magma chemistries [e.g., 2, 3].In the present study, we report on the development of Clinopyroxene and Clinopyroxene-liquid thermometers and barometers in a wide range of P-T-X conditions using Machine Learning (ML) algorithms. To avoid overfitting and demonstrate the robustness of the different methods, we randomly split the dataset into training and validation portions and repeating this procedure up to 10000 times to trace the performance of each of the used algorithms. We compared the performance of ML algorithms with classical and established Clinopyroxene and Clinopyroxene-liquid thermometers and barometers using local and global calibrations. Finally, we applied the obtained thermometers and barometers to real study cases.&#160;[1]&#160;&#160;&#160;&#160;&#160; K. D. Putirka, Thermometers and barometers for volcanic systems, Minerals, Inclusions and Volcanic Processes, 69. 61&#8211;120, 2008.[2]&#160;&#160;&#160;&#160;&#160; D. A. Neave, K. D. Putirka, Am. Mineral., 2017, DOI:10.2138/am-2017-5968.[3]&#160;&#160;&#160;&#160;&#160; M. Masotta, S. Mollo, C. Freda, M. Gaeta, G. Moore, Contrib. to Mineral. Petrol., 2013, DOI:10.1007/s00410-013-0927-9.

Download Full-text

Designing Stochastic Optimization Algorithms for Real-world Applications

IEEJ Transactions on Electronics Information and Systems ◽

10.1541/ieejeiss.132.2 ◽

2012 ◽

Vol 132 (1) ◽

pp. 2-5

Author(s):

Hiroshi Someya ◽

Hisashi Handa ◽

Seiichi Koakutsu

Keyword(s):

Stochastic Optimization ◽

Real World ◽

Optimization Algorithms ◽

Real World Applications ◽

Stochastic Optimization Algorithms

Download Full-text