Application of Inductive Modeling Principles to Solve the Double Clustering Problems

Author(s):  
Volodymyr Osypenko ◽  
Valentyna Osypenko

2018 ◽  
Author(s):  
Jordan Stevens ◽  
Douglas Steinley ◽  
Cassandra L. Boness ◽  
Timothy J. Trull ◽  
...  

Using complete enumeration (i.e., generating all possible subsets of item combinations) to evaluate clustering problems has the benefit of locating globally optimal solutions automatically, without concern for sampling variability. The proposed method combines clustering variables in such a way as to create groups that are maximally different on one or more theoretically sound derivation variables. Once the population of all unique item sets has been enumerated, optimization over some predefined, user-specified function can occur. We apply this technique to optimizing the diagnosis of Alcohol Use Disorder. From a clustering point of view, this is a unique application in that the decision rule for clustering observations into the diagnosis group relies on both the set of items being considered and a predefined threshold on the number of items that must be endorsed for the diagnosis to occur. In optimizing diagnostic rules, criteria set sizes can be reduced without significant loss of information when compared with current and proposed alternative diagnostic schemes.
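As a rough illustration of how such a complete-enumeration search might be organized (the function names, data layout, and objective below are our own assumptions, not the authors' implementation): every subset of candidate criteria is paired with every endorsement threshold, and each resulting diagnostic rule is scored by how well it separates the diagnosed and undiagnosed groups on a derivation variable.

```python
# Illustrative sketch (not the authors' code): exhaustively enumerate
# candidate criterion subsets and endorsement thresholds, scoring each
# rule by how well it separates groups on a derivation variable.
from itertools import combinations

def enumerate_rules(items, responses, severity, objective):
    """items: list of criterion names.
    responses: dict mapping person -> set of endorsed items.
    severity: dict mapping person -> derivation-variable score.
    objective: callable scoring (diagnosed, undiagnosed) severity lists."""
    best_rule, best_score = None, float("-inf")
    for k in range(1, len(items) + 1):            # subset size
        for subset in combinations(items, k):     # all unique item sets
            for threshold in range(1, k + 1):     # endorsement cutoffs
                dx = [severity[p] for p, e in responses.items()
                      if len(e & set(subset)) >= threshold]
                no_dx = [severity[p] for p, e in responses.items()
                         if len(e & set(subset)) < threshold]
                score = objective(dx, no_dx)
                if score > best_score:
                    best_rule, best_score = (subset, threshold), score
    return best_rule, best_score

# Example objective (an assumption, for illustration only): absolute
# difference in mean severity between the two groups.
def mean_gap(dx, no_dx):
    if not dx or not no_dx:
        return float("-inf")
    return abs(sum(dx) / len(dx) - sum(no_dx) / len(no_dx))
```

Note that the search space grows exponentially in the number of items, which is why the approach suits modest criteria sets such as diagnostic checklists.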


2006 ◽  
Vol 31 (5) ◽  
pp. 5 ◽  
Author(s):  
Marco Sinnema ◽  
Jan Salvador van der Ven ◽  
Sybren Deelstra

2010 ◽  
Vol 57 (2) ◽  
pp. 1-32 ◽  
Author(s):  
Amit Kumar ◽  
Yogish Sabharwal ◽  
Sandeep Sen

1996 ◽  
Vol 9 (3) ◽  
pp. 229-239 ◽  
Author(s):  
Santosh Kabadi ◽  
Katta G. Murty ◽  
Cosimo Spera

2009 ◽  
Vol 20 (02) ◽  
pp. 361-377 ◽  
Author(s):  
Danny Z. Chen ◽  
Mark A. Healy ◽  
Chao Wang ◽  
Bin Xu

In this paper, we present efficient geometric algorithms for the discrete constrained 1-D K-means clustering problem and extend our solutions to the continuous version of the problem. One key clustering constraint we consider is that the maximum difference between points within each cluster cannot exceed a given threshold. These constrained 1-D K-means clustering problems arise in various applications, especially in intensity-modulated radiation therapy (IMRT). Our algorithms improve the efficiency and accuracy of the heuristic approaches used in clinical IMRT treatment planning.
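To make the problem structure concrete: in 1-D, optimal K-means clusters are contiguous once the points are sorted, so the constrained problem admits a straightforward dynamic program. The sketch below is our own baseline illustration (the paper's geometric algorithms are considerably faster), with a hypothetical max_diff parameter standing in for the threshold constraint.

```python
# A plain dynamic-programming sketch of constrained 1-D K-means
# (O(K * n^2); shown only to illustrate the problem structure, not the
# paper's more efficient geometric algorithms). A cluster covering the
# sorted points x[i..j] is feasible only when x[j] - x[i] <= max_diff.
def constrained_1d_kmeans(points, K, max_diff):
    x = sorted(points)
    n = len(x)
    # Prefix sums give O(1) within-cluster sum-of-squared-error cost.
    s = [0.0] * (n + 1)
    s2 = [0.0] * (n + 1)
    for i, v in enumerate(x):
        s[i + 1] = s[i] + v
        s2[i + 1] = s2[i] + v * v

    def cost(i, j):  # SSE of a cluster covering x[i..j], inclusive
        m = j - i + 1
        seg = s[j + 1] - s[i]
        return s2[j + 1] - s2[i] - seg * seg / m

    INF = float("inf")
    # dp[k][j]: best cost of partitioning x[0..j] into k feasible clusters
    dp = [[INF] * n for _ in range(K + 1)]
    for j in range(n):
        if x[j] - x[0] <= max_diff:
            dp[1][j] = cost(0, j)
    for k in range(2, K + 1):
        for j in range(n):
            for i in range(1, j + 1):        # last cluster = x[i..j]
                if x[j] - x[i] > max_diff:
                    continue
                cand = dp[k - 1][i - 1] + cost(i, j)
                if cand < dp[k][j]:
                    dp[k][j] = cand
    return dp[K][n - 1]  # optimal SSE; INF if no feasible K-clustering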


2021 ◽  
Author(s):  
Xian Wu ◽  
Tianfang Zhou ◽  
Kaixiang Yi ◽  
Minrui Fei ◽  
Yayu Chen ◽  
...  

2021 ◽  
Author(s):  
Boxiao Li ◽  
Hemant Phale ◽  
Yanfen Zhang ◽  
Timothy Tokar ◽  
Xian-Huan Wen

Abstract: Design of Experiments (DoE) is one of the most commonly employed techniques in the petroleum industry for Assisted History Matching (AHM) and uncertainty analysis of reservoir production forecasts. Although conceptually straightforward, DoE is often misused by practitioners because many of its statistical and modeling principles are not carefully followed. Our earlier paper (Li et al. 2019) detailed the best practices in DoE-based AHM for brownfields. To the best of our knowledge, however, no study has summarized the common caveats and pitfalls in DoE-based production-forecast uncertainty analysis for greenfields and history-matched brownfields. Our objective here is to summarize these caveats and pitfalls to help practitioners apply the correct principles for DoE-based production-forecast uncertainty analysis.

Over 60 common pitfalls across all stages of a DoE workflow are summarized. Special attention is paid to three critical project transitions: (1) from static earth modeling to dynamic reservoir simulation; (2) from AHM to production forecast; and (3) from analyzing subsurface uncertainties to analyzing field-development alternatives. Most pitfalls can be avoided by consistently following the statistical and modeling principles. Some pitfalls, however, can trap even experienced engineers. For example, mistakes made in handling the three transitions above can yield highly unreliable proxy models and sensitivity analyses: in the representative examples we study, they reduce the proxy R² from above 0.9 (when handled correctly) to below 0.2. Two improved experimental designs are created to resolve this challenge.

Beyond the technical pitfalls that are avoidable via robust statistical workflows, we also highlight the often more severe non-technical pitfalls that cannot be evaluated by measures such as R². We share thoughts on how these can be avoided, especially during project framing and the three critical transitions.
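One concrete safeguard implied by the proxy-R² discussion is to validate a DoE-based proxy against blind simulation runs rather than trusting its fit to the training design alone. The sketch below is a minimal illustration of that check under our own assumptions: a space-filling Latin hypercube design, a quadratic polynomial proxy, and a toy response function standing in for the reservoir simulator. None of this is the paper's workflow; it only shows the mechanics of an out-of-sample R² check.

```python
# Hypothetical sketch: fit a quadratic proxy on a Latin hypercube design
# and score it with R^2 on held-out (blind) runs. The "simulator" is a
# toy stand-in; in practice it would be the reservoir-simulation response.
import numpy as np
from scipy.stats import qmc

def simulator(X):  # placeholder response, not a real reservoir model
    return 3.0 * X[:, 0] - 2.0 * X[:, 1] ** 2 + 0.5 * X[:, 0] * X[:, 1]

def quadratic_features(X):
    # Columns: intercept, linear terms, then all second-order terms.
    d = X.shape[1]
    cols = [np.ones(len(X))]
    cols += [X[:, i] for i in range(d)]
    cols += [X[:, i] * X[:, j] for i in range(d) for j in range(i, d)]
    return np.column_stack(cols)

sampler = qmc.LatinHypercube(d=2, seed=0)
X_train, X_test = sampler.random(40), sampler.random(20)  # space-filling
y_train, y_test = simulator(X_train), simulator(X_test)

beta, *_ = np.linalg.lstsq(quadratic_features(X_train), y_train, rcond=None)
pred = quadratic_features(X_test) @ beta
ss_res = np.sum((y_test - pred) ** 2)
ss_tot = np.sum((y_test - y_test.mean()) ** 2)
print(f"blind-test R^2 = {1 - ss_res / ss_tot:.3f}")  # low values flag a bad proxy
```

A proxy that scores well on its own design points but poorly on blind runs is exactly the failure mode the abstract describes at the project transitions.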

