One-Way between Subjects Design: Simulated Data and Analysis Using SAS

An SAS program enables instructors to provide individual students with simulated data for the one-way between subjects design. The instructor chooses the starting values: means, standard deviations, and number of subjects. For each student, the program produces an ASCII data file that can be analyzed by calculator or by many statistical software packages. For the instructor, the program produces a summary analysis of variance table for each analysis. Individual student names appear on the data sets and the summary file for the instructor.

Download Full-text

Framework for Multivariate Selectivity Analysis, Part I: Theoretical and Practical Merits

Applied Spectroscopy ◽

10.1366/0003702054280621 ◽

2005 ◽

Vol 59 (6) ◽

pp. 787-803 ◽

Cited By ~ 16

Author(s):

Christopher D. Brown ◽

Trent D. Ridder

Keyword(s):

Near Infrared ◽

Clinical Laboratory ◽

Simulated Data ◽

Calibration Method ◽

National Committee ◽

Data Sets ◽

Calibration Methods ◽

Inverse Calibration ◽

The One ◽

Definition Of

A number of definitions of multivariate selectivity have been proposed in the literature. Arguably, the one that enjoys the greatest chemometric attention has been the net analyte signal (NAS) based definitions of Lorber and Zinn. Recent works have suggested that similar inference can be made for inverse least-squares calibration methods (e.g., principal components regression). However, the properties of inverse calibration methods are markedly different than classical methods, so in many practical cases involving inverse models classically derived figures of merit cannot be transparently interpreted. In Part I of this work, we discuss a selectivity framework that is theoretically consistent regardless of the calibration method. Importantly, it is also experimentally measurable, either through controlled selectivity experiments, or through analysis on opportunistically acquired sample measurements. It is statistically advantageous to use the former if such control is achievable. Selectivity is defined to be a function of the change in predicted analyte concentration that will result from a change in the concentration of an interferant, an approach consistent with traditional definitions of analytical selectivity and National Committee for Clinical Laboratory Standards recommendations for interference testing. Unlike the NAS-based definition of selectivity, the definition discussed herein is relevant to only a particular analyte–interferant pair. The theoretical and experimental aspects of this approach are illustrated with simulated data herein and in Part II of this paper, which investigates several experimental near-infrared data sets.

Download Full-text

Mean reversion in corporate leverage: evidence from India

Managerial Finance ◽

10.1108/mf-09-2018-0425 ◽

2019 ◽

Vol 45 (9) ◽

pp. 1183-1198

Author(s):

Gaurav S. Chauhan ◽

Pradip Banerjee

Keyword(s):

Capital Structure ◽

Emerging Market ◽

Simulated Data ◽

Mean Reversion ◽

Developed Countries ◽

Data Sets ◽

Debt Ratio ◽

Testing Strategy ◽

Content Type ◽

Financing Behavior

Purpose Recent papers on target capital structure show that debt ratio seems to vary widely in space and time, implying that the functional specifications of target debt ratios are of little empirical use. Further, target behavior cannot be adjudged correctly using debt ratios, as they could revert due to mechanical reasons. The purpose of this paper is to develop an alternative testing strategy to test the target capital structure. Design/methodology/approach The authors make use of a major “shock” to the debt ratios as an event and think of a subsequent reversion as a movement toward a mean or target debt ratio. By doing this, the authors no longer need to identify target debt ratios as a function of firm-specific variables or any other rigid functional form. Findings Similar to the broad empirical evidence in developed economies, there is no perceptible and systematic mean reversion by Indian firms. However, unlike developed countries, proportionate usage of debt to finance firms’ marginal financing deficits is extensive; equity is used rather sparingly. Research limitations/implications The trade-off theory could be convincingly refuted at least for the emerging market of India. The paper here stimulated further research on finding reasons for specific financing behavior of emerging market firms. Practical implications The results show that the firms’ financing choices are not only depending on their own firm’s specific variables but also on the financial markets in which they operate. Originality/value This study attempts to assess mean reversion in debt ratios in a unique but reassuring manner. The results are confirmed by extensive calibration of the testing strategy using simulated data sets.

Download Full-text

Estimating Simultaneous Equation Models through an Entropy-Based Incremental Variational Bayes Learning Algorithm

Entropy ◽

10.3390/e23040384 ◽

2021 ◽

Vol 23 (4) ◽

pp. 384

Author(s):

Rocío Hernández-Sanjaime ◽

Martín González ◽

Antonio Peñalver ◽

Jose J. López-Espín

Keyword(s):

Statistical Theory ◽

Learning Algorithm ◽

Real Life ◽

Simulated Data ◽

Simultaneous Equation ◽

Variational Bayes ◽

Parameter Estimates ◽

Step Method ◽

The One ◽

Simultaneous Equation Models

The presence of unaccounted heterogeneity in simultaneous equation models (SEMs) is frequently problematic in many real-life applications. Under the usual assumption of homogeneity, the model can be seriously misspecified, and it can potentially induce an important bias in the parameter estimates. This paper focuses on SEMs in which data are heterogeneous and tend to form clustering structures in the endogenous-variable dataset. Because the identification of different clusters is not straightforward, a two-step strategy that first forms groups among the endogenous observations and then uses the standard simultaneous equation scheme is provided. Methodologically, the proposed approach is based on a variational Bayes learning algorithm and does not need to be executed for varying numbers of groups in order to identify the one that adequately fits the data. We describe the statistical theory, evaluate the performance of the suggested algorithm by using simulated data, and apply the two-step method to a macroeconomic problem.

Download Full-text

Analysis of Risk Factors in Dementia Through Machine Learning

Journal of Alzheimer s Disease ◽

10.3233/jad-200955 ◽

2020 ◽

pp. 1-17

Author(s):

Francisco Javier Balea-Fernandez ◽

Beatriz Martinez-Vega ◽

Samuel Ortega ◽

Himar Fabelo ◽

Raquel Leon ◽

...

Keyword(s):

Machine Learning ◽

Optimization Algorithms ◽

Progressive Increase ◽

Control Group ◽

Data Sets ◽

Modifiable Factors ◽

Validation Set ◽

The One ◽

And Control ◽

Potential Tool

Background: Sociodemographic data indicate the progressive increase in life expectancy and the prevalence of Alzheimer’s disease (AD). AD is raised as one of the greatest public health problems. Its etiology is twofold: on the one hand, non-modifiable factors and on the other, modifiable. Objective: This study aims to develop a processing framework based on machine learning (ML) and optimization algorithms to study sociodemographic, clinical, and analytical variables, selecting the best combination among them for an accurate discrimination between controls and subjects with major neurocognitive disorder (MNCD). Methods: This research is based on an observational-analytical design. Two research groups were established: MNCD group (n = 46) and control group (n = 38). ML and optimization algorithms were employed to automatically diagnose MNCD. Results: Twelve out of 37 variables were identified in the validation set as the most relevant for MNCD diagnosis. Sensitivity of 100%and specificity of 71%were achieved using a Random Forest classifier. Conclusion: ML is a potential tool for automatic prediction of MNCD which can be applied to relatively small preclinical and clinical data sets. These results can be interpreted to support the influence of the environment on the development of AD.

Download Full-text

Different algorithms, different models

Quality & Quantity ◽

10.1007/s11135-021-01193-9 ◽

2021 ◽

Author(s):

Martyna Daria Swiatczak

Keyword(s):

Comparative Analysis ◽

Real World ◽

Qualitative Comparative Analysis ◽

Comparative Methods ◽

Data Sets ◽

Simulation Studies ◽

Threshold Values ◽

Real World Data ◽

Software Packages ◽

Methodological Approaches

AbstractThis study assesses the extent to which the two main Configurational Comparative Methods (CCMs), i.e. Qualitative Comparative Analysis (QCA) and Coincidence Analysis (CNA), produce different models. It further explains how this non-identity is due to the different algorithms upon which both methods are based, namely QCA’s Quine–McCluskey algorithm and the CNA algorithm. I offer an overview of the fundamental differences between QCA and CNA and demonstrate both underlying algorithms on three data sets of ascending proximity to real-world data. Subsequent simulation studies in scenarios of varying sample sizes and degrees of noise in the data show high overall ratios of non-identity between the QCA parsimonious solution and the CNA atomic solution for varying analytical choices, i.e. different consistency and coverage threshold values and ways to derive QCA’s parsimonious solution. Clarity on the contrasts between the two methods is supposed to enable scholars to make more informed decisions on their methodological approaches, enhance their understanding of what is happening behind the results generated by the software packages, and better navigate the interpretation of results. Clarity on the non-identity between the underlying algorithms and their consequences for the results is supposed to provide a basis for a methodological discussion about which method and which variants thereof are more successful in deriving which search target.

Download Full-text

Constructing Large-Scale Genetic Maps Using an Evolutionary Strategy Algorithm

Genetics ◽

10.1093/genetics/165.4.2269 ◽

2003 ◽

Vol 165 (4) ◽

pp. 2269-2282

Author(s):

D Mester ◽

Y Ronin ◽

D Minkov ◽

E Nevo ◽

A Korol

Keyword(s):

Discrete Optimization ◽

High Performance ◽

Large Scale ◽

Simulated Data ◽

Real Data ◽

Genetic Maps ◽

Chromosome 1 ◽

Evolutionary Strategy ◽

Group A ◽

The One

Abstract This article is devoted to the problem of ordering in linkage groups with many dozens or even hundreds of markers. The ordering problem belongs to the field of discrete optimization on a set of all possible orders, amounting to n!/2 for n loci; hence it is considered an NP-hard problem. Several authors attempted to employ the methods developed in the well-known traveling salesman problem (TSP) for multilocus ordering, using the assumption that for a set of linked loci the true order will be the one that minimizes the total length of the linkage group. A novel, fast, and reliable algorithm developed for the TSP and based on evolution-strategy discrete optimization was applied in this study for multilocus ordering on the basis of pairwise recombination frequencies. The quality of derived maps under various complications (dominant vs. codominant markers, marker misclassification, negative and positive interference, and missing data) was analyzed using simulated data with ∼50-400 markers. High performance of the employed algorithm allows systematic treatment of the problem of verification of the obtained multilocus orders on the basis of computing-intensive bootstrap and/or jackknife approaches for detecting and removing questionable marker scores, thereby stabilizing the resulting maps. Parallel calculation technology can easily be adopted for further acceleration of the proposed algorithm. Real data analysis (on maize chromosome 1 with 230 markers) is provided to illustrate the proposed methodology.

Download Full-text

Spectral Convolution Feature-Based SPD Matrix Representation for Signal Detection Using a Deep Neural Network

Entropy ◽

10.3390/e22090949 ◽

2020 ◽

Vol 22 (9) ◽

pp. 949

Author(s):

Jiangyi Wang ◽

Min Liu ◽

Xinwu Zeng ◽

Xiaoqiang Hua

Keyword(s):

Neural Network ◽

Signal Detection ◽

Convolutional Neural Network ◽

Deep Neural Network ◽

Detection Method ◽

Learning Algorithm ◽

Simulated Data ◽

Data Sets ◽

Feature Maps ◽

Simulated Data Sets

Convolutional neural networks have powerful performances in many visual tasks because of their hierarchical structures and powerful feature extraction capabilities. SPD (symmetric positive definition) matrix is paid attention to in visual classification, because it has excellent ability to learn proper statistical representation and distinguish samples with different information. In this paper, a deep neural network signal detection method based on spectral convolution features is proposed. In this method, local features extracted from convolutional neural network are used to construct the SPD matrix, and a deep learning algorithm for the SPD matrix is used to detect target signals. Feature maps extracted by two kinds of convolutional neural network models are applied in this study. Based on this method, signal detection has become a binary classification problem of signals in samples. In order to prove the availability and superiority of this method, simulated and semi-physical simulated data sets are used. The results show that, under low SCR (signal-to-clutter ratio), compared with the spectral signal detection method based on the deep neural network, this method can obtain a gain of 0.5–2 dB on simulated data sets and semi-physical simulated data sets.

Download Full-text

On the Distribution of Various Sums of Squares in an Analysis of Variance Table for Different Classifications with Correlated and Non-Homogeneous Errors

Journal of the Royal Statistical Society Series B (Methodological) ◽

10.1111/j.2517-6161.1959.tb00319.x ◽

1959 ◽

Vol 21 (1) ◽

pp. 114-119

Author(s):

B. R. Bhat

Keyword(s):

Analysis Of Variance ◽

Sums Of Squares ◽

Variance Table

Download Full-text

Model Comparison in the One-Way Between-Subjects Design: An Sas Implementation

Perceptual and Motor Skills ◽

10.2466/pms.1992.75.3f.1124 ◽

1992 ◽

Vol 75 (3_suppl) ◽

pp. 1124-1126

Author(s):

John F. Walsh

Keyword(s):

Analysis Of Variance ◽

Model Comparison ◽

Statistical Test ◽

Statistical System ◽

Test Statistic ◽

Cell Means ◽

Treatment Conditions ◽

Competing Models ◽

Proportional Increase ◽

The One

A statistical test is developed based on the comparison of sums of squared errors associated with two competing models. A model based on cell means is compared to a representation that specifies the means for the treatment conditions. Comparing models is more general than the traditional H0 in analysis of variance wherein all the cell means are assumed equal. The test statistic, Proportional Increase in Error, is computed using the SAS statistical system.

Download Full-text

How the one-way analysis of variance model is affected by the degree of precision of the data

Journal of Applied Statistics ◽

10.1080/02664769200000005 ◽

1992 ◽

Vol 19 (1) ◽

pp. 61-73 ◽

Cited By ~ 1

Author(s):

A. R. Tricker

Keyword(s):

Analysis Of Variance ◽

Variance Model ◽

The One

Download Full-text