power of tests Latest Research Papers

HOW TO AVOID THE ZERO-POWER TRAP IN TESTING FOR CORRELATION

Econometric Theory ◽

10.1017/s0266466621000062 ◽

2021 ◽

pp. 1-33

Author(s):

David Preinerstorfer

Keyword(s):

Power Function ◽

Regression Models ◽

Strongly Correlated ◽

Correlated Errors ◽

Initial Test ◽

Power Of Tests ◽

Optimality Properties ◽

Theoretical Results ◽

Modified Test

In testing for correlation of the errors in regression models, the power of tests can be very low for strongly correlated errors. This counterintuitive phenomenon has become known as the “zero-power trap.” Despite a considerable amount of literature devoted to this problem, mainly focusing on its detection, a convincing solution has not yet been found. In this article, we first discuss theoretical results concerning the occurrence of the zero-power trap phenomenon. Then, we suggest and compare three ways to avoid it. Given an initial test that suffers from the zero-power trap, the method we recommend for practice leads to a modified test whose power converges to $1$ as the correlation gets very strong. Furthermore, the modified test has approximately the same power function as the initial test and thus approximately preserves all of its optimality properties. We also provide some numerical illustrations in the context of testing for network generated correlation.

Download Full-text

The Power of Tests

10.4324/9781003062318 ◽

2020 ◽

Author(s):

Elana Shohamy

Keyword(s):

Power Of Tests

Download Full-text

Тестування та оцінювання мовленнєвої компетенції: німецький досвід

East European Journal of Psycholinguistics ◽

10.29038/eejpl.2019.6.1.ita ◽

2019 ◽

Vol 6 (1) ◽

pp. 76-81

Author(s):

Інна Тарасюк

Keyword(s):

Language Learning ◽

Council Of Europe ◽

Critical Perspective ◽

Individualized Learning ◽

Learning And Teaching ◽

Power Of Tests ◽

Individuelle Förderung ◽

Language Tests ◽

Adult Immigrants ◽

Theorie Und Praxis

Оскільки навчання іноземної мови на сьогодні має абсолютно новий, сучасний підхід, оцінювання мовленнєвої компетенції повинно відповідати чітким міжнародним стандартизованим вимогам. Позаяк стаття має стислий інформативний характер, в її межах уточнено поняття оцінювання, як спонукання до відповідної мовленнєвої реакції або дії через чітко поставлені комунікативні завдання. У статті також відображено типи оцінювання, а саме формальне та неформальне, зовнішнє оцінювання та самоконтроль. Через те, що завжди важливо розуміти, з якою метою здійснюється оцінювання мовленнєвої компетенції, у статті виокремлено його функції, а саме: діагностування, сприяння, розпізнання прогресу, констатування досягнення цілей, встановлення рейтингу, виставлення оцінок, порівняння, мотивація та ін. Там, де дві особи, існують дві суб’єктивні картини об’єктивного світу: бачення ситуації тим, хто оцінює, і тим, кого оцінюють, то до уваги також взято психоемоційний аспект досліджуваного питання. Література References Ballweg, S. Drumm, S. Hufeisen, B. Klippel, J., Pilypaityte, L. (2013). Wie lernt man die Fremdsprache Deutsch? Deutsch Lehren Lernen. Band 2. München: Klett-Langenscheidt. Beurteilen im DaF-/DaZ-Unterricht Testen – Evaluieren – Prüfen Akten der Vierten Gesamtschweizerischen Tagung für Deutschlehrerinnen und Deutschlehrer 29. und 30. Juni 2012 – Universität Bern. M. Clalüna, B. Tscharner (Eds.). Impressum Käser Druck. Bolton, S., Glaboniat, M., Lorenz, H., Perlmann-Balme, M., Steiner, S. (2008). Mündlich: Mündliche Produktion und Interaktion Deutsch: Illustration der Niveaustufen des Gemeinsamen europäischen Referenzrahmens. München: Langenscheidt. Garme, B. (2005). Auf den Flügeln der Sprache: Ein diagnostisches Verfahren. In: Anforderungen an Verfahren der regelmäßigen Sprachstandsfeststellung als Grundlage für die frühe und individuelle Förderung von Kindern mit und ohne Migrationshintergrund. (pp. 241-260), K. Ehlich u.a. (Eds.). Bonn: BMBF. Grotjahn, R. (2010). Sprachtests: Formen und Funktionen. In: Handbuch Fremdsprachendidaktik. (pp. 211–215). W. Hallet, F. Königs (Eds.). Seelze-Velber: Kallmeyer. Grotjahn, R., Kleppin, K. (2015) Prüfen, Testen, Evaluieren Klett-Langenscheidt München. Kleppin, K. (2010): Fehleranalyse und Fehlerkorrektur. In: Deutsch als Fremd- und Zweitsprache: ein Internationales Handbuch (1.Halbband). (pp. 1060-1072). H.-J. Krumm (Ed.). Berlin: Mouton de Gruyter. Krumm, H.-J. (2001): Bildungsstandards und Kompetenzorientierung – Herausforderungen für das Fach Deutsch als Fremdsprache. In: Theorie und Praxis. Österreichische Beiträge zu Deutsch als Fremdsprache. Bd. 14/2010. (pp. 171–185). H.-J. Krumm, P. R. PortmannTselikas, (Eds.). Innsbruck: Studienverlag. Lengyel, D. (2010). Language Diagnostics in multilingual settings with respect to continuous procedures as accompaniment of individualized learning and teaching. Strasbourg: Council of Europe. Retrieved from: http://www.coe.int/t/dg4/linguistic/Source/Source2010_Forum Geneva/1_Diagnostic Lengyel_EN.pdf Roche, J. (2010) Fremdevaluation und Selbstevaluation. In: Handbuch Fremdsprachendidaktik. (pp. 228–231). W. Hallet, F. G. Königs. (Eds.). Seelze-Velber: Kallmeyer. Rumpf, H. (1996). Wirklichkeiten berühren. Umrisse einer neuen Lernkultur. Fragen und Versuche, 77, 8–22. Shohamy, E. (2001). The Power of Tests: A Critical Perspective on the Uses of Language Tests. Harlow: Pearson Education. Smit, R. (2008). Formative Beurteilung im kompetenz- und standardorientierten Unterricht. Beiträge zur Lehrerbildung, 26(3), 383–392. Studer, T. (2010). Kompetenzmodelle und Bildungsstandards für Deutsch als Fremd- und Deutsch als Zweitsprache. In: Deutsch als Fremd- und Zweitsprache. Ein internationales Handbuch. (pp. 1264–1271). H-J. Krumm, C. Fandrych, B. Hufeisen, C. Riemer (Eds.). Berlin: De Gruyter. Bd. 2, Art. 142. Van Avermaet, P., Gysen, S. (2008): Language Learning, Teaching and Assessment and the Integration of Adult Immigrants. The Importance of Needs Analysis. Strasbourg: Council of Europe. Retrieved from: http://www.coe.int/t/dg4/linguistic/MigrantsSemin08_MainDocs_ EN.asp.

Download Full-text

On the Asymptotic Power of Tests of Fit under Local Alternatives in Autoregression

Mathematical Methods of Statistics ◽

10.3103/s1066530719020042 ◽

2019 ◽

Vol 28 (2) ◽

pp. 144-154

Author(s):

M. V. Boldin

Keyword(s):

Local Alternatives ◽

Asymptotic Power ◽

Power Of Tests ◽

Tests Of Fit

Download Full-text

On the Conditional and Unconditional Type I Error Rates and Power of Tests in Linear Models with Heteroscedastic Errors

Journal of Modern Applied Statistical Methods ◽

10.22237/jmasm/1551966828 ◽

2019 ◽

Vol 17 (2) ◽

Cited By ~ 1

Author(s):

Patrick J. Rosopa ◽

Alice M. Brawley ◽

Theresa P. Atkinson ◽

Stephen A. Robertson

Keyword(s):

Linear Models ◽

Type I Error ◽

Weighted Least Squares ◽

Error Rates ◽

Type I ◽

Least Squares Regression ◽

Type I Error Rates ◽

Power Of Tests ◽

Wide Range ◽

Heteroscedastic Errors

Preliminary tests for homoscedasticity may be unnecessary in general linear models. Based on Monte Carlo simulations, results suggest that when testing for differences between independent slopes, the unconditional use of weighted least squares regression and HC4 regression performed the best across a wide range of conditions.

Download Full-text

Indices of Rank Histogram Flatness and Their Sampling Properties

Monthly Weather Review ◽

10.1175/mwr-d-18-0369.1 ◽

2019 ◽

Vol 147 (2) ◽

pp. 763-769 ◽

Cited By ~ 3

Author(s):

D. S. Wilks

Keyword(s):

Null Hypothesis ◽

Statistical Power ◽

Small Sample ◽

Sample Sizes ◽

Sampling Distributions ◽

Power Of Tests ◽

Rank Histogram ◽

Small Sample Sizes ◽

Formal Hypothesis Testing ◽

Two Alternatives

Abstract Quantitative evaluation of the flatness of the verification rank histogram can be approached through formal hypothesis testing. Traditionally, the familiar χ2 test has been used for this purpose. Recently, two alternatives—the reliability index (RI) and an entropy statistic (Ω)—have been suggested in the literature. This paper presents approximations to the sampling distributions of these latter two rank histogram flatness metrics, and compares the statistical power of tests based on the three statistics, in a controlled setting. The χ2 test is generally most powerful (i.e., most sensitive to violations of the null hypothesis of rank uniformity), although for overdispersed ensembles and small sample sizes, the test based on the entropy statistic Ω is more powerful. The RI-based test is preferred only for unbiased forecasts with small ensembles and very small sample sizes.

Download Full-text

Power in High‐Dimensional Testing Problems

Econometrica ◽

10.3982/ecta15844 ◽

2019 ◽

Vol 87 (3) ◽

pp. 1055-1069 ◽

Cited By ~ 1

Author(s):

Anders Bredahl Kock ◽

David Preinerstorfer

Keyword(s):

Asymptotic Normality ◽

Sample Size ◽

Parameter Space ◽

Sufficient Conditions ◽

High Dimensional ◽

Local Asymptotic Normality ◽

Asymptotic Power ◽

Asymptotic Size ◽

Power Of Tests ◽

Power Enhancement

Fan, Liao, and Yao (2015) recently introduced a remarkable method for increasing the asymptotic power of tests in high‐dimensional testing problems. If applicable to a given test, their power enhancement principle leads to an improved test that has the same asymptotic size, has uniformly non‐inferior asymptotic power, and is consistent against a strictly broader range of alternatives than the initially given test. We study under which conditions this method can be applied and show the following: In asymptotic regimes where the dimensionality of the parameter space is fixed as sample size increases, there often exist tests that cannot be further improved with the power enhancement principle. However, when the dimensionality of the parameter space increases sufficiently slowly with sample size and a marginal local asymptotic normality (LAN) condition is satisfied, every test with asymptotic size smaller than 1 can be improved with the power enhancement principle. While the marginal LAN condition alone does not allow one to extend the latter statement to all rates at which the dimensionality increases with sample size, we give sufficient conditions under which this is the case.

Download Full-text