Information-theoretic measures of uncertainty for interval-set decision tables

2020 ◽

Vol 6 (1) ◽

pp. 19-54

Author(s):

Ryan Ka Yau Lai ◽

Youngah Do

Keyword(s):

Maximum Likelihood ◽

Corpus Linguistics ◽

Delta Method ◽

Confidence Bounds ◽

Likelihood Estimator ◽

Information Theoretic ◽

Leibler Divergence ◽

Information Theoretic Measures ◽

Data Points ◽

Measure Of Uncertainty

This article explores a method of creating confidence bounds for information-theoretic measures in linguistics, such as entropy, Kullback-Leibler Divergence (KLD), and mutual information. We show that a useful measure of uncertainty can be derived from simple statistical principles, namely the asymptotic distribution of the maximum likelihood estimator (MLE) and the delta method. Three case studies from phonology and corpus linguistics are used to demonstrate how to apply it and examine its robustness against common violations of its assumptions in linguistics, such as insufficient sample size and non-independence of data points.
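
As a rough illustration of the idea in this abstract, the sketch below builds a large-sample confidence interval for Shannon entropy from category counts. It assumes independent multinomial observations, uses the plug-in (maximum likelihood) estimate, and takes the variance from a first-order delta-method expansion. The function name `entropy_ci` and the normal quantile `z` are illustrative choices, not taken from the article.

```python
import math


def entropy_ci(counts, z=1.96):
    """Plug-in entropy (in nats) with a delta-method confidence interval.

    counts: observed frequency of each category (assumed multinomial).
    z: standard normal quantile, 1.96 for an approximate 95% interval.
    """
    n = sum(counts)
    # Maximum likelihood estimates of the category probabilities.
    probs = [c / n for c in counts if c > 0]
    h = -sum(p * math.log(p) for p in probs)
    # First-order delta method: Var(H) ~ (sum p*log(p)^2 - H^2) / n.
    second_moment = sum(p * math.log(p) ** 2 for p in probs)
    variance = max(second_moment - h ** 2, 0.0) / n
    half_width = z * math.sqrt(variance)
    return h, h - half_width, h + half_width


if __name__ == "__main__":
    estimate, lower, upper = entropy_ci([50, 30, 20])
    print(f"H = {estimate:.3f} nats, 95% CI = ({lower:.3f}, {upper:.3f})")
```

For the counts `[50, 30, 20]` the point estimate is about 1.03 nats. When the estimated distribution is exactly uniform, the first-order variance is zero and the interval collapses to a point, which is one reason small samples call for caution.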

Optimized permutation testing for information theoretic measures of multi-gene interactions

BMC Bioinformatics ◽

10.1186/s12859-021-04107-6 ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

James M. Kunert-Graf ◽

Nikita A. Sakhanenko ◽

David J. Galas

Keyword(s):

Large Scale ◽

Permutation Test ◽

Association Studies ◽

Genome Wide Association Studies ◽

Permutation Testing ◽

Exact Test ◽

Information Theoretic ◽

Information Theoretic Measures ◽

Full Analysis ◽

Computational Bottleneck

Background: Permutation testing is often considered the “gold standard” for multi-test significance analysis, as it is an exact test requiring few assumptions about the distribution being computed. However, it can be computationally very expensive, particularly in its naive form, in which the full analysis pipeline is re-run after permuting the phenotype labels. This can become intractable in multi-locus genome-wide association studies (GWAS), in which the number of potential interactions to be tested is combinatorially large. Results: In this paper, we develop an approach for permutation testing in multi-locus GWAS, specifically focusing on SNP–SNP-phenotype interactions using multivariable measures that can be computed from frequency count tables, such as those based on information theory. We find that the computational bottleneck in this process is the construction of the count tables themselves, and that this step can be eliminated at each iteration of the permutation testing by transforming the count tables directly. This leads to a speed-up by a factor of over 10^3 for a typical permutation test compared to the naive approach. Additionally, this approach is insensitive to the number of samples, making it suitable for datasets with a large number of samples. Conclusions: The proliferation of large-scale datasets with genotype data for hundreds of thousands of individuals enables new and more powerful approaches for the detection of multi-locus genotype-phenotype interactions. Our approach significantly improves the computational tractability of permutation testing for these studies. Moreover, our approach is insensitive to the large number of samples in these modern datasets. The code for performing these computations and replicating the figures in this paper is freely available at https://github.com/kunert/permute-counts.
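
To make the naive baseline described in this abstract concrete, the following sketch permutes phenotype labels, rebuilds the count table on every iteration, and recomputes mutual information. It is not the authors' optimized method (their code is at the repository linked above). The function names, the single-locus setup, and the add-one p-value correction are assumptions made for illustration.

```python
import math
import random
from collections import Counter


def mutual_information(xs, ys):
    """Mutual information (nats) from the joint count table of xs and ys."""
    n = len(xs)
    joint = Counter(zip(xs, ys))  # count table, rebuilt on every call
    px = Counter(xs)
    py = Counter(ys)
    return sum(
        (c / n) * math.log(c * n / (px[x] * py[y]))
        for (x, y), c in joint.items()
    )


def permutation_p_value(genotypes, phenotypes, n_perm=1000, seed=0):
    """Naive permutation test: shuffle the labels, recompute the statistic."""
    rng = random.Random(seed)
    observed = mutual_information(genotypes, phenotypes)
    shuffled = list(phenotypes)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if mutual_information(genotypes, shuffled) >= observed:
            hits += 1
    # Add-one correction keeps the estimated p-value strictly positive.
    return (hits + 1) / (n_perm + 1)


if __name__ == "__main__":
    genotypes = [0, 1] * 50
    phenotypes = list(genotypes)  # toy data with perfect dependence
    print(permutation_p_value(genotypes, phenotypes))
```

Each permutation here costs a full pass over the samples to rebuild the count table, which is the step the abstract identifies as the bottleneck.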

Insights into codeswitching from online communication: Effects of language preference and conditions arising from vocabulary richness

Bilingualism Language and Cognition ◽

10.1017/s1366728921000122 ◽

2021 ◽

pp. 1-7

Author(s):

Laurie Beth Feldman ◽

Vidhushini Srinivasan ◽

Rachel B. Fernandes ◽

Samira Shaikh

Keyword(s):

Online Communication ◽

Language Preference ◽

Lexical Diversity ◽

Information Theoretic ◽

Vocabulary Richness ◽

Twitter Data ◽

Language Mixing ◽

Communication Effects ◽

Information Theoretic Measures ◽

Spanish Bilinguals

Twitter data from a crisis that impacted many English–Spanish bilinguals show that the direction of codeswitches is associated with the statistically documented tendency of single speakers to prefer one language over another in their tweets, as gleaned from their tweeting history. Further, lexical diversity, a measure of vocabulary richness derived from information-theoretic measures of uncertainty in communication, is greater in proximity to a codeswitch than in productions remote from a switch. The prospects of a role for lexical diversity in characterizing the conditions for a language switch suggest that communicative precision may induce conditions that attenuate constraints against language mixing.

Information-Theoretic Measures and Modeling Stock Market Volatility: A Comparative Approach

Risks ◽

10.3390/risks9050089 ◽

2021 ◽

Vol 9 (5) ◽

pp. 89

Author(s):

Muhammad Sheraz ◽

Imran Nasir

Keyword(s):

Stock Market ◽

Stock Returns ◽

Stock Exchange ◽

Approximate Entropy ◽

Market Volatility ◽

Comparative Approach ◽

Stock Market Volatility ◽

Information Theoretic ◽

Information Theoretic Measures ◽

Garch Modeling

The volatility analysis of stock returns data is paramount in financial studies. We investigate the dynamics of volatility and randomness of the Pakistan Stock Exchange (PSX-100) and obtain insights into the behavior of investors before and during the coronavirus disease (COVID-19) pandemic. The paper aims to present the volatility estimations and quantification of the randomness of PSX-100. The methodology includes two approaches: (i) the implementation of EGARCH, GJR-GARCH, and TGARCH models to estimate the volatilities; and (ii) analysis of randomness in the volatility series, return series, and PSX-100 closing prices for the pre-pandemic and pandemic periods using Shannon, Tsallis, approximate, and sample entropies. Volatility modeling suggests the existence of the leverage effect in both periods of study. The results obtained using GARCH modeling reveal that stock market volatility increased during the pandemic period. However, information-theoretic results based on Shannon and Tsallis entropies do not suggest notable variation in the estimated volatility series and closing prices. We have examined regularity and randomness based on the approximate entropy and sample entropy, and we find that both entropies are extremely sensitive to the choice of parameters.
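
The two histogram-based entropies named in this abstract can be sketched as follows. This is a minimal illustration that assumes the series is discretized into equal-width bins; the bin count, the Tsallis index `q`, and all function names are hypothetical choices rather than the paper's settings. Approximate and sample entropy are not shown.

```python
import math
from collections import Counter


def bin_probabilities(values, n_bins=10):
    """Empirical probabilities of equal-width bins over the series."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_bins or 1.0  # guard against a constant series
    # The maximum value is clamped into the last bin.
    indices = [min(int((v - lo) / width), n_bins - 1) for v in values]
    n = len(values)
    return [c / n for c in Counter(indices).values()]


def shannon_entropy(probs):
    """Shannon entropy in nats."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def tsallis_entropy(probs, q=2.0):
    """Tsallis entropy of index q; reduces to Shannon entropy as q -> 1."""
    if q == 1.0:
        return shannon_entropy(probs)
    return (1.0 - sum(p ** q for p in probs)) / (q - 1.0)


if __name__ == "__main__":
    toy_returns = [0.0, 0.0, 1.0, 1.0]  # toy series, not market data
    probs = bin_probabilities(toy_returns, n_bins=2)
    print(shannon_entropy(probs), tsallis_entropy(probs, q=2.0))
```

Both values depend on the binning, so comparisons across periods are only meaningful when the same bins are used for each series.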

Measuring the Complexity of Additive Manufacturing Supply Chains

Volume 2: Additive Manufacturing; Materials ◽

10.1115/msec2017-2871 ◽

2017 ◽

Author(s):

Ardeshir Raihanian Mashhadi ◽

Sara Behdad

Keyword(s):

Supply Chain ◽

Additive Manufacturing ◽

Supply Chains ◽

Product Complexity ◽

Information Theoretic ◽

Manufacturing Method ◽

Management Domain ◽

Information Theoretic Measures ◽

Production Cycles ◽

Supply Planning

Complexity has been one of the focal points of attention in the supply chain management domain, as it deteriorates the performance of the supply chain and makes controlling it problematic. The complexity of supply chains has increased significantly over the past couple of decades. Meanwhile, Additive Manufacturing (AM) not only revolutionizes the way products are made, but also brings a paradigm shift to the whole production system. The influence of AM extends to product design and the supply chain as well. The unique capabilities of AM suggest that this manufacturing method can significantly affect supply chain complexity. Greater product complexity and demand heterogeneity, faster production cycles, higher levels of automation, and shorter supply paths are among the features of additive manufacturing that can directly influence supply chain complexity. Comparing the complexity of an additive manufacturing supply chain to that of its traditional counterpart requires a thorough understanding of the transformative effects of AM on the supply chain. This paper first extracts the possible effects of AM on the supply chain and then tries to connect these effects to the drivers of complexity under three main categories: 1) market, 2) manufacturing technology, and 3) supply, planning and infrastructure. Possible impacts of additive manufacturing adoption on supply chain complexity have been studied using information-theoretic measures. An Agent-based Simulation (ABS) model has been developed to study and compare two different supply chain configurations. The findings of this study suggest that the adoption of AM can decrease supply chain complexity, particularly when product customization is considered.

Efficiency analysis of information theoretic measures in image registration

Pattern Recognition and Image Analysis ◽

10.1134/s1054661816030226 ◽

2016 ◽

Vol 26 (3) ◽

pp. 502-505 ◽

Cited By ~ 2

Author(s):

S. V. Voronov ◽

A. G. Tashlinskii

Keyword(s):

Image Registration ◽

Efficiency Analysis ◽

Information Theoretic ◽

Information Theoretic Measures

Information Theoretic Measures for Quantifying the Integration of Neural Activity

2007 Information Theory and Applications Workshop ◽

10.1109/ita.2007.4357556 ◽

2007 ◽

Cited By ~ 3

Author(s):

Selin Aviyente

Keyword(s):

Neural Activity ◽

Information Theoretic ◽

Information Theoretic Measures

On the complexity of assimilation in urban communities

Applied Network Science ◽

10.1007/s41109-021-00399-y ◽

2021 ◽

Vol 6 (1) ◽

Author(s):

Renita Murimi

Keyword(s):

Complex Systems ◽

Urban Communities ◽

Human Experience ◽

Urban Environments ◽

Urban Systems ◽

Sociological Perspective ◽

Information Theoretic ◽

Information Theoretic Measures ◽

Urban Complex ◽

The Relationship

Cities are microcosms representing a diversity of human experience. The complexity of urban systems arises from this diversity, where the services that cities offer to their inhabitants have to be tailored for their unique requirements. This paper studies the complexity of urban environments in terms of the assimilation of its communities. We examine the urban assimilation complexity with respect to the foreignness between communities and formalize the level of complexity using information-theoretic measures. Our findings contribute to a sociological perspective of the relationship between urban complex systems and the diversity of communities that make up urban systems.

Universality Classes and Information-Theoretic Measures of Complexity via Group Entropies

Scientific Reports ◽

10.1038/s41598-020-60188-y ◽

2020 ◽

Vol 10 (1) ◽

Cited By ~ 2

Author(s):

Piergiulio Tempesta ◽

Henrik Jeldtoft Jensen

Keyword(s):

Information Theoretic ◽

Information Theoretic Measures ◽

Universality Classes

What’s Worth Talking About? Information Theory Reveals How Children Balance Informativeness and Ease of Production

Psychological Science ◽

10.1177/0956797617699848 ◽

2017 ◽

Vol 28 (7) ◽

pp. 954-966 ◽

Cited By ~ 8

Author(s):

Colin Bannard ◽

Marla Rosner ◽

Danielle Matthews

Keyword(s):

Information Theory ◽

Information Content ◽

Low Frequency ◽

Initial Experiment ◽

Information Theoretic ◽

Information Theoretic Measures

Of all the things a person could say in a given situation, what determines what is worth saying? Greenfield’s principle of informativeness states that right from the onset of language, humans selectively comment on whatever they find unexpected. In this article, we quantify this tendency using information-theoretic measures and report on a study in which we tested the counterintuitive prediction that children will produce words that have a low frequency given the context, because these will be most informative. Using corpora of child-directed speech, we identified adjectives that varied in how informative (i.e., unexpected) they were given the noun they modified. In an initial experiment (N = 31) and in a replication (N = 13), 3-year-olds heard an experimenter use these adjectives to describe pictures. The children’s task was then to describe the pictures to another person. As the information content of the experimenter’s adjective increased, so did children’s tendency to comment on the feature that adjective had encoded. Furthermore, our analyses suggest that children balance informativeness with a competing drive to ease production.
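
One common way to quantify how unexpected an adjective is given its noun is surprisal, the negative log of the conditional probability. The sketch below estimates it from adjective-noun pair counts. Whether the study used exactly this estimator is an assumption, and the function name and the toy pairs are hypothetical, not drawn from the corpora in the article.

```python
import math


def surprisal_bits(pairs, adjective, noun):
    """Information content, in bits, of `adjective` given `noun`.

    pairs: list of (adjective, noun) tokens observed in a corpus.
    Uses the relative-frequency estimate of P(adjective | noun).
    """
    noun_total = sum(1 for _, n in pairs if n == noun)
    pair_count = sum(1 for a, n in pairs if a == adjective and n == noun)
    if pair_count == 0:
        raise ValueError("adjective-noun pair not observed in the corpus")
    return -math.log2(pair_count / noun_total)


if __name__ == "__main__":
    # Toy pairs for illustration only.
    toy_pairs = [("red", "apple")] * 3 + [("green", "apple")]
    print(surprisal_bits(toy_pairs, "red", "apple"))
    print(surprisal_bits(toy_pairs, "green", "apple"))
```

In the toy data, the rarer pairing carries more bits, which matches the abstract's notion that low-frequency words given the context are the most informative.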

Large-sample confidence intervals of information-theoretic measures in linguistics