local statistics Latest Research Papers

Unconscious Bias Towards African American Women in the Education Workforce

10.4018/978-1-7998-8532-0.ch002 ◽

2022 ◽

pp. 23-43

Author(s):

Deborah Carol Fields Harris

Keyword(s):

African American ◽

African American Women ◽

Educational Administration ◽

School Principal ◽

Unconscious Bias ◽

Dominant Culture ◽

American Women ◽

Local Statistics ◽

Job Openings ◽

School Principalship

The number of African American women who become school principals is low per national and local statistics. An unconscious bias towards African American women may have contributed to these low statistics. The process of applying for a school principal's position has not been consistent for over a century. It seems that for job openings in which the dominant culture is not African American, the likelihood of being the school principal is doubtful. Unveiling and examining these biases may lead to determining how to include more African American women in educational administration. This chapter will describe 10 African American women who encountered unconscious bias as they sought and procured public-school principalship.

Download Full-text

Self-Sampling for Neural Point Cloud Consolidation

ACM Transactions on Graphics ◽

10.1145/3470645 ◽

2021 ◽

Vol 40 (5) ◽

pp. 1-14

Author(s):

Gal Metzer ◽

Rana Hanocka ◽

Raja Giryes ◽

Daniel Cohen-Or

Keyword(s):

Point Cloud ◽

Sampling Methods ◽

Difficult Problem ◽

Local Statistics ◽

Novel Technique ◽

Input Point ◽

Shape Learning ◽

Fixed Set ◽

Point Set ◽

Non Local

We introduce a novel technique for neural point cloud consolidation which learns from only the input point cloud. Unlike other point up-sampling methods which analyze shapes via local patches, in this work, we learn from global subsets. We repeatedly self-sample the input point cloud with global subsets that are used to train a deep neural network. Specifically, we define source and target subsets according to the desired consolidation criteria (e.g., generating sharp points or points in sparse regions). The network learns a mapping from source to target subsets, and implicitly learns to consolidate the point cloud. During inference, the network is fed with random subsets of points from the input, which it displaces to synthesize a consolidated point set. We leverage the inductive bias of neural networks to eliminate noise and outliers, a notoriously difficult problem in point cloud consolidation. The shared weights of the network are optimized over the entire shape, learning non-local statistics and exploiting the recurrence of local-scale geometries. Specifically, the network encodes the distribution of the underlying shape surface within a fixed set of local kernels, which results in the best explanation of the underlying shape surface. We demonstrate the ability to consolidate point sets from a variety of shapes, while eliminating outliers and noise.

Download Full-text

Fast and compact matching statistics analytics

10.1101/2021.10.05.463202 ◽

2021 ◽

Author(s):

Fabio Cunial ◽

Olgert Denas ◽

Djamal Belazzougui

Keyword(s):

Shared Memory ◽

Data Structures ◽

Large Scale ◽

State Of The Art ◽

Structural Rearrangements ◽

Compression Scheme ◽

Local Statistics ◽

Compact Representations ◽

Long Reads ◽

New Applications

Fast, lightweight methods for comparing the sequence of ever larger assembled genomes from ever growing databases are increasingly needed in the era of accurate long reads and pan-genome initiatives. Matching statistics is a popular method for computing whole-genome phylogenies and for detecting structural rearrangements between two genomes, since it is amenable to fast implementations that require a minimal setup of data structures. However, current implementations use a single core, take too much memory to represent the result, and do not provide efficient ways to analyze the output in order to explore local similarities between the sequences. We develop practical tools for computing matching statistics between large-scale strings, and for analyzing its values, faster and using less memory than the state of the art. Specifically, we design a parallel algorithm for shared-memory machines that computes matching statistics 30 times faster with 48 cores in the cases that are most difficult to parallelize. We design a lossy compression scheme that shrinks the matching statistics array to a bitvector that takes from 0.8 to 0.2 bits per character, depending on the dataset and on the value of a threshold, and that achieves 0.04 bits per character in some variants. And we provide efficient implementations of range-maximum and range-sum queries that take a few tens of milliseconds while operating on our compact representations, and that allow computing key local statistics about the similarity between two strings. Our toolkit makes construction, storage, and analysis of matching statistics arrays practical for multiple pairs of the largest genomes available today, possibly enabling new applications in comparative genomics.

Download Full-text

Adversarial Spectral Kernel Matching for Unsupervised Time Series Domain Adaptation

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2021/378 ◽

2021 ◽

Author(s):

Qiao Liu ◽

Hui Xue

Keyword(s):

Time Series ◽

Real World ◽

Domain Adaptation ◽

Kernel Learning ◽

Target Domain ◽

Maximum Mean Discrepancy ◽

Local Statistics ◽

Unsupervised Domain Adaptation ◽

Non Local ◽

Spectral Kernel

Unsupervised domain adaptation (UDA) has been received increasing attention since it does not require labels in target domain. Most existing UDA methods learn domain-invariant features by minimizing discrepancy distance computed by a certain metric between domains. However, these discrepancy-based methods cannot be robustly applied to unsupervised time series domain adaptation (UTSDA). That is because discrepancy metrics in these methods contain only low-order and local statistics, which have limited expression for time series distributions and therefore result in failure of domain matching. Actually, the real-world time series are always non-local distributions, i.e., with non-stationary and non-monotonic statistics. In this paper, we propose an Adversarial Spectral Kernel Matching (AdvSKM) method, where a hybrid spectral kernel network is specifically designed as inner kernel to reform the Maximum Mean Discrepancy (MMD) metric for UTSDA. The hybrid spectral kernel network can precisely characterize non-stationary and non-monotonic statistics in time series distributions. Embedding hybrid spectral kernel network to MMD not only guarantees precise discrepancy metric but also benefits domain matching. Besides, the differentiable architecture of the spectral kernel network enables adversarial kernel learning, which brings more discriminatory expression for discrepancy matching. The results of extensive experiments on several real-world UTSDA tasks verify the effectiveness of our proposed method.

Download Full-text

Ridgelized Hotelling’s T2 test on mean vectors of large dimension

Random Matrices Theory and Application ◽

10.1142/s2010326322500113 ◽

2021 ◽

pp. 2250011

Author(s):

Gao-Fan Ha ◽

Qiuyan Zhang ◽

Zhidong Bai ◽

You-Gan Wang

Keyword(s):

Random Matrices ◽

Pathway Analysis ◽

Large Dimension ◽

High Dimensional ◽

Stat Assoc ◽

Moment Conditions ◽

Local Statistics ◽

Mean Vectors ◽

Simulation Results ◽

Mean Vector

In this paper, a ridgelized Hotelling’s [Formula: see text] test is developed for a hypothesis on a large-dimensional mean vector under certain moment conditions. It generalizes the main result of Chen et al. [A regularized Hotelling’s [Formula: see text] test for pathway analysis in proteomic studies, J. Am. Stat. Assoc. 106(496) (2011) 1345–1360.] by relaxing their Gaussian assumption. This is achieved by establishing an exact four-moment theorem that is a simplified version of Tao and Vu’s [Random matrices: universality of local statistics of eigenvalues, Ann. Probab. 40(3) (2012) 1285–1315] work. Simulation results demonstrate the superiority of the proposed test over the traditional Hotelling’s [Formula: see text] test and its several extensions in high-dimensional situations.

Download Full-text

Mapping a Cloud-Free Rice Growth Stages Using the Integration of PROBA-V and Sentinel-1 and Its Temporal Correlation with Sub-District Statistics

Remote Sensing ◽

10.3390/rs13081498 ◽

2021 ◽

Vol 13 (8) ◽

pp. 1498

Author(s):

Fadhlullah Ramadhani ◽

Reddy Pullanagari ◽

Gabor Kereszturi ◽

Jonathan Procter

Keyword(s):

Remote Sensing ◽

Growth Stage ◽

Temporal Correlation ◽

Lag Time ◽

Rice Production ◽

Support Vector ◽

Growth Stages ◽

Accuracy Measurement ◽

Local Statistics ◽

Rice Growth

Monitoring rice production is essential for securing food security against climate change threats, such as drought and flood events becoming more intense and frequent. The current practice to survey an area of rice production manually and in near real-time is expensive and involves a high workload for local statisticians. Remote sensing technology with satellite-based sensors has grown in popularity in recent decades as an alternative approach, reducing the cost and time required for spatial analysis over a wide area. However, cloud-free pixels of optical imagery are required to produce accurate outputs for agriculture applications. Thus, in this study, we propose an integration of optical (PROBA-V) and radar (Sentinel-1) imagery for temporal mapping of rice growth stages, including bare land, vegetative, reproductive, and ripening stages. We have built classification models for both sensors and combined them into 12-day periodical rice growth-stage maps from January 2017 to September 2018 at the sub-district level over Java Island, the top rice production area in Indonesia. The accuracy measurement was based on the test dataset and the predicted cross-correlated with monthly local statistics. The overall accuracy of the rice growth-stage model of PROBA-V was 83.87%, and the Sentinel-1 model was 71.74% with the Support Vector Machine classifier. The temporal maps were comparable with local statistics, with an average correlation between the vegetative area (remote sensing) and harvested area (local statistics) is 0.50, and lag time 89.5 days (n = 91). This result was similar to local statistics data, which correlate planting and the harvested area at 0.61, and the lag time as 90.4 days, respectively. Moreover, the cross-correlation between the predicted rice growth stage was also consistent with rice development in the area (r > 0.52, p < 0.01). This novel method is straightforward, easy to replicate and apply to other areas, and can be scaled up to the national and regional level to be used by stakeholders to support improved agricultural policies for sustainable rice production.

Download Full-text

On Null Hypotheses and Heteroskedasticity

10.31219/osf.io/ugkhp ◽

2021 ◽

Author(s):

Jeffery Sauer ◽

Taylor M. Oshan ◽

Sergio Rey ◽

Levi John Wolf

Keyword(s):

Variance Components ◽

Null Hypothesis ◽

Alternative Explanation ◽

Real World Data ◽

Local Statistics ◽

Local Variance ◽

Local Heterogeneity ◽

Alternative Measures ◽

The Impact ◽

The Relationship

Bivand and Wong (2018), a recent review on spatial statistical software, noted important differences in the results of the local Moran’s Ii statistic depending on the method of inference. That review speculated the differences may be due to the presence of local spatial heterogeneity. In this paper we design an experiment to assess the impact of local heterogeneity on hypothesis testing for local statistics. In this experiment, we analyze the relationship between measures of local variance, such as the local spatial heteroskedasticity (LOSH) statistic, and components of the local Moran’s Ii statistic. We consider this experiment with controlled synthetic heteroskedastic data and with uncontrolled real world data. We show that in both situations the variance components of the local Moran’s Ii statistic demonstrate a varying correlation with alternative measures of local variance like LOSH. In addition, we resituate the available inferential methods and suggest an alternative explanation for the differences observed in Bivand and Wong 2018. Ultimately, this paper demonstrates that there are important conceptual and computational differences as to what constituents a null hypothesis in local testing frameworks. Therefore, researchers must be aware as to how their choices may shape the observed spatial patterns.

Download Full-text

Application of statistical test on determining the unstable points in the basic network of horizontal displacement monitoring

Journal of Mining and Earth Sciences ◽

10.46326/jmes.2021.62(1).05 ◽

2021 ◽

Vol 62 (1) ◽

pp. 35-41

Author(s):

Khanh Quoc Pham ◽

Keyword(s):

Hydroelectric Plant ◽

Hypothesis Test ◽

Control Point ◽

Horizontal Displacement ◽

Test Method ◽

Reference Network ◽

Control Points ◽

Displacement Monitoring ◽

Local Statistics ◽

Global Statistics

The paper represents the hypothesis test method that can determine the instability control points of the reference network in the displacement of construction. Regarding data processing in displacement monitoring, the detection and modification for instability points is an important task because this affects the computation of the displacement of monitoring points. This method has been applied in many countries over the world but not in Vietnam, and it is processed through two steps including the global statistics test and local statistics test. The global statistics test is to identify whether a control point is stable or not. The local statistics test based on the division of groups is to find the unstable control points exactly. Experimental computation is carried out in two monitoring cycles at Hoa Binh hydroelectric plant. In this experiment, this algorithm detected two unstable points among six control points. This result is in agreement with the result that is solved by Vietnam’s construction standard of TCVN 9399:2012. In conclusion, the hypothesis test method completely can apply in real geodetic production in Vietnam.

Download Full-text

Global and local statistics in turbulent emulsions

Journal of Fluid Mechanics ◽

10.1017/jfm.2020.1118 ◽

2021 ◽

Vol 912 ◽

Author(s):

Lei Yi ◽

Federico Toschi ◽

Chao Sun

Keyword(s):

Local Statistics ◽

Global And Local ◽

Global And Local Statistics

Abstract

Download Full-text

Local Statistics, Semidefinite Programming, and Community Detection

Proceedings of the 2021 ACM-SIAM Symposium on Discrete Algorithms (SODA) ◽

10.1137/1.9781611976465.79 ◽

2021 ◽

pp. 1298-1316

Author(s):

Jess Banks ◽

Sidhanth Mohanty ◽

Prasad Raghavendra

Keyword(s):

Semidefinite Programming ◽

Community Detection ◽

Local Statistics

Download Full-text

local statistics
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Unconscious Bias Towards African American Women in the Education Workforce

Self-Sampling for Neural Point Cloud Consolidation

Fast and compact matching statistics analytics

Adversarial Spectral Kernel Matching for Unsupervised Time Series Domain Adaptation

Ridgelized Hotelling’s T2 test on mean vectors of large dimension

Mapping a Cloud-Free Rice Growth Stages Using the Integration of PROBA-V and Sentinel-1 and Its Temporal Correlation with Sub-District Statistics

On Null Hypotheses and Heteroskedasticity

Application of statistical test on determining the unstable points in the basic network of horizontal displacement monitoring

Global and local statistics in turbulent emulsions

Local Statistics, Semidefinite Programming, and Community Detection

Export Citation Format

local statisticsRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Unconscious Bias Towards African American Women in the Education Workforce

Self-Sampling for Neural Point Cloud Consolidation

Fast and compact matching statistics analytics

Adversarial Spectral Kernel Matching for Unsupervised Time Series Domain Adaptation

Ridgelized Hotelling’s T2 test on mean vectors of large dimension

Mapping a Cloud-Free Rice Growth Stages Using the Integration of PROBA-V and Sentinel-1 and Its Temporal Correlation with Sub-District Statistics

On Null Hypotheses and Heteroskedasticity

Application of statistical test on determining the unstable points in the basic network of horizontal displacement monitoring

Global and local statistics in turbulent emulsions

Local Statistics, Semidefinite Programming, and Community Detection

local statistics
Recently Published Documents