Nearest-Neighbor Projected Distance Regression for Epistasis Detection in GWAS With Population Structure Correction

Nearest-neighbor Projected-Distance Regression (NPDR) for detecting network interactions with adjustments for multiple tests and confounding

10.1101/861492 ◽

2019 ◽

Author(s):

Trang T. Le ◽

Bryan A. Dawkins ◽

Brett A. McKinney

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Nearest Neighbor ◽

Penalized Regression ◽

Covariate Adjustment ◽

Data Types ◽

Multiple Testing Correction ◽

Feature Selection Technique ◽

Continuous Outcomes ◽

Projected Distance

AbstractMachine learning feature selection methods are needed to detect complex interaction-network effects in complicated modeling scenarios in high-dimensional data, such as GWAS, gene expression, eQTL, and structural/functional neuroimage studies for case-control or continuous outcomes. In addition, many machine learning methods have limited ability to address the issues of controlling false discoveries and adjusting for covariates. To address these challenges, we develop a new feature selection technique called Nearest-neighbor Projected-Distance Regression (NPDR) that calculates the importance of each predictor using generalized linear model (GLM) regression of distances between nearest-neighbor pairs projected onto the predictor dimension. NPDR captures the underlying interaction structure of data using nearest-neighbors in high dimensions, handles both dichotomous and continuous outcomes and predictor data types, statistically corrects for covariates, and permits statistical inference and penalized regression. We use realistic simulations with interactions and other effects to show that NPDR has better precision-recall than standard Relief-based feature selection and random forest importance, with the additional benefit of covariate adjustment and multiple testing correction. Using RNA-Seq data from a study of major depressive disorder (MDD), we show that NPDR with covariate adjustment removes spurious associations due to confounding. We apply NPDR to eQTL data to identify potentially interacting variants that regulate transcripts associated with MDD and demonstrate NPDR’s utility for GWAS and continuous outcomes.

Download Full-text

Nearest-neighbor Projected-Distance Regression (NPDR) for detecting network interactions with adjustments for multiple tests and confounding

Bioinformatics ◽

10.1093/bioinformatics/btaa024 ◽

2020 ◽

Vol 36 (9) ◽

pp. 2770-2777

Author(s):

Trang T Le ◽

Bryan A Dawkins ◽

Brett A McKinney

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Nearest Neighbor ◽

Penalized Regression ◽

Supplementary Information ◽

Covariate Adjustment ◽

Data Types ◽

Multiple Testing Correction ◽

Continuous Outcomes ◽

Projected Distance

Abstract Summary Machine learning feature selection methods are needed to detect complex interaction-network effects in complicated modeling scenarios in high-dimensional data, such as GWAS, gene expression, eQTL and structural/functional neuroimage studies for case–control or continuous outcomes. In addition, many machine learning methods have limited ability to address the issues of controlling false discoveries and adjusting for covariates. To address these challenges, we develop a new feature selection technique called Nearest-neighbor Projected-Distance Regression (NPDR) that calculates the importance of each predictor using generalized linear model regression of distances between nearest-neighbor pairs projected onto the predictor dimension. NPDR captures the underlying interaction structure of data using nearest-neighbors in high dimensions, handles both dichotomous and continuous outcomes and predictor data types, statistically corrects for covariates, and permits statistical inference and penalized regression. We use realistic simulations with interactions and other effects to show that NPDR has better precision-recall than standard Relief-based feature selection and random forest importance, with the additional benefit of covariate adjustment and multiple testing correction. Using RNA-Seq data from a study of major depressive disorder (MDD), we show that NPDR with covariate adjustment removes spurious associations due to confounding. We apply NPDR to eQTL data to identify potentially interacting variants that regulate transcripts associated with MDD and demonstrate NPDR’s utility for GWAS and continuous outcomes. Availability and implementation Available at: https://insilico.github.io/npdr/. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

Observation of Superlattice Dislocations on Cube Planes in Ni3Al

Proceedings, annual meeting, Electron Microscopy Society of America ◽

10.1017/s0424820100071314 ◽

1973 ◽

Vol 31 ◽

pp. 174-175 ◽

Cited By ~ 1

Author(s):

J. M. Oblak ◽

W. H. Rand

Keyword(s):

Nearest Neighbor ◽

Antiphase Boundary ◽

Nearest Neighbors ◽

High Energy ◽

Cross Slip ◽

Octahedral Plane ◽

Trapping Mechanism ◽

Slip Traces

The energy of an a/2 <110> shear antiphase. boundary in the Ll2 expected to be at a minimum on {100} cube planes because here strue ture is there is no violation of nearest-neighbor order. The latter however does involve the disruption of second nearest neighbors. It has been suggested that cross slip of paired a/2 <110> dislocations from octahedral onto cube planes is an important dislocation trapping mechanism in Ni3Al; furthermore, slip traces consistent with cube slip are observed above 920°K.Due to the high energy of the {111} antiphase boundary (> 200 mJ/m2), paired a/2 <110> dislocations are tightly constricted on the octahedral plane and cannot be individually resolved.

Download Full-text

Application of Lattice Imaging Techniques to Amorphous Films

Proceedings, annual meeting, Electron Microscopy Society of America ◽

10.1017/s0424820100113664 ◽

1975 ◽

Vol 33 ◽

pp. 8-9

Author(s):

S. R. Herd ◽

P. Chaudhari

Keyword(s):

Nearest Neighbor ◽

Random Network ◽

Imaging Techniques ◽

Network Models ◽

Accurate Method ◽

Direct Transmission ◽

Lattice Imaging ◽

Transmission Electron ◽

Lattice Planes ◽

Nearest Neighbor Distances

Electron diffraction and direct transmission have been used extensively to study the local atomic arrangement in amorphous solids and in particular Ge. Nearest neighbor distances had been calculated from E.D. profiles and the results have been interpreted in terms of the microcrystalline or the random network models. Direct transmission electron microscopy appears the most direct and accurate method to resolve this issue since the spacial resolution of the better instruments are of the order of 3Å. In particular the tilted beam interference method is used regularly to show fringes corresponding to 1.5 to 3Å lattice planes in crystals as resolution tests.

Download Full-text

Review for "Early failure detection of paper manufacturing machinery using nearest neighbor‐based feature extraction"

10.1002/eng2.12291/v1/review1 ◽

2020 ◽

Keyword(s):

Feature Extraction ◽

Nearest Neighbor ◽

Failure Detection ◽

Early Failure ◽

Paper Manufacturing

Download Full-text

Decision letter for "Early failure detection of paper manufacturing machinery using nearest neighbor‐based feature extraction"

10.1002/eng2.12291/v1/decision1 ◽

2020 ◽

Keyword(s):

Feature Extraction ◽

Nearest Neighbor ◽

Failure Detection ◽

Early Failure ◽

Paper Manufacturing

Download Full-text

Author response for "Early failure detection of paper manufacturing machinery using nearest neighbor‐based feature extraction"

10.1002/eng2.12291/v3/response1 ◽

2020 ◽

Author(s):

Wonjae Lee ◽

Kangwon Seo

Keyword(s):

Feature Extraction ◽

Nearest Neighbor ◽

Failure Detection ◽

Author Response ◽

Early Failure ◽

Paper Manufacturing

Download Full-text

MEAN SQUARE ERROR FOR UNIFORM–KERNEL ESTIMATE AND NEAREST NEIGHBOR ESTIMATE OF NONPARAMETRIC REGRESSION FUNCTIONS

Acta Mathematica Scientia ◽

10.1016/s0252-9602(18)30700-8 ◽

1985 ◽

Vol 5 (2) ◽

pp. 175-185

Author(s):

Dongchu Sun

Keyword(s):

Mean Square Error ◽

Nonparametric Regression ◽

Nearest Neighbor ◽

Mean Square ◽

Kernel Estimate ◽

Regression Functions

Download Full-text

Simulation of Ostwald Ripening in Two Dimensions: Spatial and Nearest Neighbor Correlations

Journal de Physique I ◽

10.1051/jp1:1995188 ◽

1995 ◽

Vol 5 (9) ◽

pp. 1143-1159 ◽

Cited By ~ 6

Author(s):

Norbert Masbaum

Keyword(s):

Nearest Neighbor ◽

Ostwald Ripening ◽

Two Dimensions

Download Full-text

Machine Learning Verdict of EEG Signals in Brain Computer Interface

International Journal of Scientific Research in Computer Science Engineering and Information Technology ◽

10.32628/cseit1838114 ◽

2018 ◽

pp. 429-441

Author(s):

M. Jeyanthi ◽

C. Velayutham

Keyword(s):

Nearest Neighbor ◽

Technology Development ◽

Vital Role ◽

Svm Classifier ◽

K Nearest Neighbor ◽

Data Mining Technique ◽

Data Set ◽

Eeg Data ◽

Irrelevant Attributes

In Science and Technology Development BCI plays a vital role in the field of Research. Classification is a data mining technique used to predict group membership for data instances. Analyses of BCI data are challenging because feature extraction and classification of these data are more difficult as compared with those applied to raw data. In this paper, We extracted features using statistical Haralick features from the raw EEG data . Then the features are Normalized, Binning is used to improve the accuracy of the predictive models by reducing noise and eliminate some irrelevant attributes and then the classification is performed using different classification techniques such as Naïve Bayes, k-nearest neighbor classifier, SVM classifier using BCI dataset. Finally we propose the SVM classification algorithm for the BCI data set.

Download Full-text