Dataset Modelability by QSAR: Continuous Response Variable

2021 ◽  
pp. 233-253
Author(s):  
Alexander Golbraikh ◽  
Rong Wang ◽  
Vinicius M. Alves ◽  
Inta Liepina ◽  
Eugene Muratov ◽  
...  
Author(s):  
Donald Quicke ◽  
Buntika A. Butcher ◽  
Rachel Kruft Welton

Abstract This chapter employs generalized linear modelling using the function glm when we know that variances are not constant with one or more explanatory variables and/or we know that the errors cannot be normally distributed, for example, they may be binary data, or count data where negative values are impossible, or proportions which are constrained between 0 and 1. A glm seeks to determine how much of the variation in the response variable can be explained by each explanatory variable, and whether such relationships are statistically significant. The data for generalized linear models take the form of a continuous response variable and a combination of continuous and discrete explanatory variables.


Author(s):  
Donald L. J. Quicke ◽  
Buntika A. Butcher ◽  
Rachel A. Kruft Welton

Abstract This chapter deals with analysis of covariance or ANCOVA, a combination of ANOVA and regression. It tests the effects of a mix of continuous and categorical variables on a continuous response variable. Two examples are presented. Example 1 is based on a study investigating the effects of two types of tagging (acrylic paint and subcutaneous microtags) on the growth of the coral reef goby, Coryphopterus glaucofraenum, in the British Virgin Islands and included initial size as a continuous explanatory variable. Example 2 analyses data from a study on the number of pollinaria removed by pollinators from inflorescences of two Sirindhornia orchid species (S. monophylla and S. mirabillis) in relation to the number of flowers in the inflorescence (also count data) and the orchid species (categorical).


Author(s):  
Donald L. J. Quicke ◽  
Buntika A. Butcher ◽  
Rachel A. Kruft Welton

Abstract This chapter deals with analysis of covariance or ANCOVA, a combination of ANOVA and regression. It tests the effects of a mix of continuous and categorical variables on a continuous response variable. Two examples are presented. Example 1 is based on a study investigating the effects of two types of tagging (acrylic paint and subcutaneous microtags) on the growth of the coral reef goby, Coryphopterus glaucofraenum, in the British Virgin Islands and included initial size as a continuous explanatory variable. Example 2 analyses data from a study on the number of pollinaria removed by pollinators from inflorescences of two Sirindhornia orchid species (S. monophylla and S. mirabillis) in relation to the number of flowers in the inflorescence (also count data) and the orchid species (categorical).


Author(s):  
J. Richard Stewart

In recent years a number of nonparametric regression-type statistical procedures have been developed. Classification and regression trees (CART) is one such method that can be used as a classifier for a discrete-valued response variable or as a regression model for a continuous response variable. Advantages of CART over many other methods are its ability to include a relatively large number of independent variables and to identify complex interactions among these variables. A brief description of the CART procedure is given, and its application as a classifier and as a regression model to highway safety analyses is illustrated.


Sign in / Sign up

Export Citation Format

Share Document