In linear regression analysis, it's an important assumption that there should be a linear relationship between independent variable and dependent variable. Whereas, logistic regression assumes there should be a linear relationship between independent variable and logit function.
How to check non-linearity
Pearson correlation is a measure of linear relationship. The variables must be measured at interval scales. It is sensitive to outliers. If pearson correlation coefficient of a variable is close to 0, it means there is no linear relationship between variables.
How to check non-linearity
Pearson correlation is a measure of linear relationship. The variables must be measured at interval scales. It is sensitive to outliers. If pearson correlation coefficient of a variable is close to 0, it means there is no linear relationship between variables.
Spearman's correlation is a measure of monotonic relationship. It can be used for ordinal variables. It is less sensitive to outliers. If spearman correlation coefficient of a variable is close to 0, it means there is no monotonic relationship between variables.
Hoeffding’s D correlation is a measure of linear, monotonic and non-monotonic relationship. It has values between –0.5 to 1. The signs of Hoeffding coefficient has no interpretation.
If a variable has a very low rank for Spearman (coefficient - close to 0) and a very high rank for Hoeffding indicates a non-monotonic relationship.
If a variable has a very low rank for Pearson (coefficient - close to 0) and a very high rank for Hoeffding indicates a non-linear relationship.
Criterion to eliminate irrelevant variables
If a variable has poor rank on both the spearman and hoeffding correlation metrics, it means the relationship between the variables is random.
SAS Macro to detect non-monotonic relationship
Thanks
ReplyDeleteSheet which I have downloaded is password protected. Kindly provide password
ReplyDeleteHello, thank you so much for the article. Is there anyway that you can provide me with the password to the workbook?
ReplyDeleteI'm doing some statistical analysis with asphalt pavement data and would like to use Hoeffding’s D correlation. If you can, please email me.
Please, password for the MS Excel sheet.
ReplyDeletepassword for the excel?
ReplyDelete