What does the adjusted R-squared value tell you in multiple linear regression?
The proportion of variance in the outcome explained by the predictors, adjusted for the number of predictors in the model.
The statistical significance of the overall model.
The accuracy of the model's predictions.
The presence of outliers in the data.
When using Principal Component Analysis (PCA) as a remedy for multicollinearity, what is the primary aim?
To increase the sample size of the dataset
To create new, uncorrelated variables from the original correlated ones
To introduce non-linearity into the model
To remove all independent variables from the model
Which of the following is NOT a valid approach to address multicollinearity?
Increasing the sample size of the dataset
Centering or scaling the independent variables
Removing one or more of the highly correlated independent variables
Transforming the independent variables (e.g., using principal component analysis)
Elastic Net Regression combines the penalties of which two regularization techniques?
Linear Regression and Ridge Regression
Lasso Regression and Polynomial Regression
Lasso Regression and Ridge Regression
Ridge Regression and Polynomial Regression
How does Lasso Regression differ from Ridge Regression in terms of feature selection?
Both Lasso and Ridge Regression can shrink coefficients to zero, but Lasso does it more aggressively.
Ridge Regression tends to shrink all coefficients towards zero but rarely sets them exactly to zero.
Neither Lasso nor Ridge Regression performs feature selection; they only shrink coefficients.
Lasso Regression can shrink coefficients to exactly zero, effectively performing feature selection.
What is a key limitation of relying solely on Adjusted R-squared for model evaluation in linear regression?
It is difficult to interpret.
It doesn't provide information about the magnitude of prediction errors.
It is highly sensitive to outliers.
It can be misleading when comparing models with different numbers of predictors.
A model has a high R-squared but a low Adjusted R-squared. What is a likely explanation?
The model is too simple.
The model is overfitting.
The model is a perfect fit.
The model has high bias.
In which scenario might you prefer Huber regression over RANSAC for robust regression?
When it's important to completely discard the outliers from the analysis
When dealing with high-dimensional data with a large number of features
When the outliers are expected to be clustered together
When the proportion of outliers is relatively small
If a predictor has a p-value of 0.02 in a multiple linear regression model, what can you conclude?
The predictor is not statistically significant.
The predictor explains 2% of the variance in the outcome.
The predictor is statistically significant at the 0.05 level.
The predictor has a practically significant effect on the outcome.
What distinguishes a random slope model from a random intercept model in HLM?
Random slope models allow intercepts to vary, while random intercept models don't.
Random slope models handle categorical variables, while random intercept models handle continuous variables.
Random slope models allow slopes to vary, while random intercept models don't.
Random slope models are used for smaller datasets, while random intercept models are used for larger datasets.