where sample selection come from
- self-selection
observed working women have higher market wage than home wage. Reasoning: working status (career) increase the earnings Bias: more skilled woment choose to work (skills increase the earnings )
observed immigrants earn more than non-immigrants Reasoning: immigration increase earning Bias: more skilled workers choose (be able) to work as immigrants
- collection-selection
observed spouse income on personal health (panel data) Reasoning: household income influence well-beings Bias: only stable maritual status sample (relation stability influence well-beings)
Overall, the sample selection bias emerges as factors “determining the probability of entrance into the sample” (Hechman, 1979) confound the estimates of interest in a regression model.
sympton of sample selection bias
downward estimation of population variance \(\sigma\)
variables that do not belong in true structural equation appear to be statistically significant when fitting
special cases
multivariate extensions of the preceding analysis might be of substantive interest
Code in R
library(ivreg)
## Warning: package 'ivreg' was built under R version 4.1.3