Wooldridge Source: T.J. Kane and C.E. Rouse (1995), Labor-Market Returns to Two- and Four-Year Colleges, American Economic Review 85, 600-614. With Professor Rouse’s kind assistance, I obtained the data from her web site at Princeton University. Data loads lazily.
data('twoyear')
A data.frame with 6763 observations on 23 variables:
female: =1 if female
phsrank: percent high school rank; 100 = best
BA: =1 if Bachelor's degree
AA: =1 if Associate's degree
black: =1 if African-American
hispanic: =1 if Hispanic
id: ID Number
exper: total (actual) work experience
jc: total 2-year credits
univ: total 4-year credits
lwage: log hourly wage
stotal: total standardized test score
smcity: =1 if small city, 1972
medcity: =1 if medium city, 1972
submed: =1 if suburb of medium city, 1972
lgcity: =1 if large city, 1972
sublg: =1 if suburb of large city, 1972
vlgcity: =1 if very large city, 1972
subvlg: =1 if suburb of very large city, 1972
ne: =1 if northeast
nc: =1 if north central
south: =1 if south
totcoll: jc + univ
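As a quick consistency check on the variable definitions above, the following sketch compares totcoll with jc + univ and checks that a few of the indicator columns contain only 0 and 1. The helper name check_twoyear_coding is hypothetical (it is not part of the package), and the block assumes the wooldridge package is installed. No particular result is claimed here.

```r
# Hypothetical helper, not part of the wooldridge package.
check_twoyear_coding <- function(df,
                                 dummies = c("female", "BA", "AA", "black", "hispanic")) {
  list(
    # Largest absolute gap between totcoll and jc + univ.
    max_totcoll_gap = max(abs(df$totcoll - (df$jc + df$univ))),
    # TRUE for each listed indicator that takes only the values 0 and 1.
    binary = vapply(df[dummies], function(x) all(x %in% c(0, 1)), logical(1))
  )
}

# Assumption: the data come from the wooldridge package, if it is installed.
if (requireNamespace("wooldridge", quietly = TRUE)) {
  data("twoyear", package = "wooldridge")
  print(check_twoyear_coding(twoyear))
}
```

A small gap rather than an exact zero would not necessarily indicate an error, since the columns are stored as floating-point numbers.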
https://www.cengage.com/cgi-wadsworth/course_products_wp.pl?fid=M20b&product_isbn_issn=9781111531041
As possible extensions, students can explore whether the returns to two-year or four-year colleges depend on race or gender. Problem 7.9 does this in part, but with college aggregated into a single number. Also, should experience appear as a quadratic in the wage specification?
Used in Text: pages 140-143, 165, 261, 340
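The extensions suggested in the notes above can be sketched roughly as follows. This is one possible specification, not the textbook's own: the helper name fit_twoyear_extensions is hypothetical, the choice of female and black as the interacting indicators is an assumption, and the block assumes the wooldridge package is installed.

```r
# Hypothetical helper, not part of the wooldridge package.
fit_twoyear_extensions <- function(df) {
  list(
    # Baseline: separate returns to two-year (jc) and four-year (univ) college.
    base = lm(lwage ~ jc + univ + exper, data = df),
    # Let both returns differ by gender and race through interaction terms.
    by_group = lm(lwage ~ (jc + univ) * (female + black) + exper, data = df),
    # Add a quadratic term in experience to the baseline.
    quad = lm(lwage ~ jc + univ + exper + I(exper^2), data = df)
  )
}

# Assumption: the data come from the wooldridge package, if it is installed.
if (requireNamespace("wooldridge", quietly = TRUE)) {
  data("twoyear", package = "wooldridge")
  fits <- fit_twoyear_extensions(twoyear)
  print(summary(fits$by_group))
  # F-test of the baseline against the model with exper^2.
  print(anova(fits$base, fits$quad))
}
```

The interaction coefficients (for example jc:female) measure how the return to each type of college differs across groups, while the anova() comparison is one way to judge whether the quadratic in exper is needed.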
str(twoyear)
#> 'data.frame': 6763 obs. of 23 variables:
#> $ female : int 1 1 1 1 1 0 0 0 0 0 ...
#> $ phsrank : int 65 97 44 34 80 59 81 50 8 56 ...
#> $ BA : int 0 0 0 0 0 0 1 0 0 1 ...
#> $ AA : int 0 0 0 0 0 0 0 0 0 0 ...
#> $ black : int 0 0 0 0 0 0 0 1 0 1 ...
#> $ hispanic: int 0 0 0 1 0 0 0 0 0 0 ...
#> $ id : num 19 93 96 119 132 156 163 188 199 200 ...
#> $ exper : int 161 119 81 39 141 165 127 161 138 64 ...
#> $ jc : num 0 0 0 0.267 0 ...
#> $ univ : num 0 7.03 0 0 0 ...
#> $ lwage : num 1.93 2.8 1.63 2.22 1.64 ...
#> $ stotal : num -0.442 0 -1.357 -0.19 0 ...
#> $ smcity : int 0 1 0 1 0 1 1 0 1 0 ...
#> $ medcity : int 0 0 0 0 0 0 0 0 0 0 ...
#> $ submed : int 0 0 0 0 0 0 0 0 0 0 ...
#> $ lgcity : int 0 0 0 0 0 0 0 1 0 0 ...
#> $ sublg : int 1 0 1 0 0 0 0 0 0 0 ...
#> $ vlgcity : int 0 0 0 0 0 0 0 0 0 0 ...
#> $ subvlg : int 0 0 0 0 0 0 0 0 0 0 ...
#> $ ne : int 1 0 1 0 0 0 0 0 0 0 ...
#> $ nc : int 0 1 0 0 0 0 1 0 0 0 ...
#> $ south : int 0 0 0 0 1 1 0 1 0 1 ...
#> $ totcoll : num 0 7.033 0 0.267 0 ...
#> - attr(*, "time.stamp")= chr "25 Jun 2011 23:03"