Wooldridge Source: These are data from the 1976 Current Population Survey, collected by Henry Farber when he and I were colleagues at MIT in 1988. Data loads lazily.
data('wage1')
A data.frame with 526 observations on 24 variables:
wage: average hourly earnings
educ: years of education
exper: years potential experience
tenure: years with current employer
nonwhite: =1 if nonwhite
female: =1 if female
married: =1 if married
numdep: number of dependents
smsa: =1 if live in SMSA
northcen: =1 if live in north central U.S
south: =1 if live in southern region
west: =1 if live in western region
construc: =1 if work in construc. indus.
ndurman: =1 if in nondur. manuf. indus.
trcommpu: =1 if in trans, commun, pub ut
trade: =1 if in wholesale or retail
services: =1 if in services indus.
profserv: =1 if in prof. serv. indus.
profocc: =1 if in profess. occupation
clerocc: =1 if in clerical occupation
servocc: =1 if in service occupation
lwage: log(wage)
expersq: exper^2
tenursq: tenure^2
https://www.cengage.com/cgi-wadsworth/course_products_wp.pl?fid=M20b&product_isbn_issn=9781111531041
Barry Murphy, of the University of Portsmouth in the UK, has pointed out that for several observations the values for exper and tenure are in logical conflict. In particular, for some workers the number of years with current employer (tenure) is greater than overall work experience (exper). At least some of these conflicts are due to the definition of exper as “potential” work experience, but probably not all. Nevertheless, I am using the data set as it was supplied to me.
Used in Text: pages 7, 17, 33-34, 37, 76, 91, 125, 183, 194-195, 220, 231, 234, 235-236, 240-241, 243-244, 263, 272, 326, 678
str(wage1)
#> 'data.frame': 526 obs. of 24 variables:
#> $ wage : num 3.1 3.24 3 6 5.3 ...
#> $ educ : int 11 12 11 8 12 16 18 12 12 17 ...
#> $ exper : int 2 22 2 44 7 9 15 5 26 22 ...
#> $ tenure : int 0 2 0 28 2 8 7 3 4 21 ...
#> $ nonwhite: int 0 0 0 0 0 0 0 0 0 0 ...
#> $ female : int 1 1 0 0 0 0 0 1 1 0 ...
#> $ married : int 0 1 0 1 1 1 0 0 0 1 ...
#> $ numdep : int 2 3 2 0 1 0 0 0 2 0 ...
#> $ smsa : int 1 1 0 1 0 1 1 1 1 1 ...
#> $ northcen: int 0 0 0 0 0 0 0 0 0 0 ...
#> $ south : int 0 0 0 0 0 0 0 0 0 0 ...
#> $ west : int 1 1 1 1 1 1 1 1 1 1 ...
#> $ construc: int 0 0 0 0 0 0 0 0 0 0 ...
#> $ ndurman : int 0 0 0 0 0 0 0 0 0 0 ...
#> $ trcommpu: int 0 0 0 0 0 0 0 0 0 0 ...
#> $ trade : int 0 0 1 0 0 0 1 0 1 0 ...
#> $ services: int 0 1 0 0 0 0 0 0 0 0 ...
#> $ profserv: int 0 0 0 0 0 1 0 0 0 0 ...
#> $ profocc : int 0 0 0 0 0 1 1 1 1 1 ...
#> $ clerocc : int 0 0 0 1 0 0 0 0 0 0 ...
#> $ servocc : int 0 1 0 0 0 0 0 0 0 0 ...
#> $ lwage : num 1.13 1.18 1.1 1.79 1.67 ...
#> $ expersq : int 4 484 4 1936 49 81 225 25 676 484 ...
#> $ tenursq : int 0 4 0 784 4 64 49 9 16 441 ...
#> - attr(*, "time.stamp")= chr "25 Jun 2011 23:03"