Wooldridge Source: These are data from the 1976 Current Population Survey, collected by Henry Farber when he and I were colleagues at MIT in 1988. Data loads lazily.

data('wage1')

Format

A data.frame with 526 observations on 24 variables:

  • wage: average hourly earnings

  • educ: years of education

  • exper: years of potential experience

  • tenure: years with current employer

  • nonwhite: =1 if nonwhite

  • female: =1 if female

  • married: =1 if married

  • numdep: number of dependents

  • smsa: =1 if live in SMSA

  • northcen: =1 if live in north central U.S.

  • south: =1 if live in southern region

  • west: =1 if live in western region

  • construc: =1 if work in construc. indus.

  • ndurman: =1 if in nondur. manuf. indus.

  • trcommpu: =1 if in trans, commun, pub ut

  • trade: =1 if in wholesale or retail

  • services: =1 if in services indus.

  • profserv: =1 if in prof. serv. indus.

  • profocc: =1 if in profess. occupation

  • clerocc: =1 if in clerical occupation

  • servocc: =1 if in service occupation

  • lwage: log(wage)

  • expersq: exper^2

  • tenursq: tenure^2
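
The last three columns are transformations of earlier ones. The following is a minimal sketch of how that could be checked, assuming only base R. The data frame name `head5` is hypothetical, and its values are the first five rows as displayed by str() in the Examples section, so `wage` and `lwage` are rounded and compared with a tolerance.

```r
# First five rows of the relevant columns, copied from the str() output
# in the Examples section (numeric values are as displayed, i.e. rounded).
head5 <- data.frame(
  wage    = c(3.1, 3.24, 3, 6, 5.3),
  exper   = c(2L, 22L, 2L, 44L, 7L),
  tenure  = c(0L, 2L, 0L, 28L, 2L),
  lwage   = c(1.13, 1.18, 1.1, 1.79, 1.67),
  expersq = c(4L, 484L, 4L, 1936L, 49L),
  tenursq = c(0L, 4L, 0L, 784L, 4L)
)

# lwage is the natural log of wage; the tolerance allows for display rounding.
lwage_ok <- all(abs(log(head5$wage) - head5$lwage) < 0.01)

# expersq and tenursq are exact squares of integer columns.
expersq_ok <- all(head5$exper^2 == head5$expersq)
tenursq_ok <- all(head5$tenure^2 == head5$tenursq)

print(c(lwage = lwage_ok, expersq = expersq_ok, tenursq = tenursq_ok))
```

With the package attached, the same three comparisons could be run on the full `wage1` data frame in place of `head5`.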

Notes

Barry Murphy, of the University of Portsmouth in the UK, has pointed out that for several observations the values for exper and tenure are in logical conflict. In particular, for some workers the number of years with current employer (tenure) is greater than overall work experience (exper). At least some of these conflicts are due to the definition of exper as “potential” work experience, but probably not all. Nevertheless, I am using the data set as it was supplied to me.
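
The conflict described above (tenure greater than exper) can be located with a simple row filter. This is a sketch under stated assumptions: `find_conflicts` is a hypothetical helper, not part of the package, and `first10` holds only the first ten rows of exper and tenure as shown by str() in the Examples section.

```r
# Hypothetical helper: return the rows where years with the current
# employer (tenure) exceed potential experience (exper).
find_conflicts <- function(df) {
  df[df$tenure > df$exper, c("exper", "tenure"), drop = FALSE]
}

# First ten rows of exper and tenure, copied from the str() output
# in the Examples section.
first10 <- data.frame(
  exper  = c(2L, 22L, 2L, 44L, 7L, 9L, 15L, 5L, 26L, 22L),
  tenure = c(0L, 2L, 0L, 28L, 2L, 8L, 7L, 3L, 4L, 21L)
)

conflicts <- find_conflicts(first10)
print(nrow(conflicts))  # 0: none of these ten rows has tenure > exper
```

With the package attached, `find_conflicts(wage1)` would list the affected observations in the full data set.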

Used in Text: pages 7, 17, 33-34, 37, 76, 91, 125, 183, 194-195, 220, 231, 234, 235-236, 240-241, 243-244, 263, 272, 326, 678

Examples

 str(wage1)
#> 'data.frame':	526 obs. of  24 variables:
#>  $ wage    : num  3.1 3.24 3 6 5.3 ...
#>  $ educ    : int  11 12 11 8 12 16 18 12 12 17 ...
#>  $ exper   : int  2 22 2 44 7 9 15 5 26 22 ...
#>  $ tenure  : int  0 2 0 28 2 8 7 3 4 21 ...
#>  $ nonwhite: int  0 0 0 0 0 0 0 0 0 0 ...
#>  $ female  : int  1 1 0 0 0 0 0 1 1 0 ...
#>  $ married : int  0 1 0 1 1 1 0 0 0 1 ...
#>  $ numdep  : int  2 3 2 0 1 0 0 0 2 0 ...
#>  $ smsa    : int  1 1 0 1 0 1 1 1 1 1 ...
#>  $ northcen: int  0 0 0 0 0 0 0 0 0 0 ...
#>  $ south   : int  0 0 0 0 0 0 0 0 0 0 ...
#>  $ west    : int  1 1 1 1 1 1 1 1 1 1 ...
#>  $ construc: int  0 0 0 0 0 0 0 0 0 0 ...
#>  $ ndurman : int  0 0 0 0 0 0 0 0 0 0 ...
#>  $ trcommpu: int  0 0 0 0 0 0 0 0 0 0 ...
#>  $ trade   : int  0 0 1 0 0 0 1 0 1 0 ...
#>  $ services: int  0 1 0 0 0 0 0 0 0 0 ...
#>  $ profserv: int  0 0 0 0 0 1 0 0 0 0 ...
#>  $ profocc : int  0 0 0 0 0 1 1 1 1 1 ...
#>  $ clerocc : int  0 0 0 1 0 0 0 0 0 0 ...
#>  $ servocc : int  0 1 0 0 0 0 0 0 0 0 ...
#>  $ lwage   : num  1.13 1.18 1.1 1.79 1.67 ...
#>  $ expersq : int  4 484 4 1936 49 81 225 25 676 484 ...
#>  $ tenursq : int  0 4 0 784 4 64 49 9 16 441 ...
#>  - attr(*, "time.stamp")= chr "25 Jun 2011 23:03"