jtrain98 — jtrain98 • wooldridge

Wooldridge Source: This is a data set I created many years ago intended as an update to the files JTRAIN2 and JTRAIN3. While the data were partly generated by me, the data attributes are similar to data sets used to evaluate job training programs. Data loads lazily.

data('jtrain98')

Format

A data.frame with 1130 observations on 10 variables:

train: =1 if in job training
age: in years
educ: years of schooling
black: =1 if black
hisp: =1 if Hispanic
married: =1 if married
earn96: earnings in 1996, $1000s
unem96: =1 if unemployed all of 1995
earn98: earnings in 1998, $1000s
unem98: =1 if unemployed all of 1998

Source

http://www.cengage.com/c/introductory-econometrics-a-modern-approach-7e-wooldridge

Notes

The response variables, earn98 and unem98, both have discreteness: the former is a corner solutions (takes on the value zero and then a range of strictly positive values) and the latter is binary. One could use these in an exercise using methods in Chapter 17. unem98 can be used in a probit or logit model, earn98 in a Tobit model, or in Poisson regression (without assuming, of course, that the Poisson distribution is correct).

Used in Text: 101-102, 248, 601

Examples

 str(jtrain98)
#> 'data.frame':	1130 obs. of  10 variables:
#>  $ train  : int  0 0 1 1 0 1 0 1 0 1 ...
#>  $ age    : int  37 30 27 33 22 23 32 22 19 21 ...
#>  $ educ   : int  11 12 11 8 9 12 11 16 9 13 ...
#>  $ black  : int  1 1 1 1 1 1 1 1 1 1 ...
#>  $ hisp   : int  0 0 0 0 0 0 0 0 0 0 ...
#>  $ married: int  1 0 0 0 0 0 0 0 0 0 ...
#>  $ earn96 : num  0 0 0 0 0 0 0 0 0 0 ...
#>  $ unem96 : int  1 1 1 1 1 1 1 1 1 1 ...
#>  $ earn98 : num  1.62 0 0 9.07 4.02 ...
#>  $ unem98 : int  0 1 1 0 0 0 0 0 1 0 ...
#>  - attr(*, "time.stamp")= chr "21 Dec 2018 17:08"
#>  - attr(*, "label.table")= list()
#>  - attr(*, "expansion.fields")= list()
#>  - attr(*, "byteorder")= chr "LSF"
#>  - attr(*, "orig.dim")= int [1:2] 1130 10