Wooldridge Source: Dr. Zhehui Luo, a recent MSU Ph.D. in economics and Visiting Research Associate in the Department of Epidemiology at MSU, kindly provided these data. She obtained them from state files linking birth and infant death certificates, and from the National Center for Health Statistics natality and mortality data. Data loads lazily.

data('bwght2')

Format

A data.frame with 1832 observations on 23 variables:

  • mage: mother's age, years

  • meduc: mother's educ, years

  • monpre: month prenatal care began

  • npvis: total number of prenatal visits

  • fage: father's age, years

  • feduc: father's educ, years

  • bwght: birth weight, grams

  • omaps: one minute apgar score

  • fmaps: five minute apgar score

  • cigs: avg cigarettes per day

  • drink: avg drinks per week

  • lbw: =1 if bwght <= 2000

  • vlbw: =1 if bwght <= 1500

  • male: =1 if baby male

  • mwhte: =1 if mother white

  • mblck: =1 if mother black

  • moth: =1 if mother is other

  • fwhte: =1 if father white

  • fblck: =1 if father black

  • foth: =1 if father is other

  • lbwght: log(bwght)

  • magesq: mage^2

  • npvissq: npvis^2

Notes

There are many possibilities with this data set. In addition to number of prenatal visits, smoking and alcohol consumption (during pregnancy) are included as explanatory variables. These can be added to equations of the kind found in Exercise C6.10. In addition, the one- and five-minute APGAR scores are included. These are measures of the well being of infants just after birth. An interesting feature of the score is that it is bounded between zero and 10, making a linear model less than ideal. Still, a linear model would be informative, and you might ask students about predicted values less than zero or greater than 10.

Used in Text: pages 184, 223

Examples

 str(bwght2)
#> 'data.frame':	1832 obs. of  23 variables:
#>  $ mage   : int  26 29 33 28 23 28 27 41 32 16 ...
#>  $ meduc  : int  12 12 12 17 13 12 16 17 12 11 ...
#>  $ monpre : int  2 2 1 5 2 1 3 6 3 2 ...
#>  $ npvis  : int  12 12 12 8 6 12 11 8 11 10 ...
#>  $ fage   : int  34 32 36 32 24 30 28 56 36 21 ...
#>  $ feduc  : int  16 12 16 17 16 16 16 NA 16 10 ...
#>  $ bwght  : int  3060 3730 2530 3289 3590 3420 3355 3459 3590 4410 ...
#>  $ omaps  : int  9 8 8 8 6 9 9 8 9 6 ...
#>  $ fmaps  : int  9 9 9 9 8 9 9 9 9 9 ...
#>  $ cigs   : int  0 NA 0 0 0 0 0 0 0 0 ...
#>  $ drink  : int  0 NA 0 0 0 0 0 0 0 0 ...
#>  $ lbw    : int  0 0 0 0 0 0 0 0 0 0 ...
#>  $ vlbw   : int  0 0 0 0 0 0 0 0 0 0 ...
#>  $ male   : int  1 0 0 1 1 0 1 0 0 1 ...
#>  $ mwhte  : int  0 1 1 1 1 1 1 0 1 1 ...
#>  $ mblck  : int  0 0 0 0 0 0 0 1 0 0 ...
#>  $ moth   : int  1 0 0 0 0 0 0 0 0 0 ...
#>  $ fwhte  : int  0 1 1 1 1 1 1 0 1 1 ...
#>  $ fblck  : int  0 0 0 0 0 0 0 0 0 0 ...
#>  $ foth   : int  1 0 0 0 0 0 0 1 0 0 ...
#>  $ lbwght : num  8.03 8.22 7.84 8.1 8.19 ...
#>  $ magesq : int  676 841 1089 784 529 784 729 1681 1024 256 ...
#>  $ npvissq: int  144 144 144 64 36 144 121 64 121 100 ...
#>  - attr(*, "time.stamp")= chr "25 Jun 2011 23:03"