Wooldridge Source: Dr. Zhehui Luo, a recent MSU Ph.D. in economics and Visiting Research Associate in the Department of Epidemiology at MSU, kindly provided these data. She obtained them from state files linking birth and infant death certificates, and from the National Center for Health Statistics natality and mortality data. Data loads lazily.

data('bwght2')

Format

A data.frame with 1832 observations on 23 variables:

  • mage: mother's age, years

  • meduc: mother's educ, years

  • monpre: month prenatal care began

  • npvis: total number of prenatal visits

  • fage: father's age, years

  • feduc: father's educ, years

  • bwght: birth weight, grams

  • omaps: one minute apgar score

  • fmaps: five minute apgar score

  • cigs: avg cigarettes per day

  • drink: avg drinks per week

  • lbw: =1 if bwght <= 2000

  • vlbw: =1 if bwght <= 1500

  • male: =1 if baby male

  • mwhte: =1 if mother white

  • mblck: =1 if mother black

  • moth: =1 if mother is other

  • fwhte: =1 if father white

  • fblck: =1 if father black

  • foth: =1 if father is other

  • lbwght: log(bwght)

  • magesq: mage^2

  • npvissq: npvis^2

Source

https://www.cengage.com/cgi-wadsworth/course_products_wp.pl?fid=M20b&product_isbn_issn=9781111531041

Notes

There are many possibilities with this data set. In addition to number of prenatal visits, smoking and alcohol consumption (during pregnancy) are included as explanatory variables. These can be added to equations of the kind found in Exercise C6.10. In addition, the one- and five-minute APGAR scores are included. These are measures of the well being of infants just after birth. An interesting feature of the score is that it is bounded between zero and 10, making a linear model less than ideal. Still, a linear model would be informative, and you might ask students about predicted values less than zero or greater than 10.

Used in Text: pages 184, 223

Examples

str(bwght2)
#> 'data.frame': 1832 obs. of 23 variables: #> $ mage : int 26 29 33 28 23 28 27 41 32 16 ... #> $ meduc : int 12 12 12 17 13 12 16 17 12 11 ... #> $ monpre : int 2 2 1 5 2 1 3 6 3 2 ... #> $ npvis : int 12 12 12 8 6 12 11 8 11 10 ... #> $ fage : int 34 32 36 32 24 30 28 56 36 21 ... #> $ feduc : int 16 12 16 17 16 16 16 NA 16 10 ... #> $ bwght : int 3060 3730 2530 3289 3590 3420 3355 3459 3590 4410 ... #> $ omaps : int 9 8 8 8 6 9 9 8 9 6 ... #> $ fmaps : int 9 9 9 9 8 9 9 9 9 9 ... #> $ cigs : int 0 NA 0 0 0 0 0 0 0 0 ... #> $ drink : int 0 NA 0 0 0 0 0 0 0 0 ... #> $ lbw : int 0 0 0 0 0 0 0 0 0 0 ... #> $ vlbw : int 0 0 0 0 0 0 0 0 0 0 ... #> $ male : int 1 0 0 1 1 0 1 0 0 1 ... #> $ mwhte : int 0 1 1 1 1 1 1 0 1 1 ... #> $ mblck : int 0 0 0 0 0 0 0 1 0 0 ... #> $ moth : int 1 0 0 0 0 0 0 0 0 0 ... #> $ fwhte : int 0 1 1 1 1 1 1 0 1 1 ... #> $ fblck : int 0 0 0 0 0 0 0 0 0 0 ... #> $ foth : int 1 0 0 0 0 0 0 1 0 0 ... #> $ lbwght : num 8.03 8.22 7.84 8.1 8.19 ... #> $ magesq : int 676 841 1089 784 529 784 729 1681 1024 256 ... #> $ npvissq: int 144 144 144 64 36 144 121 64 121 100 ... #> - attr(*, "time.stamp")= chr "25 Jun 2011 23:03"