Wooldridge Source: These data were used in the doctoral dissertation of Jeffrey Blend, Department of Agricultural Economics, Michigan State University, 1998. The thesis was supervised by Professor Eileen van Ravensway. Drs. Blend and van Ravensway kindly provided the data, which were obtained from a telephone survey conducted by the Institute for Public Policy and Social Research at MSU. Data loads lazily.

data('apple')

Format

A data.frame with 660 observations on 17 variables:

  • id: respondent identifier

  • educ: years schooling

  • date: date: month/day/year

  • state: home state

  • regprc: price of regular apples

  • ecoprc: price of ecolabeled apples

  • inseason: =1 if interviewed in Nov.

  • hhsize: household size

  • male: =1 if male

  • faminc: family income, thousands

  • age: in years

  • reglbs: quantity regular apples, pounds

  • ecolbs: quantity ecolabeled apples, lbs

  • numlt5: # in household younger than 5

  • num5_17: # in household 5 to 17

  • num18_64: # in household 18 to 64

  • numgt64: # in household older than 64

Source

https://www.cengage.com/cgi-wadsworth/course_products_wp.pl?fid=M20b&product_isbn_issn=9781111531041

Notes

This data set is close to a true experimental data set because the price pairs facing a family were randomly determined. In other words, the family head was presented with prices for the eco-labeled and regular apples, and then asked how much of each kind of apple the family would buy at the given prices. As predicted by basic economics, the own price effect is negative (and strong) and the cross price effect is positive (and strong). While the main dependent variable, ecolbs, piles up at zero, estimating a linear model is still worthwhile. Interestingly, because the survey design induces a strong positive correlation between the prices of eco-labeled and regular apples, there is an omitted variable problem if either of the price variables is dropped from the demand equation. A good exam question is to show a simple regression of ecolbs on ecoprc and then a multiple regression on both prices, and ask students to decide whether the price variables must be positively or negatively correlated.

Used in Text: pages 201, 223, 266, 626-627

Examples

str(apple)
#> 'data.frame': 660 obs. of 17 variables: #> $ id : int 10002 10004 10034 10035 10039 10041 10046 10048 10050 10057 ... #> $ educ : int 16 16 18 12 15 12 14 16 17 12 ... #> $ date : chr "111597" "121897" "111097" "111597" ... #> $ state : chr "SD" "KS" "MI" "TN" ... #> $ regprc : num 1.19 0.59 0.59 0.89 0.89 ... #> $ ecoprc : num 1.19 0.79 0.99 1.09 1.09 ... #> $ inseason: int 1 0 1 1 0 1 1 0 1 1 ... #> $ hhsize : int 4 1 3 2 1 4 5 4 1 2 ... #> $ male : int 0 0 0 1 1 0 0 0 1 0 ... #> $ faminc : int 45 65 65 55 25 15 25 45 105 25 ... #> $ age : int 43 37 44 55 22 34 72 38 50 69 ... #> $ reglbs : num 2 0 0 3 0 0 0 0 0 0 ... #> $ ecolbs : num 2 2 2.67 0 3 ... #> $ numlt5 : int 0 0 0 0 0 0 0 0 0 0 ... #> $ num5_17 : int 1 0 2 0 0 2 1 2 0 0 ... #> $ num18_64: int 3 1 1 2 1 2 2 2 1 0 ... #> $ numgt64 : int 0 0 0 0 0 0 2 0 0 2 ... #> - attr(*, "time.stamp")= chr "25 Jun 2011 23:03"