Galton’s work on children and parents’ height was published in: Galton, F. (1886): “Regression
towards mediocrity in hereditary stature”, Journal of the Anthropological Institute, 15: 246-63. In
this first part of the project you are asked to reconstruct the original data from this original article and replicate his analysis.
it on LEARN. On Table I of his article, the data used is summarized. You need to create a
STATA data set that contains the 928 observations that Galton collected. It is recommended
that you first type the data in an excel file and then have STATA read that file. Some versions of the Galton data set are available online. You are advised NOT to use them. It is part of this project that you show that you understand how to make a data set from such a table. There are important conceptual issues that you will miss if you borrow the data from somewhere else.
(i) For those observations reported in Table I of Galton’s article as “below” or “above” the minimum and maximum height values, you need to assume some particular values. Please state these explicitly in a table (Table 1.1.a.) and provide a justification with one sentence.
(ii) Given your assumptions, what is the sample mean height and standard deviation for adult children and for parents, respectively? Report this in a table (Table 1.1.b.).
(i) Are children of “tall parents” as tall as their parents? And similarly, are children of “short parents” as short as their parents? Report your results in a table.
(ii) Does the assumption of having 928 parents rather than 205 matter for this exercise?
(i) Regress the height of adult children against the height of parents. Report your results in a table and interpret the estimated coe