Part 1: Analysis with Galton’s original data set
Galton’s work on the heights of children and their parents was published in: Galton, F. (1886): “Regression towards mediocrity in hereditary stature”, Journal of the Anthropological Institute, 15: 246-63. In this first part of the project you are asked to reconstruct the original data from this article and replicate his analysis.
(i) For those observations reported in Table I of Galton’s article as “below” or “above” the minimum and maximum height values, you need to assume some particular values. State these explicitly in a table (Table 1.1.a.) and justify them in one sentence.
(ii) Given your assumptions, what is the sample mean height and standard deviation for adult children and for parents, respectively? Report this in a table (Table 1.1.b.).
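As a minimal sketch of how a frequency table can be turned into individual observations and summarized, the Python snippet below expands a small table into one row per child. The counts and heights are hypothetical toy values, not Galton’s figures, and the names `cells` and `summarize` are made up for illustration. It assumes the open-ended “below” and “above” bins have already been replaced by the values you chose in (i).

```python
import numpy as np

# Hypothetical toy frequency table, NOT Galton's actual counts.
# Each row: (parent height, child height, number of children), in inches.
# Open-ended "below"/"above" bins are assumed to be already replaced by chosen values.
cells = [
    (64.0, 63.0, 2),
    (64.0, 66.0, 1),
    (68.0, 67.0, 3),
    (68.0, 69.0, 2),
    (72.0, 70.0, 1),
    (72.0, 73.0, 1),
]

counts = [c[2] for c in cells]
# Repeat each cell's heights once per child so every child becomes one observation.
parent = np.repeat([c[0] for c in cells], counts)
child = np.repeat([c[1] for c in cells], counts)

def summarize(x):
    """Return the sample mean and the sample standard deviation (n - 1 denominator)."""
    return float(np.mean(x)), float(np.std(x, ddof=1))

parent_mean, parent_sd = summarize(parent)
child_mean, child_sd = summarize(child)
print(f"parents:  mean={parent_mean:.2f}, sd={parent_sd:.2f}")
print(f"children: mean={child_mean:.2f}, sd={child_sd:.2f}")
```

Using `ddof=1` gives the sample standard deviation; whether that or the population version is wanted is a choice to state in your table.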
(i) Are children of “tall parents” as tall as their parents? And similarly, are children of “short parents” as short as their parents? Report your results in a table.
(ii) Does the assumption of having 928 parents rather than 205 matter for this exercise?
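One possible way to set up the comparison is sketched below in Python. The data are hypothetical toy values with one row per child, and the cut-offs `short_cut` and `tall_cut` are assumptions, since the definition of “tall” and “short” is left to you.

```python
import numpy as np

# Hypothetical toy data in inches (one row per child), NOT Galton's values.
parent = np.array([64.0, 64.0, 65.0, 68.0, 68.0, 71.0, 72.0, 72.0])
child = np.array([65.0, 66.0, 66.0, 67.0, 69.0, 70.0, 70.0, 71.0])

# Assumed cut-offs for "short" and "tall" parents; pick and justify your own.
short_cut, tall_cut = 66.0, 70.0

def group_means(mask):
    """Mean parent height and mean child height within the selected group."""
    return float(parent[mask].mean()), float(child[mask].mean())

tall_parent_mean, tall_child_mean = group_means(parent >= tall_cut)
short_parent_mean, short_child_mean = group_means(parent <= short_cut)

print(f"tall:  parents {tall_parent_mean:.2f}, children {tall_child_mean:.2f}")
print(f"short: parents {short_parent_mean:.2f}, children {short_child_mean:.2f}")
```

With one row per child, each parental height is counted once for every child in the family, so the parent means here are weighted by family size. That weighting is the point at issue in (ii).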
(i) Regress the height of adult children against the height of parents. Report your results in a table and interpret the estimated coefficients.
(ii) What can you say about the relationship between the height of parents and their children? How does it relate to the findings in question 1.2? You can answer these questions with a short paragraph and a graph.
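The regression in (i) can be sketched in Python with an ordinary least squares fit of child height on parent height. The arrays below are hypothetical toy values, not the reconstructed Galton data, and `np.polyfit` is just one convenient way to get the slope and intercept; a statistics package would also report standard errors.

```python
import numpy as np

# Hypothetical toy data in inches (one row per child), NOT Galton's values.
parent = np.array([64.0, 64.0, 65.0, 68.0, 68.0, 71.0, 72.0, 72.0])
child = np.array([65.0, 66.0, 66.0, 67.0, 69.0, 70.0, 70.0, 71.0])

# Degree-1 least squares fit; polyfit returns the highest-degree coefficient first.
slope, intercept = np.polyfit(parent, child, deg=1)

# Fitted values and residuals, useful for a scatter plot with the regression line.
fitted = intercept + slope * parent
residuals = child - fitted

print(f"child = {intercept:.2f} + {slope:.3f} * parent")
```

A slope between 0 and 1 means that a one-inch difference in parental height is associated with less than a one-inch difference in predicted child height.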
(i) Calculate the predicted height of adult children whose parents are “tall” after 1, 2, 3, …, Z generations. Similarly, what is your prediction for the height of adult children whose parents are “short” after 1, 2, 3, …, Z generations? Report your results in a table. Is there convergence in heights? If so, how many generations does it take?
(ii) How do you interpret the results? Did Galton do something wrong in his regression? You can answer this question with a short paragraph.
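The multi-generation prediction in (i) can be sketched by applying the fitted line repeatedly, so that each generation’s predicted height becomes the next generation’s parental height. The coefficients, starting heights and tolerance below are hypothetical placeholders, not estimates from Galton’s data. The sketch only iterates the line mechanically; whether that is a sensible reading of the regression is the question in (ii).

```python
# Hypothetical coefficients for illustration only; substitute your own estimates.
intercept, slope = 24.0, 0.65

def predict_generations(start_height, n_generations):
    """Apply h -> intercept + slope * h repeatedly and return each generation's prediction."""
    heights = []
    h = start_height
    for _ in range(n_generations):
        h = intercept + slope * h
        heights.append(h)
    return heights

# Height at which the prediction reproduces itself (exists because slope != 1).
fixed_point = intercept / (1.0 - slope)

def generations_to_converge(start_height, tol=0.1, max_generations=1000):
    """First generation whose prediction is within tol inches of the fixed point, else None."""
    h = start_height
    for t in range(1, max_generations + 1):
        h = intercept + slope * h
        if abs(h - fixed_point) < tol:
            return t
    return None

tall_path = predict_generations(72.0, 5)   # assumed "tall" starting height
short_path = predict_generations(64.0, 5)  # assumed "short" starting height
print([round(h, 2) for h in tall_path])
print([round(h, 2) for h in short_path])
print(round(fixed_point, 2), generations_to_converge(72.0))
```

The number of generations reported depends on the tolerance `tol`, so any convergence claim should state the threshold used.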