Allison Horst, Alison Hill, and Kristen Gorman are working to make a neat new example data set available to R users: the Palmer penguins. It is a nice alternative to the over-used Iris data set, as it has more rows, some missing values, nicer examples of Simpson’s Paradox, and more than one string-valued or categorical variable. It should be a great teaching tool, and thank you to them for the effort to make it easily available.
As a favor to our additional Python data scientist friends, we have exported this data and worked an example in Python here. So enjoy the new examples, and thank you to Allison Horst, Alison Hill, and Kristen Gorman for sharing their work.
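For readers who have not met Simpson’s Paradox before, here is a minimal Python sketch of the effect. The numbers are made up for illustration and are not penguin measurements; the group labels and the `pearson` helper are likewise our own hypothetical names. The idea is that a relationship can point one way inside every group and the opposite way once the groups are pooled.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)

# Synthetic (x, y) values, NOT real penguin data.
# Within each group y rises with x, but groups with larger x have lower y.
groups = {
    "group_a": ([1, 2, 3], [10, 11, 12]),
    "group_b": ([4, 5, 6], [6, 7, 8]),
    "group_c": ([7, 8, 9], [2, 3, 4]),
}

# Correlation computed separately inside each group.
within = {name: pearson(xs, ys) for name, (xs, ys) in groups.items()}

# Correlation computed after pooling all groups together.
all_x = [x for xs, _ in groups.values() for x in xs]
all_y = [y for _, ys in groups.values() for y in ys]
overall = pearson(all_x, all_y)

for name, r in within.items():
    print(f"{name}: within-group correlation = {r:.2f}")
print(f"pooled: overall correlation = {overall:.2f}")
```

Every within-group correlation comes out positive while the pooled correlation comes out negative, which is why a grouping variable matters so much when reading a scatter plot.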
Data Scientist and trainer at Win Vector LLC. One of the authors of Practical Data Science with R.