
Penguins Data for Python

Allison Horst, Alison Hill, and Kristen Gorman are working to make a neat new example data set available to R users: the Palmer penguins. It is a nice alternative to the over-used Iris data set, as it has more rows, some missing values, nicer examples of Simpson’s Paradox, and more than one string-valued or categorical variable. It should be a great teaching tool, and we thank them for the effort to make it easily available to R users.

As a favor to our Python data scientist friends, we have exported this data and worked an example in Python here. So enjoy the new examples, and thank you to Allison Horst, Alison Hill, and Kristen Gorman for sharing their work.
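For readers who want a feel for working with the data in Python, here is a minimal sketch using pandas. It is not the worked example linked above. The function name `summarize_penguins` is hypothetical, the column names (`species`, `island`, `bill_length_mm`, `sex`) are assumed to match the R package, and the three inline rows are made-up placeholders rather than real measurements. Point the function at your own copy of the exported CSV to use it on the real data.

```python
import io

import pandas as pd


def summarize_penguins(csv_source):
    """Load a penguins-style CSV; count missing values and average bill length by species.

    Assumes columns named 'species' and 'bill_length_mm' (an assumption
    based on the R package's naming, not confirmed for the exported file).
    """
    df = pd.read_csv(csv_source)
    # Count missing entries per column; the data set is known to have some.
    missing = df.isna().sum()
    # Group by a categorical column; mean() skips missing values by default.
    means = df.groupby("species")["bill_length_mm"].mean()
    return df, missing, means


# Placeholder rows for illustration only, NOT real penguin measurements.
DEMO_CSV = """species,island,bill_length_mm,sex
Adelie,Torgersen,39.0,male
Adelie,Torgersen,,
Gentoo,Biscoe,47.0,female
"""

df, missing, means = summarize_penguins(io.StringIO(DEMO_CSV))
print(missing)
print(means)
```

The same call accepts a file path in place of the `io.StringIO` object, since `pd.read_csv` takes either.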

Categories: Administrativia data science



Data Scientist and trainer at Win Vector LLC. One of the authors of Practical Data Science with R.
