Menu Home

Penguins Data for Python

Allison Horst, Alison Hill, and Kristen Gorman are working to make a neat new example data set available to R users: the palmer penguins. It is a nice alternative to the over-used Iris data set as it has more rows, some missing values, nicer examples of Simpson’s Paradox, and more than one string-valued or categorical variable. It should be a great teaching tool, and thank you to them for the effort to make it easily available to R users.

As favor to our additional Python data scientist friends we have exported this data and worked an example in Python here. So enjoy the new examples, and thank you to Allison Horst, Alison Hill, and Kristen Gorman for sharing their work.

Categories: Administrativia data science

Tagged as:

jmount

Data Scientist and trainer at Win Vector LLC. One of the authors of Practical Data Science with R.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Google photo

You are commenting using your Google account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s

%d bloggers like this: