Menu Home

Using the data algebra for Statistics and Data Science

I have a new intermediate introduction on the data algebra up here: Using the data algebra for Statistics and Data Science.

The data algebra is a tool for data processing in Python which is implemented on top of any of Pandas, Google BigQuery, PostgreSQL, MySQL, Spark, and SQLite. It allows you to develop data processing pipelines incrementally and then use and re-use them on different data sets in different data stores.

Please check it out.

Categories: data science Programming Tutorials

Tagged as:

John Mount

Leave a Reply

%d bloggers like this: