We have had some trouble with some articles being damaged or hard to access in the Win Vector blog. I (John Mount) do want to apologize for that. In particular the graphs are missing for Dr. Nina Zumel’s wonderful y-aware Pricipal Components regression series. The complete R .md and .Rmd […]
Estimated reading time: 2 minutes
Just a quick administrative note. To lower the number of dependencies in our Jupyter to Python converter (text and video tutorial here) I have moved the other data science tools (and their dependencies) out of the wvpy package and into a new package named wvu (“Win Vector University”). This will, […]
Estimated reading time: 1 minute
I am pleased to announce the 0.9.0 release of the data algebra. The data algebra is realization of the Codd relational algebra for data in written in terms of Python method chaining. It allows the concise clear specification of useful data transforms. Some examples can be found here. Benefits include […]
Estimated reading time: 1 minute
Back to teaching. For a few years we’ve been running a data science intensive at for a really neat FAAMG company. The idea is to give engineers some hands on live workbook time using methods varying from linear regression, xgboost, to deep neural networks. Learning how participants progress and internalize […]
Estimated reading time: 1 minute
Our book, Practical Data Science with R, just had its first year anniversary! The book is doing great, if you are working with R and data I recommend you check it out. (link)
Estimated reading time: 22 seconds
I’d like to address how and why I am making the recent light-board video lectures (please check them out: A/B testing and Simpson’s Paradox, and Bayes’s Law and Odds). How How is the easy part. There are a number of tutorials on how to do this. The one I found […]
Estimated reading time: 5 minutes
The core of our “statistics to English translation” series is Nina Zumel’s sequence of articles: “I don’t think that means what you think it means;” Statistics to English Translation, Part 1: Accuracy Measures Statistics to English Translation, Part 2a: ’Significant’ Doesn’t Always Mean ’Important’ Statistics to English Translation, Part 2b: […]
Estimated reading time: 55 seconds