I’ve been tinkering a lot recently with the data_algebra, and just released version 0.7.0 to PyPi. In this note I’ll touch on what the data algebra is, what the new features are, and my plans going forward.
One often hears that R can not be fast (false), or more correctly that for fast code in R you may have to consider “vectorizing.” A lot of knowledgable R users are not comfortable with the term “vectorize”, and not really familiar with the method. “Vectorize” is just a slightly […]
We try to keep this blog mostly technical and business (as we assume that is what our readers are here for). However, this post is going to be an exception. I’ve just got back from photographing the Rotary Club of San Francisco‘s 2018 Holiday Party. We had a special guest […]
R is designed to make working with statistical models fast, succinct, and reliable. For instance building a model is a one-liner: model <- lm(Petal.Length ~ Sepal.Length, data = iris) And producing a detailed diagnostic summary of the model is also a one-liner: summary(model) # Call: # lm(formula = Petal.Length ~ […]
Did you know R‘s for() loop control structure drops class annotations from vectors?
We’ve just finished off a series of articles on some recent research results applying differential privacy to improve machine learning. Some of these results are pretty technical, so we thought it was worth working through concrete examples. And some of the original results are locked behind academic journal paywalls, so […]