We have added a worked example to the README of our experimental logistic regression code. The Logistic codebase is designed to support experimentation on variations of logistic regression, including: a pure Java implementation (thus directly usable in Java server environments); a simple multinomial implementation (that allows more than two possible […]
A constant problem in computer science, since its inception, is how to manipulate data that is larger than machine memory. Here we present some general strategies for working “out of core,” that is, what you should do when you run out of memory. Early computers were most limited by their paltry […]
Our friends at Dataspora have a nice article on the more modern Map Reduce languages. It is a very good read, and clearly a lot of thought went into preparing it.
Living in the age of big data, we ask: what should we do when we have the good fortune to be presented with a huge amount of supervised training data? Most often at large scale we are presented with the unsupervised problems of characterization and information extraction, but some problem domains […]