A big thank you to Dmytro Perepolkin for sharing a “Keep Calm and Use vtreat” poster!
Also, we have translated the Python vtreat steps from our recent “Cross-Methods are a Leak/Variance Trade-Off” article into R vtreat steps here.
This R-port demonstrates the new to R fit/prepare notation!
We want vtreat to be a platform agnostic (works in R, works in Python, works elsewhere) well documented standard methodology.
To this end: Nina and I have re-organized the basic vtreat use documentation as follows:
- Regression:
Rregression example, fit/prepare
interface,
Rregression example, design/prepare/experiment
interface,
Pythonregression
example. - Classification:
Rclassification example, fit/prepare
interface,
Rclassification example, design/prepare/experiment
interface,
Pythonclassification
example. - Unsupervised tasks:
Runsupervised example, fit/prepare
interface,
Runsupervised example, design/prepare/experiment
interface,
Pythonunsupervised
example. - Multinomial classification:
Rmultinomial classification
example, fit/prepare
interface,
Rmultinomial classification example, design/prepare/experiment
interface,
Pythonmultinomial classification
example.
Categories: Administrativia data science Practical Data Science Pragmatic Data Science Pragmatic Machine Learning Statistics Tutorials
jmount
Data Scientist and trainer at Win Vector LLC. One of the authors of Practical Data Science with R.
