In Gelman and Nolan's paper “You Can Load a Die, But You Can't Bias a Coin” (The American Statistician, November 2002, Vol. 56, No. 4), it is argued that you can't easily produce a coin that is biased when flipped (and caught). A number of variations that can be easily biased (such as spinning) are also ...

Proud to share Win-Vector LLC's new (pay) statistics course: Campaign Response Testing
I am proud to announce a new Win-Vector LLC statistics video course: Campaign Response Testing. John Mount, Win-Vector LLC. This course works through the very specific statistics problem of trying to estimate the unknown true response rates of one or more populations in responding to ...

New technical article: "Using closures as objects in R". Some writing on R as a programming language.

Don’t use the Sharpe ratio to A/B test email campaigns 
Having worked in finance I am a public fan of the Sharpe ratio. I have written about this here and here. One thing I have often forgotten (driving some bad analyses) is: the Sharpe ratio isn't appropriate for models of repeated events that already have linked mean and variance (such as Poisson ...
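A rough illustration of the "linked mean and variance" point, not taken from the article itself: a Poisson count with rate r has both mean and variance equal to r, so a Sharpe-like ratio (here assumed to be mean divided by standard deviation, with a zero benchmark) is just sqrt(r) and carries no information beyond the mean. The function names below are made up for this sketch.

```python
import math
import random

def poisson_draw(rate, rng):
    """Draw one Poisson(rate) count using Knuth's multiplication method."""
    threshold = math.exp(-rate)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count

def sharpe_like_ratio(values):
    """Mean over sample standard deviation (assumption: benchmark of 0)."""
    n = len(values)
    mean = sum(values) / n
    var = sum((v - mean) ** 2 for v in values) / (n - 1)
    return mean / math.sqrt(var)

def poisson_sharpe_theory(rate):
    """For a Poisson count, mean = variance = rate, so the ratio is sqrt(rate)."""
    return rate / math.sqrt(rate)

rng = random.Random(2015)  # fixed seed so the sketch is repeatable
for rate in (1.0, 4.0, 9.0):
    draws = [poisson_draw(rate, rng) for _ in range(20000)]
    # The simulated ratio should track sqrt(rate), a function of the mean alone.
    print(rate, round(sharpe_like_ratio(draws), 2), poisson_sharpe_theory(rate))
```

In this setup, ranking campaigns by the ratio is the same as ranking them by the mean count, so the variance adjustment adds nothing.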

Win-Vector LLC is proud to announce the R data science value pack. 50% off our video course Introduction to Data Science (available at Udemy) and 30% off Practical Data Science with R (from Manning). Pick any combination of video, e-book, and/or print-book you want. Instructions below.

Nina Zumel and I are proud to announce our new data science video course: Introduction to Data Science.  Here is a half-off coupon for those of you who want to check it out (should be good for about 2 weeks): 

As an #R programmer have you ever wondered what can be in a data.frame column?
If you ask an R programmer, the commonly depended-upon properties of data.frame columns are: All columns in a data frame have the same length. (true, but with an asterisk); All columns in a data frame are vectors with type (see help(typeof) ) and class (see help(class) ) deriving from one of ...

New technical R article, where I get to use the term "unfulfilled promise leak"
One of the advantages of functional languages (such as R) is the ability to create and return functions “on the fly.” We will discuss one good use of this capability and what to look out for when creating functions in R. Why wrap/return functions? One of my favorite uses of “on the fly ...
Yes, it turns out R is a functional language with only immutable data structures (but mutable environments).  It is essentially a Scheme (it has static/lexical closures) executing fexprs (function-like things that take lazy arguments).

It just, through some syntactic sugar and environment-mutation foo, pretends to be imperative or object oriented (and it is not good at object oriented).

Deal of the Day March 15: Half off my book Practical Data Science with R. Use code dotd031515au at

The Win-Vector LLC value pack!

Half off Introduction to Data Science video course:

10% off Practical Data Science with R book

Free in-depth blog content:

And Win-Vector LLC consulting services:

I don't just write about ghosty folklore. I write about folk theorems, too.
It's a folk theorem I sometimes hear from colleagues and clients: that you must balance the class prevalence before training a classifier. Certainly, I believe that classification tends to be easier when the classes are nearly balanced, especially when the class you are actually interested in is ...
I produce applied research, prototyping and training in information extraction, algorithms and data-mining for web-scale businesses, hedge funds and start ups. Right now I do this as a consultant at Win-Vector LLC. 

Please check out our book Practical Data Science with R

Also check out the Win-Vector LLC blog and our Twitter feed.
