Tommy Jones
54 followers
Marine, Statistician, Data Scientist


textmineR 2.1.0 is up
Over the weekend I released textmineR 2.1.0 to CRAN. The current version contains a couple minor updates and 5 vignettes to get you up and running with text mining. The vignettes cover the philosophy of textmineR, basic corpus statistics, document clusterin...
biasedestimates.com

A few things I'm working on...
I've got a few things in the pipe over the next 6 months or so that I want to get out of my brain and on to paper. Some of them will even end up on this blog! A proper vignette for textmineR: It turns out that "here you go, just read the documentation" isn't...

textmineR has a logo
I was at the EARL conference in San Francisco a couple of months ago and got inspiration from Airbnb. Airbnb has its own R package it uses internally. To gin up interest and encourage employees to use it and contribute to it, they distributed swag. So, in that...
biasedestimates.com

Weird Error: fatal error in wrapper code
I suppose I'm publishing this so that I can save the next programmer the effort of tracking down the source of a weird error thrown by mclapply: "fatal error in wrapper code". What? The cause, according to this, is that (I think) mclapply is using too many t...

textmineR
"textmineR's number one concern is usability! Thanks for looking out for us @thos_jones #datadc #NLP pic.twitter.com/iIfFBNaALW" (Danielle Beaulieu, @andDunny, April 27, 2016) I (quietly) released an R package back in January, textmineR. It's a text mining t...
biasedestimates.com

More on statisticians in data science
The November issue of AMSTAT News has published an opinion piece by yours truly on the identity of statisticians in data science. My piece starts on page 25 of the print version. The online version is here. A quote: I am not convinced that statistics is d...

Oops.
"I made an accident yesterday. Oops. pic.twitter.com/IDYmbdDwvP" (3e Labs, @3eLabs, May 7, 2015) What happened? When creating a matrix of zeros I accidentally typed matrix() instead of Matrix(). What's the difference? 4.8 terabytes versus less than one GB. I...
biasedestimates.com

Ukrain?
So, I've noticed a trend over the last few months in the blog's traffic. The vast majority of hits seem to be coming from domains ending in ".ru". Of course, they are bots. (I am heartened to see that when you aggregate URLs to sites, twitter, meetup, and ...
biasedestimates.com

Saved by plagiarism!
I am writing a paper on goodness-of-fit for topic models. (Specifically, I've derived an R-squared metric for use with topic models.) I came across this definition for goodness-of-fit in our friend, Wikipedia. The goodness of fit of a statistical model des...
biasedestimates.com

Look up
I've added a couple pages to the blog here. The about me page has a quick bio. The publications and presentations page is where I'll be putting up my bragging rights research portfolio.
biasedestimates.com