Profile cover photo
Profile photo
Alex Holmes
101 followers
101 followers
About
Alex's posts

Post has attachment

Post has attachment
First blog post in a series where I'll look at how to bucket, multiplex and combine data in Hadoop MapReduce http://grepalex.com/2013/05/20/multipleoutputs-part1/

Post has attachment

Post has attachment
If you want to avoid the "uncompressed data not written to a terminal" error in lzop, then take a look at http://grepalex.com/2013/02/08/lzop-decompression-useless-cat/

Post has attachment
New blog post: using awk and friends with Hadoop and LZO http://grepalex.com/2013/01/17/awk-with-hadoop-streaming/

Post has attachment
An annotated guide to how the shuffle works in MapReduce http://grepalex.com/2012/09/24/map-partition-sort-spill/

Post has attachment
Wait while more posts are being loaded