Profile

Cover photo
Clint Green
Works at Koverse
198 followers|321,772 views
AboutPostsPhotosVideos+1's

Stream

Clint Green

Shared publicly  - 
 
 
Now the decade of work we put into NoSQL is available to everyone using GCP.  One way it shows that we've been working on this longer than anyone else: 99% read latency is 6ms vs ~300ms for other systems.
10 comments on original post
1
1
Corey Nolet's profile photo
Add a comment...

Clint Green

Shared publicly  - 
 
My other cloud is an actual cloud
 
We're elevating cloud computing like never before
17 comments on original post
1
Add a comment...

Clint Green

Shared publicly  - 
 
It's time for the 2015 St. Baldrick's campaign.

It is the 4th year I've had the pleasure in participating in this event.  Last year, +St. Baldrick's Foundation  raised over $39 Million for childhood cancer research.
1
Add a comment...

Clint Green

Shared publicly  - 
 
 
A New Year means novel challenges. Whether you're a data science expert or just getting started, check out the National Data Science Bowl and use your skills for social good: http://bit.ly/1z7txbU
1
1
Mark Robinson's profile photo
Add a comment...

Clint Green

Shared publicly  - 
 
 
I've been evaluating Hortonworks HDP 2.2 recently.  It is still a little rough around the edges, particularly where documentation is involved.  However, I've been very impressed with the platform compared to Cloudera CDH 5.2.

I /really/ like Apache Ranger + Apache Knox for security.  I think it's a better approach than Cloudera's Sentry approach, particularly when you only need to focus on LDAP integration and don't want to mess with Kerberos.  There is still some work to do there, but it's off to a great start.

I think there needs to be better integration of components with Ambari.  For example, Ranger and Hue have to be installed manually and are not managed with Ambari.  On larger Hadoop clusters that can be a real hassle.  I'm told that Ambari integration for those components are coming in the near future (Q1 or Q2 2015), so I'm looking forward to that.

I've benchmarked Hive queries using both MR and Tez.  There are clear performance advantages to using Tez.  It's not nearly as fast as Impala. However I have queries that won't run on Impala due to memory limits that run perfectly fine with Hive.

If you haven't tried Hortonworks HDP 2.2 yet, I recommend you give it a try.  I think you'll be pleasantly surprised.

#hortonworks #hdp #hadoop  
1
Add a comment...

Clint Green

Shared publicly  - 
 
 
#hortonworks   #ipo  
The filing could end up stiffing more deserving competitors.
1
Add a comment...
In their circles
253 people
Have them in circles
198 people
Lewis John Mcgibbney's profile photo
Kapiton Ermilov's profile photo
Amit Vij's profile photo
Joel Ramirez's profile photo
Jeff Schuster's profile photo
葛庆虎's profile photo
Ed Kohlwey's profile photo
Tjeerd Veeger's profile photo
Abbey Bukhari's profile photo

Clint Green

Shared publicly  - 
 
Make #analytics faster on striped HDDs, SSDs, RAM? Those problems are solved...

Try L1/L2/L3 cache optimization.

#spark #BigData
 
Code generation + Cache-aware computation + memory management = Apache Spark closer to bare metal http://goo.gl/nxiepH  
View original post
1
Add a comment...

Clint Green

Shared publicly  - 
 
You got HDFS in my Datacenter OS!

You got Mesos in my Distributed Filesystem! 
1
2
Mark Robinson's profile photoCorey Nolet's profile photo
Add a comment...

Clint Green

Shared publicly  - 
 
Great new post on how +Koverse helps you take on speed and scale.   #bigdata   #realtime  
Introduction The challenges presented by big data are becoming more widely recognized; there exists more massive, diverse data than ever before and that data is being generated at extreme velocities.  It comes in the form of traditional and internet-scale data, structured and unstructured data, static and streaming data, and so on.  It is increasingly becoming the case that businesses need smart big data processes to facilitate timel...
2
Add a comment...

Clint Green

Shared publicly  - 
 
Great presentation on stream processing and Millwheel from +Tyler Akidau

Learn about approaches to batching, and the costs of unnecessary system complexity. 
1
Add a comment...

Clint Green

Shared publicly  - 
 
 
Linus Torvalds: "The only place where parallelism matters is in graphics or on the server side ... Give it up. The whole 'parallel computing is the future' is a bunch of crock" http://goo.gl/yBCVLr
Jouni Osmala (josmala.delete@this.cc.hut.fi) on December 8, 2014 1:10 pm wrote: > > I'm assuming that 90+% of programs already run fast enough and they don't matter for this. > Its all about asking question in what use current computers are too slow , and can you parallerize ...
1
Add a comment...

Clint Green

Shared publicly  - 
 
These photos have so much space, and so few people, it has a very Player Piano feel to it.
 
Facebook shares photos of its massive new data center, the first in its fleet to feature an entirely new network architecture. #Facebook   #DataCenter  
1
Chad Richardson's profile photo
 
There is one person that works there and he changes his sweatshirt to give the appearance of two.  Interesting fact this also doubles the population of Altoona, IA.
Add a comment...
People
In their circles
253 people
Have them in circles
198 people
Lewis John Mcgibbney's profile photo
Kapiton Ermilov's profile photo
Amit Vij's profile photo
Joel Ramirez's profile photo
Jeff Schuster's profile photo
葛庆虎's profile photo
Ed Kohlwey's profile photo
Tjeerd Veeger's profile photo
Abbey Bukhari's profile photo
Collections Clint is following
Links
Other profiles
Story
Introduction
I am a DC-area Cloud Engineer, specializing in solutions built around Big Data, Hadoop and NoSQL.
Work
Occupation
Koverse, Inc: Big Data, Massive Compute, Actionable Intelligence
Employment
  • Koverse
    present
    www.koverse.com
Clint Green's +1's are the things they like, agree with, or want to recommend.
Google +1 Button
chrome.google.com

+1 and share a web page, anywhere you go on the web.

Namenode HA Reaches a Major Milestone | Hortonworks
hortonworks.com

We reached a significant milestone in HDFS: the Namenode HA branch was merged into the trunk. With this merge, HDFS trunk now supports HOT

Movember United States
www.movember.com

Movember, the month formerly known as November, is a moustache growing charity event held during November each year that raises funds and aw

Apache Launches Hadoop 1.0 - Linux and Open Source - News & Reviews
www.eweek.com

The Apache Software Foundation delivers Hadoop 1.0, the much-anticipated 1.0 version of the popular open-source platform for storing and pro

Kinect sensor positioning? - Kinect
www.seasonedgamers.com

Kinect sensor positioning? - posted in Kinect: Let me start by saying that my Kinect is still in the box. But I've been trying to think of w

cloud computing: Data Scientists Should Be Design Thinkers
cloudcomputing.blogspot.com

But as DJ Patil said in “Building Data Science Teams,” the best data scientists are not statisticians; they come from a wide range of scient

Strata Online Conference: Data Warfare
oreil.ly

Duration: Approximately 120 minutes. Cost: Free From public policy to elections, from healthcare to the battlefield, our lives rely on the a

MapReduce Design Patterns
shop.oreilly.com

This handy guide brings together a unique collection of valuable MapReduce patterns that will save you time and effort regardless of the dom

Google introduces Compute Engine, Google-scale Linux virtualization
arstechnica.com

New infrastructure-as-a-service joins the App Engine PaaS offering.

SWAT team throws flashbangs, raids wrong home due to open WiFi network
arstechnica.com

Whoops! Those anonymous Internet threats came from up the block.

Native JSON features submitted for PostgreSQL 9.2
arstechnica.com

Patches have been submitted for PostgreSQL 9.2 that introduce a native JSON type and support for returning JSON-formatted query results.

Why Real-Time Analytics? [Free White Paper] | Infochimps Blog
blog.infochimps.com

Updated daily, Monday - Friday. Chock full of big data insights, news and tips straight from the Data Mine.

Paper.li – Be a publisher
paper.li

Turn Twitter, Facebook and RSS feeds into online newspapers in just a few clicks.

100 Years of ships
www.youtube.com

Visualization of 18th and 19th-century shipping routes. More info at http://sappingattention.blogspot.com/2012/04/visualizing-ocean-shipping