Profile

Aaron Daubman
Attended CMU
Lives in Billerica, MA
253 followers | 279,583 views

Stream

Aaron Daubman

Shared publicly  - 
 
Distributed Consensus Reloaded: Apache ZooKeeper and Replication in Apache Kafka
Many distributed systems that we build and use currently rely on dependencies like Apache ZooKeeper, Consul, etcd, or even a homebrewed version based on Raft. Systems solving consensus at their core have been often called “consensus services”. The name “consensus service”, however, is possibly a poor choice of a name because none of those services actually exposes a way of solving consensus explicitly. If we are given a lock service, then we expe...

Aaron Daubman

Shared publicly  - 
 
Pulling Back the Curtain on Google’s Network Infrastructure

Aaron Daubman

Shared publicly  - 
 
It's unbounded data streams all the way down...
Asynchronous Distributed Snapshots for Distributed Dataflows

Aaron Daubman

Shared publicly  - 
 
Crunching Parquet Files with Apache Flink
Apache Flink is a fault-tolerant streaming dataflow engine that provides a generic distributed runtime with powerful pro…

Aaron Daubman

Shared publicly  - 
 
Solr 5.3, finally with sorting on multi-valued fields (via min/max)!
Here's an overview of some of the new features that will be in Apache Solr 5.3

Aaron Daubman

Shared publicly  - 
 
Google Cloud Dataflow and Cloud Pub/Sub grow up (have gone GA!)
Have him in circles
253 people
Joe Black's profile photo
Amy Hughes's profile photo
Nguyen Hoang Anh's profile photo
Randy Charland's profile photo
Will Davis's profile photo
Bruce Bivins (Phantom)'s profile photo
Randy May's profile photo
phalange pandora's profile photo
Santiago Paredes's profile photo

Communities

14 communities

Aaron Daubman

Shared publicly  - 
 
More details on Gelly: Graph Processing with Apache Flink
Introducing Gelly: Graph Processing with Apache Flink. 24 Aug 2015. This blog post introduces Gelly, Apache Flink's graph-processing API and library. Flink's native support for iterations makes it a suitable platform for large-scale graph analytics. By leveraging delta iterations, Gelly is able ...

Aaron Daubman

Shared publicly  - 
 
The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive-Scale, Unbounded, Out-of-Order Data Processing
The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive-Scale, Unbounded, Out-of-Order Data Processing - Akidau et al. (Google) - 2015 With thanks to William...

Aaron Daubman

Shared publicly  - 
 
Reinventing the Wheel: How companies like GE, Adobe, and Deloitte get rid of the performance review with one on ones

Aaron Daubman

Shared publicly  - 
 
The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive-Scale, Unbounded, Out-of-Order Data Processing
Venue. Proceedings of the VLDB Endowment, vol. 8 (2015), pp. 1792-1803. Publication Year. 2015. Authors. Tyler Akidau, Robert Bradshaw, Craig Chambers, Slava Chernyak, Rafael J. Fernández-Moctezuma, Reuven Lax, Sam McVeety, Daniel Mills, Frances Perry, Eric Schmidt, Sam Whittle. BibTeX ...

Aaron Daubman

Shared publicly  - 
 
How Google Invented An Amazing Datacenter Network Only They Could Create
Google with justly earned pride recently announced: Today at the 2015 Open Network Sum...

Aaron Daubman

Shared publicly  - 
 
G is for Google: I Alpha-Bet you didn't A-B-See this coming!
Education
  • CMU
    Master of Science: Software Engineering, 2009 - 2010
  • UMass Lowell
    Statistics, 2007 - 2008
  • RPI
    Bachelor of Science: Computer Science, 1999 - 2003
  • SUNY Dutchess
  • Arlington High School
  • Tabernacle Christian Academy
Basic Information
Gender
Male
Looking for
Friends, Networking
Relationship
Married
Other names
daubman, ajd, AaronD
Apps with Google+ Sign-in
  • AlphaBear
  • Monument Valley
  • The Bot Squad
  • Hitman GO
  • Great Little War Game 2
Story
Tagline
Big Data Nerd, Android Enthusiast, Husband, Father, Wannabe rocker
Work
Occupation
Data Architect
Places
Currently
Billerica, MA
Previously
Poughkeepsie, NY - Troy, NY - East Hartford, CT - Cambridge, MA - Somerville, MA
Aaron Daubman's +1's are the things they like, agree with, or want to recommend.
Distributed Consensus Reloaded: Apache ZooKeeper and Replication in Apac...
www.confluent.io

Many distributed systems that we build and use currently rely on dependencies like Apache ZooKeeper, Consul, etcd, or even a homebrewed vers

Apache Flink: Introducing Gelly: Graph Processing with Apache Flink
flink.apache.org

Introducing Gelly: Graph Processing with Apache Flink. 24 Aug 2015. This blog post introduces Gelly, Apache Flink's graph-processing API and

The Dataflow Model: A Practical Approach to Balancing Correctness, Laten...
blog.acolyer.org

The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive-Scale, Unbounded, Out-of-Order Data Processi

Asynchronous Distributed Snapshots for Distributed Dataflows
blog.acolyer.org

Asynchronous Distributed Snapshots for Distributed Dataflows - Carbone et al. 2015 The team behind Apache Flink and data Artisans are a smar

How GE, Adobe, & others get rid of the performance review
getlighthouse.com

Want to get rid of the performance review? Many of the best companies have. Read on to find out what some of the best companies do instead o

Crunching Parquet Files with Apache Flink — Medium
medium.com

Apache Flink is a fault-tolerant streaming dataflow engine that provides a generic distributed runtime with powerful pro…

The Dataflow Model: A Practical Approach to Balancing Correctness, Laten...
research.google.com

Venue. Proceedings of the VLDB Endowment, vol. 8 (2015), pp. 1792-1803. Publication Year. 2015. Authors. Tyler Akidau, Robert Bradshaw, Crai

Solr 5.3 Features
yonik.com

Here's an overview of some of the new features that will be in Apache Solr 5.3

How Google Invented an Amazing Datacenter Network Only They Could Create...
highscalability.com

Google with justly earned pride recently announced: Today at the 2015 Open Network Sum...

Composing Music With Recurrent Neural Networks
www.hexahedria.com

It's hard not to be blown away by the surprising power of neural networks these days. With enough training, so called "deep neural networks"

Apache™ Logging Services™ Project Announces Log4j™ 1 End-Of-Life; Recomm...
blogs.apache.org

5 August 2015 --The Apache Logging Services™ Project Management Committee (PMC) has announced that the Log4j™ 1.x logging framework has reac

High-throughput, low-latency, and exactly-once stream processing with Ap...
data-artisans.com

The popularity of stream data platforms is skyrocketing. Several companies are transitioning parts of their data infrastructure to a streami

Compression in Apache Kafka is now 34% faster
www.confluent.io

Apache Kafka is now 34% faster. In this post we cover how compression works in Kafka and how we improved its efficiency to optimize performa

Diving into Spark Streaming’s Execution Model
databricks.com

In this post, we outline Spark Streaming’s architecture and explain how it provides the above benefits. We also discuss some of the interest

Cassandra 3.0 materialised views in action
christopher-batey.blogspot.com

Disclaimer: C* 3.0 is not released yet and all these examples are from a branch that hasn't even made it to trunk yet. So this feature start

Spree: A Live-Updating Web UI for Spark · Hammer Lab
www.hammerlab.org

Spree: A Live-Updating Web UI for Spark. 25 Jul 2015. At Hammer Lab, we run various genomic analyses using Spark. Most Spark users follow th

Public - 2 years ago
reviewed 2 years ago
Nobody beats Anna's Taqueria for Davis Square burritos, but if you're looking for a slightly healthier option, Chipotle's burrito bowls are a great choice!
Food: Very Good | Decor: Good | Service: Good
Public - 2 years ago
reviewed 2 years ago
Some of the best burgers I have ever had! I've tried several now and have never been disappointed...
Food: Excellent | Decor: Excellent | Service: Excellent
Public - 2 years ago
reviewed 2 years ago
This location is now closed
Public - 3 years ago
reviewed 3 years ago
19 reviews
One of the better healthy-eating options in Davis Square. Also, while I'm not sure how healthy their Huevos Rancheros breakfast burrito (w/added bacon) is, I can certainly vouch for how tasty and filling it is!
Food: Very Good | Decor: Very Good | Service: Very Good
Public - 2 years ago
reviewed 2 years ago
Atmosphere: Very Good | Decor: Very Good | Service: Excellent