Profile

Cover photo
Aaron Daubman
Attended CMU
Lives in Billerica, MA
259 followers|338,411 views
AboutPostsCollectionsPhotosVideos+1'sReviews

Stream

Aaron Daubman

Shared publicly  - 
 
Learning to learn by gradient descent by gradient descent
Abstract: The move from hand-designed features to learned features in machine learning has been wildly successful. In spite of this, optimization algorithms are still designed by hand. In this paper we show how the design of an optimization algorithm can be cast as a learning problem, ...
1
Add a comment...

Aaron Daubman

Shared publicly  - 
 
Stichfix's Hybrid lda2vec Algorithm: Build models for humans!
The goal of lda2vec is to make volumes of text useful to humans (not machines!) while still keeping the model simple to modify. It learns the powerful word r...
1
Add a comment...

Aaron Daubman

Shared publicly  - 
 
Cloudera Engineering open sources distributed unit testing framework cutting testing time from hours to minutes!
Cloudera Engineering has developed (and recently open sourced) a distributed unit testing framework that cuts testing time from multiple hours to just 10 minutes. Upstream unit tests are Cloudera’s first line of defense for finding and fixing software bugs, as part of a multidimensional process that also includes static/dynamic code analysis, fault injection, integration/scale/endurance testing, and validation on real workloads. However, running ...
1
Add a comment...

Aaron Daubman

Shared publicly  - 
 
Alpha: BigQuery gains SQL 2011 standard compliance with nested/repeated data extensions
https://code.google.com/p/google-bigquery/issues/detail?id=448
1
Add a comment...

Aaron Daubman

Shared publicly  - 
 
You Can Do Research Too!
You Can Do Research Too. 24 Apr 2016. I was recently discussing gatekeeping and the process of getting started in CS research with a close friend. I feel compelled to offer a note. As a practicing academic researcher, I'm personally thrilled by the degree of excitement regarding CS research ...
1
Add a comment...

Aaron Daubman

Shared publicly  - 
 
Hierarchical Temporal Memory for streaming anomaly detection in Apache Flink
1
Add a comment...

Aaron Daubman

Shared publicly  - 
 
Google shares Maglev, the software network load balancer powering GCP networking
1
Add a comment...
Have him in circles
259 people
Aaron Smith's profile photo
Wendy Jacobi's profile photo
Angel Bond's profile photo
Paul Danik's profile photo
Morgaine Fowle (de la faye)'s profile photo
David McKenna's profile photo
Tracey Greene's profile photo
Pam Labbe's profile photo
Randy Charland's profile photo

Aaron Daubman

Shared publicly  - 
 
Understanding Consensus and Paxos in Distributed Systems
A whirlwind tour on acheiving agreement in a distributed system
1
Add a comment...

Aaron Daubman

Shared publicly  - 
 
Why Apache Beam? A Google Perspective: provide an easy-to-use, but powerful model for data-parallel processing, portable across a variety of runtime platforms
Why it made sense for us to move Cloud Dataflow SDK into the Apache Beam project and provde an easy-to-use, powerful model for data-parallel processing.
1
Add a comment...

Aaron Daubman

Shared publicly  - 
 
Inside Capacitor, BigQuery’s next-generation columnar storage format
1
Add a comment...

Aaron Daubman

Shared publicly  - 
 
Capitalism excels at innovation but is failing at maintenance, and for most lives it is maintenance that matters more
Capitalism excels at innovation but is failing at maintenance, and for most lives it is maintenance that matters more
1
Add a comment...

Aaron Daubman

Shared publicly  - 
 
k-Nearest Neighbors using Quadtrees with Apache Flink
Daniel Blazevski. March 25th, 2016. Daniel Blazevski Insight Data Engineering Program Director. During the seven-week Insight Data Engineering Fellows Program experienced software engineers and recent grads learn the latest open source technologies by building a data platform to handle large, ...
1
Add a comment...
Aaron's Collections
People
Have him in circles
259 people
Aaron Smith's profile photo
Wendy Jacobi's profile photo
Angel Bond's profile photo
Paul Danik's profile photo
Morgaine Fowle (de la faye)'s profile photo
David McKenna's profile photo
Tracey Greene's profile photo
Pam Labbe's profile photo
Randy Charland's profile photo
Education
  • CMU
    Master of Science: Software Engineering, 2009 - 2010
  • UMass Lowell
    Statistics, 2007 - 2008
  • RPI
    Bachelor of Science: Computer Science, 1999 - 2003
  • SUNY Dutchess
  • Arlington High School
  • Tabernacle Christian Academy
Basic Information
Gender
Male
Looking for
Friends, Networking
Relationship
Married
Other names
daubman, ajd, AaronD
Apps with Google+ Sign-in
  • Space Grunts
  • Hitman:Sniper
  • Wayward Souls
  • LIMBO
  • Quadropus Rampage
  • Digfender
  • Knights of Pen and Paper +1
  • Rayman Fiesta Run
  • Lara Croft GO
  • Alto's Adventure
  • BADLAND
Story
Tagline
Big Data Nerd, Android Enthusiast, Husband, Father, Wannabe rocker
Work
Occupation
Data Architect
Places
Map of the places this user has livedMap of the places this user has livedMap of the places this user has lived
Currently
Billerica, MA
Previously
Poughkeepsie, NY - Troy, NY - East Hartford, CT - Cambridge, MA - Somerville, MA
Aaron Daubman's +1's are the things they like, agree with, or want to recommend.
Aaron Daubman on Twitter: "@hulu It's not having to watch ads that stink...
twitter.com

@verizon was odd to see your WinWinWin http://www.verizonwireless.com/promos/apple/ ad on @hulu just now (evening of July 1st) clearly state

[1606.04474] Learning to learn by gradient descent by gradient descent
arxiv.org

Abstract: The move from hand-designed features to learned features in machine learning has been wildly successful. In spite of this, optimiz

Introducing our Hybrid lda2vec Algorithm | Stitch Fix Technology – Multi...
multithreaded.stitchfix.com

The goal of lda2vec is to make volumes of text useful to humans (not machines!) while still keeping the model simple to modify. It learns th

Ifeanyi Ubah
ifeanyi.co

A whirlwind tour on acheiving agreement in a distributed system

Quality Assurance at Cloudera: Distributed Unit Testing - Cloudera Engin...
blog.cloudera.com

Cloudera Engineering has developed (and recently open sourced) a distributed unit testing framework that cuts testing time from multiple hou

Why Apache Beam? A Google Perspective | Google Cloud Big Data and Machin...
cloud.google.com

Why it made sense for us to move Cloud Dataflow SDK into the Apache Beam project and provde an easy-to-use, powerful model for data-parallel

You Can Do Research Too | Peter Bailis
www.bailis.org

You Can Do Research Too. 24 Apr 2016. I was recently discussing gatekeeping and the process of getting started in CS research with a close f

Innovation is overvalued. Maintenance often matters more – Lee Vinsel &a...
aeon.co

Capitalism excels at innovation but is failing at maintenance, and for most lives it is maintenance that matters more

GitHub - nupic-community/flink-htm: Distributed, streaming anomaly detec...
github.com

flink-htm - Distributed, streaming anomaly detection and prediction with HTM in Apache Flink

Planting Quadtrees for Apache Flink
insightdataengineering.com

Daniel Blazevski. March 25th, 2016. Daniel Blazevski Insight Data Engineering Program Director. During the seven-week Insight Data Engineeri

Introduction To The Apache Cassandra 3.x Storage Engine
thelastpickle.com

Introduction To The Apache Cassandra 3.x Storage Engine

Introducing GraphFrames
databricks.com

Databricks is excited to announce the release of GraphFrames, a graph processing library for Apache Spark. Read about the new library and se

Inside Libpostal - a fast, multilingual, international street address pa...
mapzen.com

Data scientist Al Barrentine introduces Libpostal, a state-of-the-art, lightning-fast library and statistical model for parsing and normaliz

Announcing Spotify Infrastructure’s Googley Future
news.spotify.com

Editor's Note: This blog post was written by Nicholas Harteau/VP, Engineering & Infrastructure-------- As a company most often associated wi

Announcing Kafka Connect: Building large-scale low-latency data pipelines
www.confluent.io

Kafka Connect, a new feature in Apache Kafka 0.9+ that makes building and managing stream data pipelines orders of magnitude easier. ALL YOU

Introducing Apache Arrow: Columnar In-Memory Analytics
dremio.com

Apache Arrow establishes a de-facto standard for columnar in-memory analytics which will redefine the performance and interoperability of mo

A Short Note on Atomicity and Ordering
blog.acolyer.org

Just a short observation to start the week this week, inspired by the All File Systems are Not Created Equal paper that we looked at last we

It was a lot nicer before they became very strict on limiting the number of visitors to 2, previously all the kids and grandkids could visit my parents when they stayed here...
Public - 2 months ago
reviewed 2 months ago
Event facilities were very nice, food and beverage service were great, and most importantly the acoustics in the meeting rooms were great.
Public - 4 months ago
reviewed 4 months ago
Nobody beats Anna's Taqueria for Davis Square burritos, but if you're looking for a slightly healthier option, Chipotle's burrito bowls are a great option!
Food: Very GoodDecor: GoodService: Good
Public - 3 years ago
reviewed 3 years ago
Some of the best burgers I have ever had! I've tried several now and have never been disappointed...
Food: ExcellentDecor: ExcellentService: Excellent
Public - 3 years ago
reviewed 3 years ago
20 reviews
Map
Map
Map
Public - 3 years ago
reviewed 3 years ago
One of the better healthy-eating options in Davis Square. Also, while I'm not sure how healthy their Huevos Rancheros breakfast burrito (w/added bacon) is, I can certainly vouch for how tasty and filling it is!
Food: Very GoodDecor: Very GoodService: Very Good
Public - 3 years ago
reviewed 3 years ago
Atmosphere: Very GoodDecor: Very GoodService: Excellent
Public - 3 years ago
reviewed 3 years ago