Profile

Cover photo
Felipe Hoffa
Works at Google
3,228 followers|1,031,959 views
AboutPosts

Stream

Felipe Hoffa

Shared publicly  - 
 
 
Google Genomics (http://goo.gl/q4ot5G) has released a dataset that allows you to see how genetic variation is shared among individuals in 26 populations across the world. A Google Dataflow (https://goo.gl/DBRU1W) pipeline computed an analysis on over 5 trillion pairs of variants for each of 31 population groupings of the 2,504 individuals in the 1000 Genomes data set.

To learn more about how Google technologies can be used to to study genomic data, check out the new Linkage Disequilibrium (https://goo.gl/p4Q2bx) data sets at http://goo.gl/A4Od2D and the Jupyter (iPython) notebooks at http://goo.gl/xLh6Sk.
1 comment on original post
4
1
Add a comment...

Felipe Hoffa

Shared publicly  - 
 
I had a great time presenting in Hanoi and HCMC!
9
Add a comment...

Felipe Hoffa

Shared publicly  - 
 
All of Hacker News now on BigQuery!

https://github.com/fhoffa/notebooks/blob/master/analyzing%20hacker%20news.ipynb

Time to analyze - 8 million comments, 2 million posts.
6
2
Add a comment...

Felipe Hoffa

Shared publicly  - 
 
 
Simple but very powerful new feature: write your own functions that can be used in BigQuery queries like built-in ones.
1 comment on original post
3
Add a comment...

Felipe Hoffa

Shared publicly  - 
 
All the NYC Taxi Trips: Now officially shared by the NYC TLC, up-to-date (June 2015) data

https://www.reddit.com/r/bigquery/comments/3fo9ao/nyc_taxi_trips_now_officially_shared_by_the_nyc/
8
3
Add a comment...
Have them in circles
3,228 people
Dario Grünewald's profile photo
Christopher Wuthnow's profile photo
Abhay Kashyap's profile photo
maistro chair's profile photo
Franklin Quispe Garcés's profile photo
Dina Loreto Langenbach sepúlveda's profile photo
Felipe Santos's profile photo
Christian Henriquez's profile photo
andres marchant's profile photo

Felipe Hoffa

Shared publicly  - 
 
 
Detailed and informative Cloud comparison article. Conclusion: "Everyone involved unanimously picked GCP. It came down to this: we believe the core technology is better."

One of my favourite quotes: "It's hard to overemphasize how much friendlier GCP pricing is, and how poorly the AWS model works. Amazon's rhetoric for why you should run cloud vs self-hosted infrastructure is the additional agility you get; You can elastically spin up and shut down capacity whenever you want. Reserved Instance pricing effectively negates this advantage, and running all your instances On-Demand is extremely expensive in comparison. So you're left with the decision to over-spend or lock yourself in."
1 comment on original post
4
Add a comment...

Felipe Hoffa

Shared publicly  - 
 
 
The next generation of applications won’t just store images. They’ll actually understand them thanks to models like the one we’re releasing today. Our Cloud Vision API uses machine learning to classify images into thousands of categories, detect objects (and facial features), and even identify language. We’re excited to see what you build with it: http://goo.gl/RfEFv0
7 comments on original post
10
2
Gg Ggh's profile photo
Gg Ggh
 
Lablanche & company price: 1 billion of dollards
Add a comment...

Felipe Hoffa

Shared publicly  - 
 
 
17 governments adopted the new International Open Data Charter
http://opendatacharter.net/

US and Japan did not adopt :(
http://opendatacharter.net/adopted-by-countries-and-cities-2/

The Open Data Charter: A Roadmap for Using a Global Resource
http://www.huffingtonpost.com/joel-gurin/the-open-data-charter-a-r_b_8391470.html

1. Open by Default. Making government data "open by default," as many governments are starting to do, is a paradigm shift for how we treat government information. Under typical freedom-of-information laws, which have been enacted in more than 100 countries, citizens have the right to ask the government to give them documents and data. But to exercise this right, you have to know what you are looking for, where to find it, and how to submit a request. "Open by default" cuts through that cumbersome process by making government information automatically available to everyone, with exceptions for information that could breach privacy protections or threaten national security.

2. Timely and Comprehensive. Many governments are experimenting with new ways to make their data more timely and thus more valuable. They're supplementing their own data collections with data provided by citizens through crowdsourcing, data from mobile phone companies, and other sources. This gives them real-time data that can be used to analyze traffic and commuting patterns, track the spread of infectious disease, flag spikes in food or commodity prices, and more.

3. Accessible and Usable. Some 45 countries have launched centralized portals for their data, making it much easier to find, access, and use. The EU is building a centralized data portal for its members as well. In addition, government agencies are increasingly releasing data in machine-readable form, meaning that it can be put directly into a computer for analysis. This is a major improvement from keeping public data in paper files in government offices -- or even posting copies of those documents online, which makes them more accessible but still very hard to analyze.

4. Comparable and Interoperable. Data has a multiplier effect: Each dataset becomes much more valuable if it can be combined with others (meaning that those datasets are "interoperable"). Real estate websites, for example, need data in comparable formats to be able to put together a full picture of a neighborhood including housing prices, crime rates, school quality, access to public services, and other factors. On a more complex level, interoperability can make it possible to analyze the relationship between climate change and health trends, integrate data on federal spending from many different agencies, or analyze a host of factors that can impact national security. While interoperability is still a major challenge, there's growing agreement that international data standards -- particularly for widely used sources like geospatial data -- can open up new opportunities to combine or compare datasets for new insights.

5. For Improved Governance and Citizen Engagement. Open government data is advancing the cause of good government in many parts of the world. For example, Brazil's Transparency Portal, which launched with modest goals in 2004, now helps citizens oversee more than $12 trillion in federal spending -- everything from the funding of the World Cup to elected officials' credit card records. A growing global movement toward open contracting is now working to make government procurement open and transparent, to fight corruption and improve government efficiency.

6. For Inclusive Development and Innovation. The new Sustainable Development Goals, adopted at the U.N. General Assembly in September, set out an ambitious agenda for worldwide development and innovation over the next 15 years. The 17 Goals set targets for ending poverty, fighting hunger, achieving gender equality and improving the lives of the world population in many other profound ways. As my colleagues and I wrote recently , open data has a role to play in each of these goals and can be a powerful resource for governments working to achieve them.

#ogp15
View original post
11
3
Add a comment...

Felipe Hoffa

commented on a post on Blogger.
Shared publicly  - 
 
BigQuery JavaScript User Defined Functions finally available for everyone! Also see a video we taped last year while the BigQuery JS UDF were being developed: https://www.youtube.com/watch?v=6TYA6hy44Jo
1
Felipe Hoffa's profile photo
 
Also released plenty of new features (UDFs, GCS file reading, > quota, UI, no more EACH, speed, slots, high compute pricing)

http://googlecloudplatform.blogspot.com/2015/08/Google-BigQuery-adds-UDF-support-for-deeper-cloud-analytics.html
Add a comment...

Felipe Hoffa

Shared publicly  - 
 
All the NYC Taxi Trips: Now officially shared by the NYC TLC, up-to-date (June 2015) data

https://www.reddit.com/r/bigquery/comments/3fo9ao/nyc_taxi_trips_now_officially_shared_by_the_nyc/
1
Add a comment...
People
Have them in circles
3,228 people
Dario Grünewald's profile photo
Christopher Wuthnow's profile photo
Abhay Kashyap's profile photo
maistro chair's profile photo
Franklin Quispe Garcés's profile photo
Dina Loreto Langenbach sepúlveda's profile photo
Felipe Santos's profile photo
Christian Henriquez's profile photo
andres marchant's profile photo
Work
Occupation
Developer Advocate
Employment
  • Google
    Developer Advocate, 2013 - present
  • Google
    Software engineer, 2011 - present
Story
Tagline
From Chile. Working at Google. Living in San Francisco.
Bragging rights
"I find a duck's opinion of me is very much influenced by whether or not I have bread." --MH