The Google poster that you wrote about is a few years old, and I was hoping to see a followup research paper associated with it, but I can't say that I've seen one come out. I've suspected that Google has looked at a number of the heuristics described within it, and likely implemented a few of them.
I recently wrote a blog post about a recently granted Google patent that was originally filed in 2006, which described how they might filter some blogs out of blog search, and the description included some really broad, outdated and not very good rules for deciding whether or not they would include blog posts within blog search. These included considering the number of links within a post (with too many being bad), distance of links within posts that had links too far from the start of a post not included, and presence of links pointing back to the post or to other pages on the same domain.
I followed up that post with another one that (1) had 98 external links, (2) had links throughout the post instead of just a short distance from the start of the post, and (3) had 36 named anchor links towards the start of the post that linked back to different sections of the post. All three of those would potentially keep the post from being included in Google Blog search because that post broke three of the rules from the description of that patent. The post was showing in Google's blog search sometime shortly after I posted it.
While I suspect that Google did come up with filters to keep some blog posts from appearing in Google Blog search, I don't think many of the rules described within that patent were implemented as described in the patent.
But they could have been. All three were link analysis type heuristics, and if any of them were still in use by Google, they were ones that should be retired, because they were too broad and didn't do things like consider the target of the outgoing links (in my example, 97 of the 98 links were pointed to pages at the USPTO) or even the internal ones, which were named anchor links helping to make the blog post more usable by delivering readers to sections of the post that they might find most interesting.
Another of the rules from that particular patent would potentially filter some blog posts out of Google Blog search results if they linked to videos. The patent was originally filed a number of months before Google acquired YouTube. The intent was to avoid blog posts that might link to "undesirable" content, but it didn't distinguish between the kinds of content that those videos might contain. Again, a rule that was likely too broad when described in the patent, but which probably didn't get implemented as written.
I suspect that there are other "link analysis" methods that Google may have actually implemented that may have been based upon assumptions that didn't carry out as providing the value they were intended to give, or might have been based upon circumstances that have changed.