Profile cover photo
Profile photo
Ben Hoyle
About
Posts

Post has attachment
Getting All the Books [from Project Gutenberg] #quickblogpost

This is a short post explaining how to obtain over 50,000 text books for your natural language processing projects. The source of these books is the excellent Project Gutenberg. Project Gutenberg offers the ability to use sync the collection of books. To…
Add a comment...

Post has attachment
Taming the Docker Blob #quickpost

Or understanding how to best use Docker. Docker is a great way to build services with modular and changeable components without borking your server / computer. I like to think of Docker containers as a system version of Python’s virtual environment – you…
Taming the Docker Blob
Taming the Docker Blob
roboticape.wordpress.com
Add a comment...

Post has attachment
Sampling vs Prediction [Or the one where I over-think some basic terms in machine learning and probably get the wrong end of the stick.]

Some things have recently been bugging me when applying deep learning models to natural language generation. This post contains my random thoughts on two of these:  sampling and prediction. By writing this post, I hope to try to tease these apart in my…
Sampling vs Prediction
Sampling vs Prediction
roboticape.wordpress.com
Add a comment...

Post has attachment
I’ve finally found out how to access UK legislation in XML format – http://www.legislation.gov.uk/developer/uris – you just add /data.xml to the end of the statute URI! E.g. – https://www.legislation.gov.uk/ukpga/1977/37/data.xml . If anyone wants to play…
Add a comment...

Post has attachment
A blogpost on converting patent search into a deep learning classification problem.

This article will look into how the process of obtaining a patent could be automated using deep learning approaches. A possible pipeline for processing a patent application will be discussed. It will be shown how current state of the art natural language…
Add a comment...

Post has attachment
Can you protect Artificial Intelligence inventions at the European Patent Office? #blogpost

In recent years there has been a resurgence of interest in machine learning and so-called “artificial intelligence” systems. Much of this resurgence is based on advances in so-called “deep learning”, neural networks with multiple layers of connections.…
Add a comment...

Post has attachment
Practical Problems with Natural Language Processing #amateur_deeplearning #blogpost

Recently I've been playing around with the last 15 years of patent publications as a 'big data' source. This includes over 4 million individual documents. Here I thought I'd highlight some problems I faced. I found that a lot of academic papers tend to…
Add a comment...

Post has attachment
Your Patent Department in 2030 #futuregazing #nlp #newblogpost

Natural Language Processing and Deep Learning have the potential to overhaul patent operations for large patent departments. Jobs that used to cost hundreds of dollars / pounds per hour may cost cents / pence. This post looks at where I would be investing…
Add a comment...

Post has attachment
Modelling Claim Language #shortpost #ipchimp #workinprogress

Playing around with natural language processing has given me the confidence to attempt some claim language modelling. This may be used as a claim drafting tool or to process patent publication data. Here is a short post describing the work in progress.…
Add a comment...

Post has attachment
Resources for (Legal) Deep Learning - [new #ipchimp blog post]

This post sets out a number of resources to get you started with deep learning, with a focus on natural language processing for legal applications.
Add a comment...
Wait while more posts are being loaded