Profile

Cover photo
Matt Burgess
131 followers|250,783 views
AboutPostsPhotosVideos

Stream

Matt Burgess

Shared publicly  - 
 
Command-line utility for PDI Marketplace (using Spoon?!)
The PDI Marketplace is a great way to extend the capabilities of your PDI installation, using excellent contributions from the community, and some less-excellent ones from yours truly ;)  At present, the Marketplace is a core PDI plugin (meaning it is not i...
The PDI Marketplace is a great way to extend the capabilities of your PDI installation, using excellent contributions from the community, and some less-excellent ones from yours truly ;)  At present, the Marketplace is a core...
1
Add a comment...

Matt Burgess

Shared publicly  - 
 
Apache Pig UDF: Call a PDI transformation
For my latest fun side project, I looked at the integration of Pentaho Data Integration (PDI) and Apache Pig .  From the website: "Apache Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis pro...
For my latest fun side project, I looked at the integration of Pentaho Data Integration (PDI) and Apache Pig.  From the website: "Apache Pig is a platform for analyzing large data sets that consists of a high-level language f...
3
Add a comment...

Matt Burgess

Shared publicly  - 
 
I've spent quite a bit of time looking at Pentaho Data Integration (aka Kettle) and trying to make it do things with external technologies and idioms, anywhere from Groovy, Drill, memcached, Redis, Hazelcast, and even Markov ...
1
Add a comment...

Matt Burgess

Shared publicly  - 
 
ZooKeeper Input and Output steps in PDI
While working with Apache Drill and PDI (see previous posts), I found myself needing to read and write values to and from Drill's ZooKeeper instance. Since ZooKeeper can be (and is) used for many other applications besides Drill, I thought I'd write some si...
While working with Apache Drill and PDI (see previous posts), I found myself needing to read and write values to and from Drill's ZooKeeper instance. Since ZooKeeper can be (and is) used for many other applications besides Dr...
1
Add a comment...

Matt Burgess

Shared publicly  - 
 
Flatten JSON to key-value pairs in PDI
I've heard a number of comments regarding JSON and PDI, most of them having to do with difficulties parsing nested documents, using JSONPath, etc.  Personally, I've had a JSON doc I'd like to fetch fields from but I didn't want to try to figure out the JSON...
I've heard a number of comments regarding JSON and PDI, most of them having to do with difficulties parsing nested documents, using JSONPath, etc.  Personally, I've had a JSON doc I'd like to fetch fields from but I didn't wa...
2
Add a comment...

Matt Burgess

Shared publicly  - 
 
Groovy Datasources with Pentaho Report Designer
Ok, so this blog is called "Fun with Pentaho Data Integration", but I recently fielded a question about using scriptable data sources in Pentaho Report Designer (PRD), and rather than start a whole new blog, I thought I'd just post it here. The techniques a...
1
Add a comment...
Have him in circles
131 people
Roland Bouman's profile photo
Dilip Ladhani's profile photo
Vonna w's profile photo
Matt Casters's profile photo
Điền Bá Quang's profile photo
Lindsay Scherr Burgess's profile photo
Yvette Wilde's profile photo
Bart Maertens's profile photo
Joel Latino's profile photo

Matt Burgess

Shared publicly  - 
 
Using AppleScript with PDI SuperScript
In a previous blog post , I announced my SuperScript step for PDI, which adds and enhances some capabilities of the built-in Script step.  One notable addition is the ability to use AppleScript on a Mac, as the AppleScript script engine comes with the Mac J...
In a previous blog post, I announced my SuperScript step for PDI, which adds and enhances some capabilities of the built-in Script step.  One notable addition is the ability to use AppleScript on a Mac, as the AppleScript scr...
1
Add a comment...

Matt Burgess

Shared publicly  - 
 
SuperScript PDI plugin
As readers of my blog know, I'm a huge fan of scripting languages on the JVM (especially Groovy), and of course I'm a huge fan of Pentaho Data Integration :)  While using the (experimental) Script step to do various things, I saw a few places where a script...
As readers of my blog know, I'm a huge fan of scripting languages on the JVM (especially Groovy), and of course I'm a huge fan of Pentaho Data Integration :)  While using the (experimental) Script step to do various things, I...
1
Add a comment...

Matt Burgess

Shared publicly  - 
 
How sorted (or sordid) is your data?
I've spent quite a bit of time looking at Pentaho Data Integration  (aka Kettle) and trying to make it do things with external technologies and idioms, anywhere from Groovy , Drill , memcached , Redis , Hazelcast , and even Markov Chains . Recently though, ...
I've spent quite a bit of time looking at Pentaho Data Integration (aka Kettle) and trying to make it do things with external technologies and idioms, anywhere from Groovy, Drill, memcached, Redis, Hazelcast, and even Markov ...
1
Add a comment...

Matt Burgess

Shared publicly  - 
 
Scripting Extension Points in PDI
PDI Extension points are an awesome feature added to PDI 5.0 (and updated throughout 5.x) that allow you to hook into the operational aspects of your ETL processes to provide finer-grained control, optimizations, additional auditing/logging, or whatever you...
4
1
Andrea Torre's profile photo
Add a comment...

Matt Burgess

Shared publicly  - 
 
List Zookeeper Nodes and Data with Groovy
Here's a quick Groovy script to recursively list Zookeeper nodes (and optionally, data), also on Gist here .  What does this have to do with PDI, you may ask?  Stay tuned ;) @Grab('org.apache.zookeeper:zookeeper:3.4.6')

import org.apache.zookeeper.*
import...
1
Add a comment...

Matt Burgess

Shared publicly  - 
 
Using Apache Drill with PDI
One of the non-Pentaho side projects I've become interested in is Apache Drill , I like all the different aspects of it and hope to contribute in some meaningful way shortly :) As a first step, without touching any code, I thought I'd see if I could configu...
One of the non-Pentaho side projects I've become interested in is Apache Drill, I like all the different aspects of it and hope to contribute in some meaningful way shortly :) As a first step, without touching any code, I tho...
2
Add a comment...
People
Have him in circles
131 people
Roland Bouman's profile photo
Dilip Ladhani's profile photo
Vonna w's profile photo
Matt Casters's profile photo
Điền Bá Quang's profile photo
Lindsay Scherr Burgess's profile photo
Yvette Wilde's profile photo
Bart Maertens's profile photo
Joel Latino's profile photo
Basic Information
Gender
Male
Story
Tagline
Yep.
Introduction
Yep.
Links
Contributor to