Archive for the ‘kettle’ Category
Open Source Metrics and Benchmarks
Posted in ETL, Talend, kettle, tagged ETL benchmarks, PDI 3.0, WaveMaker on October 30, 2008 | 11 Comments »
Marc Russel’s blog links to a Manapps ETL benchmark report comparing the performance of several leading ETL tools, both proprietary (DataStage and Informatica) and open source (Talend and PDI, aka Kettle). As would be expected, each tool has its own strengths and weaknesses, but one thing stands out: the venerable Kettle ETL, aka PDI 3.0, is now [...]
New universal SQLite JDBC library.
Posted in ETL, Java, SQLite, Talend, kettle, news, tagged JDBC, universal, zentus.com on July 21, 2008 | No Comments »
Both Talend (Java) and Kettle distribute the Zentus.com pure-Java SQLite JDBC driver, and for most purposes this run-anywhere version is fine. But if you really need to take advantage of SQLite’s speed, then connecting using the native JNI version is a must. Doing this was easy enough: just change over to using a generic JDBC [...]
Regular Expressions as an end-user programming tool?
Posted in ETL, Talend, excel, kettle, tagged regex, regular expressions on July 1, 2008 | 2 Comments »
“What? Have you completely lost the plot, Gleeson?”, I hear you scream. Jamie Zawinski’s famous quote is intoned once more ..
Some people, when confronted with a problem, think
“I know, I’ll use regular expressions.” Now they have two problems.
Of course the above quote could be (and probably has been) changed to…
Most business people, when confronted with [...]
SQLite - the ultimate data-smithing tool!
Posted in AmazonAWS, ETL, SQLite, Talend, data, excel, kettle, tagged Amazon SimpleDB, Microsoft Access on April 26, 2008 | 1 Comment »
Although my data-smithing tool box is full to the brim with powerful tools such as Talend, Kettle PDI, Picalo and Excel, all backed by the cloud infrastructure of Amazon’s S3, SimpleDB and EC2, there’s one simple yet powerful tool that I always seem to gravitate back to: SQLite.
Now obviously being [...]
Dublin Bus and PALO ETL - the connection!
Posted in AmazonAWS, BI, ETL, Palo, S3, SQLite, SimpleDB, Talend, VBA, excel, kettle, olap, tagged Dublin, Dublin Bus, hmac, sha1, sha1hmac on January 26, 2008 | 5 Comments »
Dublin buses, as is the norm with most road-based public transport systems in our increasingly car-choked cities, tend to operate on the basis of “no sign of a bus for ages, then two or three arrive at the same time”. Palo MOLAP ETL options appear to be following the same pattern; we’ve been waiting for [...]
PALO ETL-Server, first sighting …
Posted in ETL, Palo, Talend, kettle, tagged HSQLDB, IMPPalo, Palo ETL-Server on December 6, 2007 | 1 Comment »
I was wrong. I figured Jedox would build their new ETL server on one of the existing open source ETL project code-bases, either Talend or Pentaho’s Kettle. Instead, the new alpha ETL server code which has just been uploaded to SourceForge is based on neither and appears to have been developed by another [...]
New ETL platform for PALO OLAP
Posted in BI, ETL, Palo, Talend, kettle, olap, tagged Jedox, Drill-down, drill-back, Mondrian on November 28, 2007 | No Comments »
Jedox have announced that they intend to ship a Palo-centric open source ETL server product early next year. This is excellent news and comes on top of the new rules engine that was added to Palo this summer. Open source MOLAP has suddenly taken off the training wheels and is getting ready [...]
Using the latest Pure Java SQLite JDBC driver in Kettle
Posted in ETL, Java, SQLite, kettle, tagged JDBC, out of memory on October 5, 2007 | 3 Comments »
The bug in the pure Java SQLiteJDBC driver that caused an “out of memory” error when trying to connect to a SQLite database using standard Windows drive letters (e.g. c:\kettle\mydata.db) is now fixed. The current version (V037) has also been updated to SQLite version 3.4.2. To use the latest driver within Kettle, download [...]
Google Spreadsheets - ETL tool
Posted in BI, ETL, GoogleApps, RSSBus, Ruby, SQLite, Talend, VBA, Web2.0, excel, kettle, tagged google on September 6, 2007 | 1 Comment »
Although I’m a total Excel fanboy, I must admit I rarely use it any longer for personal stuff such as home budgets, tax calculations, what-ifs, to-do lists etc.; I now tend to use Google Spreadsheets. Likewise, personal notes, drafts and useful bits of code are stored using Google Docs rather than MS Word. [...]
Apatar - a few extracts short of a load
Posted in BI, ETL, Proto, RSSBus, SQLite, Talend, data, excel, kettle, tagged AmazonAWS on July 12, 2007 | 4 Comments »
I’ve been meaning to try out the Apatar ETL/Mashup tool for some time, and today being yet another rainy day in this, the worst Irish summer that I can remember (and Irish summers are not renowned for their lack of rainfall), I decided to try it out. Not impressed, I’m afraid; comes up [...]
Google Gears - SQLite Killer App
Posted in ETL, JavaScript, Proto, SQLite, excel, kettle on May 31, 2007 | 1 Comment »
The announcement of Google Gears is of course a game changer for those working in the development of online apps; its addition to Google Reader alone would make it worthwhile for me, and I’m sure we’ll see it integrated into Google Docs and GMail in the near future. If you had any plans [...]
Talend vs. Kettle (Pentaho PDI)
Posted in BI, ETL, Java, JavaFX, JavaScript, Palo, Ruby, SQLite, Talend, kettle, xLite, tagged update on May 27, 2007 | 5 Comments »
Over the last few weeks I’ve received a lot of traffic from Google searches comparing Talend and Kettle, and also from Vincent McBurney’s ITtoolbox article comparing the two products, so where do I stand?
As ETL tools they take different approaches: Kettle is a metadata-driven framework (which is in turn tightly integrated into an [...]
I’ve got talend and I’m going to use it…
Posted in BI, ETL, Java, Palo, SQLite, Talend, data, excel, kettle, olap, xLite on April 30, 2007 | 1 Comment »
For the last few months I’ve been looking for my ideal ETL platform. That ideal would be open source, platform independent (well, at least Windows and Linux), flexible, and easily deployable. It had looked like a combination of Kettle and my micro-ETL combinations of Ruby/SQLite and Excel/SQLite would be the eventual “winners”. [...]
Talend ETL - A New Contender
Posted in BI, ETL, SQLite, Talend, kettle, news, olap on April 26, 2007 | 4 Comments »
Talend have released a new version of their Open Studio ETL tool. Not as full-featured as Pentaho Kettle; it only supports a limited number of databases and file formats, with no SQLite support, shock-horror! The press release promises “More than 100 Native Connectors” as well as connectors to ERP and CRM tools but [...]
New software - Pentaho Kettle 2.5 RC1 and IMP:Palo
Posted in BI, ETL, Palo, data, excel, kettle, news on April 21, 2007 | 3 Comments »
I’ve spent a few hours trying out the latest Kettle 2.5.0 RC1 release candidate: new UI and lots of new features. It looks like the PALO code developed by 3a-strategy will not make it into this release, but I see Cubeware have released IMP:PALO cube loading software, offering both a free and a premium [...]
VBA & JavaScript - glue languages
Posted in JavaScript, Proto, excel, kettle, programming, tagged zimki, AmazonAWS on March 22, 2007 | 2 Comments »
What have JavaScript and VBA in common? Not much on the surface, and their respective user bases rarely if ever overlap. What they do share are their roles as the imperative (the-if-then-else-loop-etc) programming languages of the “I’m not a programmer” programmers, the great unwashed, the “normal” people out there who [...]
PALO plugin for Kettle (Pentaho) ETL
Posted in ETL, Palo, kettle on February 7, 2007 | 2 Comments »
The much awaited Palo plugin for the Kettle ETL tool has been released. Oh happy days!
Palo is an open source MOLAP database developed by the German company Jedox. Although it doesn’t match the power of established OLAP engines such as Essbase and many simple cross-tab/pivot requirements can be handled by an Excel [...]
Kettle and SQLite
Posted in ETL, SQLite, kettle, tagged JDBC, out of memory, unix on January 20, 2007 | 1 Comment »
Matt Casters has added SQLite support to Pentaho’s Kettle ETL tool in the latest development release. I’ve tested it under Windows using JRE 1.5.0_09 and it worked fine, but having upgraded to JRE 1.5.0_10 I’m now getting “out of memory” errors; it appears to be a problem with the “pure java” jdbc driver [...]
SQLite JDBC and Kettle (Pentaho Data Integration) ETL
Posted in ETL, kettle on December 18, 2006 | 17 Comments »
I’ve been a big fan of SQLite for several years now. Although I come from an Oracle database background, I find for day-to-day data smith’ing SQLite is ideal. Combine it with the expressive power of Ruby and you have a very powerful micro-ETL environment.
I’m also [...]