Data Integration - Kettle

Data Integration (or Kettle) delivers powerful Extraction, Transformation, and Loading (ETL) capabilities, using a groundbreaking, metadata-driven approach.

Downloads

Downloads

Reliable Service Architecture

With an intuitive, graphical, drag and drop design environment and a proven, scalable, standards-based architecture, Data Integration is increasingly the choice for organizations over traditional, proprietary ETL or data integration tools.


Frequently Asked Questions

When I start Spoon.bat in a Windows environment nothing happens. How can I solve it?

Edit the Spoon.bat file and:

  1. Replace in the last line "start javaw" with only "java"
  2. Add a "pause" in the next line
  3. Save and try it again.

How to use JNDI?

If you look inside the PDI main directory you'll see a sub-directory called "simple-jndi", which contains a file named "jdbc.properties". You should change this file so the JNDI information matches the one you use in your application server.

Can I sequence transformations?

This is not possible, one of the basic things in PDI transformations is that all of the steps run in parallel. So you can't sequence them. This would require architectural changes to PDI and sequential processing would also result in very slow processing.

What's the difference between transformations and jobs?

Transformations are about moving and transforming rows from source to target. Jobs are more about high level flow control: executing transformations, sending mails on failure, ftp'ing files, ...

How to use database connections from repository?

Create a new transformation (or job) or close and re-open the ones you have loaded in Spoon.

How to do a database join with PDI?

Create a new transformation (or job) or close and re-open the ones you have loaded in Spoon.

New Features

1. Improved system performance monitoring

Minor bug fixes to the PDI-specific portions of the Pentaho.

2. Data profiling enhancements

Data Profiling Perspective includes DataCleaner: Analyze Tables and Columns in preparation for ETL.

3. Easily add new plugins

The PDI Marketplace makes it possible to share and download new plugins.

4. Deliver data from multiple data sources

The Data Services and Kettle JDBC driver enable you to deliver data from multiple data sources, while enriching, cleansing, and transforming the data.

5. Data movement load balancing

PDI provides load balancing of data within transformations and over multiple cluster nodes when using transformation clustering.

6. Revert changes in job database transactions

Database connections can be used with all jobs. This enables commits and rollbacks on a job level. Prior to this release, this was only possible with transformations.

How to contribute

The open Source delivers better, faster and reliable products, empowered by an active and wider community. Developers, testers, writers, implementers, and most of all users can make valuable contributions.

Here’s your guide to submitting just about any contribution to the Pentaho project. If you don’t find the answers you need here, please post your question to our Forums.

Places to contribute

There are two primary ways to make sure that your contributions are recognized and reviewed in a timely fashion: through our Discussion Forums and through our issue and bug tracking system, JIRA.
All bug reports are recorded and tracked through our JIRA issue and bug tracking system.
We rely on time and code contributions from our community (and we'll never turn down money or beer) to keep our commitment of delivering a quality Business Intelligence platform in the open source scenario.
Both bug fixes, new features and improvements are types inputs in our JIRA system, allowing you to choose the type appropriated to your case.
Solutions should be submitted through the new feature type case in our JIRA system. Complete the case as described for other code contributions, then attach your solution as an additional file to the case.
Whether you're a developer who's implementing the platform or a business analyst who needs to solve a particular problem while using it, your suggestions are valuable! We encourage any community member to share its needs with us, as long as they're related to Business Intelligence software and the problems addressed by BI.
If you'd like to contribute documentation improvements, or submit a technical article, you can do so in the Pentaho Documentation Wiki.
The place to start with a language specific contribution is to look under our International Forums for the language you're interested in. Internationalization efforts are coordinated within these forums among the community members that are most experienced with each language.
The Pentaho team is building an automated platform test suite and submission protocol because, well, frankly, it's a damn good idea. Until that suite is completed, if you're willing to contribute with time and resources for quality assurance testing, please contact us. We will send you a matrix spreadsheet where you'll identify the environment, test variables in play and submit your results back to us.
If you want to get involved, have the time or resources to commit, and are not sure where to dig in, send us an email with a description of the resources you have and the commitment you can make, and we'll be happy to hook you up with a Pentaho team member to coordinate the best use of your time.
Any good community member knows a decent meal (or even a handful of corn chips) goes a long way toward increasing developer productivity. Well, it at least improves their outlook anyway?

If you would like to submit a contribution of beer, snacks, soda, beer, chocolate, pizza or pretty much anything else edible (like beer), send your package to:

Pentaho Corporation
5950 Hazeltine National Drive, Suite #340
Orlando, FL 32822
While we do not condone frivolous gifts or taking bribes, we are not above such gestures either. If there is a certain body of work that, when completed, makes your life easier, and you have a small yacht surprisingly available for a weekend, we can definitely talk. Contact us with your offer, and we can surely find SOMEONE, a Pentaho team member or a community member, that can help you out!

Download

Data Integration 5.2.0 Stable
Description

Pentaho’s Data Integration, also known as Kettle, delivers powerful extraction, transformation, and loading (ETL) capabilities.


Looking for a different version? Find it here.

Help us improve this website. Send your feedback to community@pentaho.com