An Engagement Mechanism

  • Participate in the Pentaho Community: By using Pentaho CE, you become part of an active and engaging Community and benefit from its open source contributions.
  • Make valuable contributions: The Pentaho open source projects deliver better, faster, and more reliable products that are time-tested by the community. Developers, testers, writers, implementers, and most of all users are directly involved as a team to make high-value software contributions.

A QA Environment

  • Test it: There are no better testers than people who actually use the Pentaho CE software, reporting bugs, making suggestions, and providing direction for the projects.

An Evaluation Platform

  • Build with CE: The flexibility of Pentaho CE enables you to jump-start your development, innovate, experiment, find a solution you like, and then upgrade to Pentaho EE when you are ready for production.

  • Complete Spark Support: Pentaho is the only vendor to support Spark with all data integration steps in a visual drag-and-drop environment. Unlike other vendors who require users to build Spark-specific data integration logic – and often require Java development skills – with Pentaho you only need to design your logic once, regardless of execution engine.
  • Adaptive Execution on Big Data: Transitioning from one engine for big data processing to another often means users need to re-write and debug their data integration logic for each engine, which takes time. Pentaho’s adaptive execution allows users to match workloads with the most appropriate processing engine, without having to re-write any data integration logic.
  • Prepare Better Data, Faster: More visualizations throughout the data prep process allow users to spot-check data for quality issues and prototype analytic data, without switching in and out of tools or waiting until the very end to discover data quality problems. Now, users can interact with heat grids, geo maps, and sunbursts, as well as drill down into data sets for further exploration.
  • Integrate 3rd Party Visualizations: Leverage an easy-to-use, flexible, and fully documented API to integrate visualizations from third-party libraries such as D3 or FusionCharts.

With Pentaho Marketplace you can easily download and install plugins developed by the Pentaho Community, extending the capabilities of your Pentaho platform.


  • Discover plugins: In Pentaho Marketplace you can browse and install the latest plugins in a very simple way.
  • Submit your plugins: You can contribute plugins and let everyone benefit from your work. This is a great way to get feedback and other community members involved in your work.


Main Downloads

Pentaho's modern, simplified, and interactive approach empowers business users to access, discover, and blend data of any type and size. With a full spectrum of increasingly advanced analytics tools, from basic reports to predictive modeling, users can analyze and visualize data across multiple measures and dimensions on their own, without depending on IT.

Pentaho Data Integration (PDI), also known as Kettle, delivers powerful extraction, transformation, and loading (ETL) capabilities. You can use this stand-alone application to visually design transformations and jobs that extract your existing data and make it available for easy reporting and analysis.

How to get PDI up and running

The Report Designer is a graphical tool that generates reports from data streamed through the Data Integration engine without the need for any intermediate staging tables. You can output your reports in several formats, including PDF, Excel, HTML, rich-text file, XML, and CSV.

Pentaho Marketplace allows users to explore and test the plugins that are most relevant for them. Download and install plugins developed by the Pentaho Community, extending the capabilities of your Pentaho platform.

Design Tools

The Aggregation Designer provides a simple interface that allows you to create and deploy aggregate tables to improve the performance of your Pentaho Analysis (Mondrian) OLAP cubes.

The Schema Workbench is a visual design interface that allows you to create and test Mondrian OLAP cube schemas. You can present your data multi-dimensionally and let users select which dimensions and measures to explore, interactively drilling into and cross-tabulating the data.

Metadata Editor is a tool that simplifies your experience when creating reports, and allows you to build metadata domains and relational data models.

Big Data

Pentaho uses an abstraction layer to support a broad set of Hadoop distributions and version updates. The plugins that enable compatibility with a specific distribution are called shims. Use the link below to get a list of the currently supported distributions and versions.

Pentaho does not ship all available shims with the product, and we generally do not have to update a shim for a minor or patch version change. Shims that support older distributions, as well as new ones created after release, are available for download here. If the note says that a later version of a shim also supports your version, Pentaho recommends using the later version.
