Pentaho Delivers First Complete Data Integration and BI Suite for Hadoop

Available for download, Pentaho for Hadoop lowers Big Data onramp with easy-to-use, affordable ETL and analytics offering

October 12, 2010, Hadoop World, NYC —

Pentaho Corporation, the open source business intelligence (BI) anddata integration leader, today announced download availability of Pentaho Data Integration for Hadoop and the Pentaho BI Suite for Hadoop. More and more enterprises are turning to Hadoop to reduce costs and improve their ability to extract actionable business insight from the vast amount of data being collected throughout the enterprise. With these releases, Pentaho addresses the biggest challenges experienced by users of Hadoop – steep technical learning curve, a lack of qualified technical staff and the lack of availability of development and deployment applications for performing data integration and business intelligence with Hadoop. Pentaho makes Hadoop easy.

Product Summary

Pentaho Data Integration (PDI) for Hadoop provides a zero-programming, graphical design environment enabling organizations to:

  • Easily manage how data is moved into and out of Hadoop;
  • Coordinate, execute and schedule Hadoop tasks in the context of existing ETL and BI workflows;
  • Design and execute massively scalable ETL jobs in Hadoop using the 200+ out-of-the-box ETL steps;
  • Harness the power of Hadoop, regardless of preference for on-premise or cloud based deployment, through tight integration with all common Hadoop distributions including Amazon Elastic MapReduce, Cloudera Distribution for Hadoop (CDH) and Apache Hadoop.

The Pentaho BI Suite for Hadoop, which includes PDI for Hadoop and all the benefits detailed above, empowers organizations to:

  • Perform production, operational and batch reporting against the full set of data in Hadoop using Hive,
  • Provide Ad hoc reporting against data in Hadoop with zero knowledge of Hadoop or SQL,
  • Spin off high performance data marts in minutes for interactive analysis and dashboarding using Pentaho Agile BI.

A collaborative effort from both Pentaho Corporation and the Pentaho community, the Pentaho Data Integration and BI Suite for Hadoop was the subject of an extensive beta program that involved both Pentaho community members and commercial customers. It is the first solution to address user needs for ETL and BI applications that make Hadoop easier to use for Big Data analytics. In addition to enhancing the Pentaho BI Suite, Pentaho also contributed numerous improvements to a number of open source projects in the Hadoop ecosystem including Apache HIVE and VFS.


"The need for big data technology like Hadoop is clear, present and expanding but there are some barriers to widespread adoption," said Richard Daley, founder and CEO of Pentaho Corporation. "Pentaho is breaking down these barriers with our latest releases, thereby helping increase the overall adoption rate and success of Hadoop in the market."

"Pentaho just lowered the onramp to Big Data analytics by making it easier and more affordable for companies to get up and running with Hadoop," said Shawn Rogers, Vice President Research, Business Intelligence, at analyst firm Enterprise Management Associates. "Pentaho for Hadoop offers tight data integration and BI capabilities at a great price per CPU. It's an essential tool set addition for senior level architects and others at larger organizations with Big Data initiatives, or even for a DBA or ETL guy trying to get into Hadoop."

Tweet this:
Pentaho for Hadoop available for download 

About Pentaho, a Hitachi Data Systems company

Pentaho, a Hitachi Data Systems company, is a leading data integration and business analytics company with an enterprise-class, open source-based platform for diverse big data deployments. Pentaho’s unified data integration and analytics platform is comprehensive, completely embeddable and delivers governed data to power any analytics in any environment. Pentaho’s mission is to help organizations across multiple industries harness the value from all their data, including big data and IoT, enabling them to find new revenue streams, operate more efficiently, deliver outstanding service and minimize risk. Pentaho has over 15,000 product deployments and 1,500 commercial customers today including ABN-AMRO Clearing, EMC, Landmark Halliburton, Moody's, NASDAQ, RichRelevance, and Staples. For more information visit