Big Data Should Not Mean Big Cost

Data volumes are growing at unprecedented rates. Apache Hadoop, an open source technology, has become the technology of choice for enterprises that need to collect, store and process large amounts of structured and complex data.

While Hadoop is very powerful, in its raw form it lacks easy-to-use interfaces for timely and cost-effective analysis. Once data gets into Hadoop, how do you get it out? How do you explore and analyze it? If only there were an ETL and BI product...

Announcing Pentaho's support of Hadoop.

Highlights

  • The Pentaho BI Suite offers comprehensive data integration, reporting and analytical capabilities that enable Hadoop developers and business analysts to quickly and easily create BI applications without coding. Think of Pentaho as the front end to Hadoop for building data integration and business intelligence applications.
  • Pentaho Data Integration (also known as Kettle) is a natural fit as the data integration solution for Hadoop, given its rich design tools, scalable architecture, open source distribution and adoption at a large number of Hadoop sites.
  • The Pentaho BI Suite offers unmatched deployment flexibility: the same full-featured platform can be deployed on-premises, in the cloud or embedded in custom applications.
  • The first deliverable in this initiative is the enhancement of Pentaho Data Integration (PDI) to be the visual design environment for ETL processes that include the manipulation of Apache Hadoop files and the execution of Hadoop tasks. The next set of deliverables, to follow soon after, will enable reporting, dashboards and analysis directly against data stored in Hadoop.

What the community is saying about Pentaho’s support for Hadoop

The demanding pressure to apply Analytics to deliver insight for business continues to grow as the volumes of data exponentially grow. Pentaho is stepping up to lead the integration of data for Hadoop and provide the BI platform and tools to generate the Analytics and deliver a broad range of capabilities for business and IT.
Mark Smith
CEO & EVP Research
Ventana Research

We use Hadoop simply because we hit the wall with traditional RDBMS based on our impression volume. The combination of Hadoop and Pentaho will give us the opportunity to easily and cost-effectively take our 'big data' analysis to an entirely new level and gain insights never before possible.
Naghi Prasad
VP Engineering
Offerpal

Attributor’s Guardian™ monitoring service scans more than 40 billion pages daily; consequently, our data needs are significant. We are committed to both Hadoop and Pentaho, and this integration is a huge win for us.
Adrian McDermott
Chief Technical Officer
Attributor