The Power of Pentaho and Hadoop in Action

Type: Whitepaper

Product: Data Integration

This brief describes a recent scalability test in which Pentaho Data Integration executed and orchestrated MapReduce jobs in Hadoop, with the goal of demonstrating sustained performance at scale.

Key points include:

  • How Pentaho improves productivity and shortens time to value in Hadoop projects through Pentaho Visual MapReduce
  • Overview of scalability study trials, including architecture, data set, and transformations executed
  • Study results showing sustained processing performance on rapidly growing data volumes in Hadoop