Pentaho Labs Unleashes Apache Spark™ Integration
May 12, 2015, San Francisco —
Delivering the future of analytics, Pentaho Corporation today announced the native integration of Pentaho Data Integration (PDI) with Apache Spark™, enabling orchestration of Spark jobs. A development effort initiated by Pentaho Labs, this integration will enable customers to increase productivity, reduce maintenance costs, and dramatically lower the skill sets required as Spark is incorporated into big data projects.
Spark is a powerful open source processing engine built around speed, ease of use, and machine learning. Engineered from the bottom up for performance, Spark is a next-generation big data technology to store, blend, and govern data at entirely new levels of speed, scale and simplicity. Building on complementary open source foundations allowed Pentaho to innovate early with this emerging big data technology.
“For two years, we experimented with possible use cases based on our big data blueprints and sizing the enterprise market opportunity for Spark. Our customers now benefit from that work with simplified, real-time analytic capabilities,” said James Dixon, Chief Technology Officer at Pentaho. “Our open-source heritage and modern extensible platform allow us to quickly evolve our capabilities, keeping our customers’ big data technology options open, reducing risk and saving considerable development time while taking advantage of the latest innovations in popular big data stores.”
As big data technologies evolve at breakneck speed, the Pentaho Labs team continues to leverage and drive innovation in big data integration and analytics, allowing customers to advance their big data deployments without risk. Today’s integration with Spark follows other Labs efforts that have led to support for YARN and the Adaptive Big Data Layer. Following the native support of YARN alone, enterprise customers like RichRelevance, edo Interactive and MultiPlan have been able to innovate and drive greater value from Hadoop.
“Apache Spark couples high-performance, in-memory data processing and multiple computation models that make it well-suited to power next-generation data processing platforms,” said Matt Aslett, Research Director, Data Platforms and Analytics, 451 Research. “The integration with Spark illustrates how Pentaho’s open source approach enables it to respond as emerging technologies rise to prominence in the ever-evolving big data market, and to integrate them with its data management and analytics platform.”
Pentaho Data Integration for Apache Spark is currently available in Pentaho Labs. General availability (GA) is planned for June 2015. To learn more about the innovation in Pentaho Labs, visit www.pentaho.com/labs.
Attend the webinar, Emerging Big Data Technologies: Pentaho Labs Presents Apache Spark, on Tuesday, June 2, 2015 at 10 a.m. PT. Register at http://events.pentaho.com/pentaho-labs-apache-spark-registration.html.
About Pentaho Labs
Pentaho Labs, led by Pentaho founders Richard Daley and James Dixon, is staffed with top industry experts to incubate breakthrough advanced analytic capabilities driven by big data. Pentaho Labs encourages seeding of new approaches and technologies that can over time be merged into the Pentaho roadmap based on market demand.
About Pentaho, a Hitachi Group company
Pentaho, a Hitachi Group company, is a leading data integration and business analytics company with an enterprise-class, open source-based platform for diverse big data deployments. Pentaho’s unified data integration and analytics platform is comprehensive, completely embeddable and delivers governed data to power any analytics in any environment. Pentaho’s mission is to help organizations across multiple industries harness the value from all their data, including big data and IoT, enabling them to find new revenue streams, operate more efficiently, deliver outstanding service and minimize risk. Pentaho has over 15,000 product deployments and 1,500 commercial customers today including ABN-AMRO Clearing, BT, Caterpillar Marine Asset Intelligence, EMC, Moody's, NASDAQ and Sears Holdings Corporation. For more information visit www.pentaho.com.