Pentaho Labs Develops Python Native Integration

Unlocks Accessibility to Data Science and Predictive Modeling for Developer Community

January 26, 2016

Pentaho, a Hitachi Group company, today announced Pentaho Labs has developed native integration for Python. Through Pentaho Data Integration (PDI), data scientists can now use one of the most flexible and powerful open-source languages of today to increase both productivity and data governance, while spending more time on predictive analytics and machine learning.

“As the field of data science continues to grow outside the world of research and statisticians, it is important for our team to arm developers with a wide range of programming languages. R was developed as a language by and for statisticians, while C++ or Java requires extensive coding,” said Will Gorman, VP of Pentaho Labs at Pentaho, a Hitachi Group company. “Python provides developers another option for data science with a general-purpose language. With these languages, data scientists have the ability to use the most appropriate language with increased use of data preprocessing through PDI.”

Building on complementary open source foundations allowed Pentaho to innovate early with this emerging big data technology. The Python programming language allows data scientists to work quickly, easily crunching big data sets and integrating systems more effectively. As the preferred language for deep learning researchers, Python provides engineers in data science the ability to more easily develop predictive models. Through this integration, the Pentaho Labs team continues to leverage and drive innovation in big data and analytics, allowing customers to advance their big data deployments without risk.

Python + Pentaho Simplify Data Prep for Data Science/Machine Learning

“Python is widely deployed by developers and engineers to create statistical analytic workflows, particularly in areas such as finance, oil and gas, and physics,” said Matt Aslett, research director, 451 Research. “We see Python as a primary language for artificial intelligence engines and Pentaho’s native integration of Python will allow organizations to apply their deep domain expertise and improve predictive analytics and machine learning algorithms.”


Pentaho Data Integration for Python is available to download in the Pentaho Marketplace. To learn more about the innovation in Pentaho Labs visit:


Pentaho Labs is an applied lab staffed with top industry experts to incubate breakthrough data orchestration and advanced analytic capabilities driven by big data. Pentaho Labs is responsible for evaluating and developing innovative approaches and technologies, which, over time, can be merged into the Pentaho roadmap based on market demand.

About Pentaho, a Hitachi Group company

Pentaho, a Hitachi Group company, is a leading data integration and business analytics company with an enterprise-class, open source-based platform for diverse big data deployments. Pentaho’s unified data integration and analytics platform is comprehensive, completely embeddable and delivers governed data to power any analytics in any environment. Pentaho’s mission is to help organizations across multiple industries harness the value from all their data, including big data and IoT, enabling them to find new revenue streams, operate more efficiently, deliver outstanding service and minimize risk. Pentaho has over 15,000 product deployments and 1,500 commercial customers today including ABN-AMRO Clearing, BT, Caterpillar Marine Asset Intelligence, EMC, Moody's, NASDAQ and Sears Holdings Corporation. For more information visit