Data is everywhere. Providing a consistent, single version of the truth across all sources of information is one of the biggest challenges faced by IT
organizations today. Pentaho Data Integration delivers powerful Extraction, Transformation and Loading (ETL) capabilities using an innovative, metadata-driven
approach. With an intuitive, graphical, drag and drop design environment, and a proven, scalable, standards-based architecture, Pentaho Data Integration is
increasingly the choice for organizations over traditional, proprietary ETL or data integration tools.
Ease of Use
Pentaho Data Integration's metadata-driven approach means you simply specify WHAT you want to do, but not HOW you want to do it. Now administrators can create
complex transformations and jobs in a graphical, drag-and-drop environment without having to generate any custom code. Pentaho Data Integration is a
full-featured ETL solution including:
- Rich transformation library with over 100 out-of-the-box mapping objects
- Advanced data warehousing support for Slowly Changing and Junk Dimensions
- Proven enterprise-class performance and scalability
- A large ecosystem of partners and contributors that provide extensions like integrated data cleansing, ERP connectors, and more
Modern, Standards-based Architecture
Pentaho Data Integration's open, standards-based architecture is a natural fit for any environment or BI solution. Major benefits of the architecture include:
- 100% Java with broad, cross platform support
- Complete separation of user interface, data, and metadata
- Fully integrated with the Pentaho BI Suite providing advanced scheduling, process integration, reporting, and analysis
Enterprise-Class ETL
- Broad out-of-the-box data source support including packaged applications, over 30 open source and proprietary database platforms, flat files, Excel documents, and more
- Extensible architecture makes custom plug-in and connector development a breeze
- Repository-based providing easy re-use of transformation components, multi-developer collaboration, and structured management of models, connections, logs, and more
- Enterprise class performance and scalability with support for massively parallel processing (MPP) through clustered execution of transformations
- Integrated debugger to streamline troubleshooting of data integration processes
- Pentaho Data Integration Enterprise Console to allow administrators to monitor and manage ETL performance over time, to start, stop, and pause live jobs, and to set time boundaries for job execution
- Data Integration Enterprise Console allowing administrators to analyze job performance trends over time, to stop, pause, and restart live jobs, and set execution thresholds
The End of 'Build vs. Buy'
One of the most difficult decisions in any data warehousing project is whether populate your data warehouse manually using custom code or choose a proprietary
ETL tool like Informatica or Oracle Warehouse Builder.
The 'build' solution is appealing in that there are no up front costs associated with software licensing and you can build the solution to your exact
specifications. However, businesses today are in a constant state of change and the ongoing costs to maintain a custom solution often negate the initial
savings. Proprietary ETL offerings will get your project off the ground faster and provide dramatic savings in maintenance costs over time, but often
carry a six figure price tag just to get started. Pentaho Data Integration delivers the best of both worlds with no up front license costs and a significant
reduction in TCO compared to custom built solutions. Pentaho Data Integration provides advanced functionality, professional support, certified software,
software maintenance, product expertise and software assurance.
Common Use Cases
- Data warehouse population with built-in support for slowly changing dimensions, junk dimensions and much, much more.
- Export of database(s) to text-file(s) or other databases
- Import of data into databases, ranging from text-files to excel sheets
- Data migration between database applications
- Exploration of data in existing databases (tables, views, etc.)
- Information enrichment by looking up data in various information stores (databases, text-files, excel sheets and more )
- Data cleaning by applying complex conditions in data transformations
- Application integration
Pentaho Data Integration Enterprise Edition
Pentaho Data Integration Enterprise Edition extends Pentaho’s best-in-class open source business intelligence (BI) capabilities with additional software and services designed to help you and your organization:
- Achieve BI success
- Save time, resources, and money
- Mitigate risk
For more information on the features and benefits of Pentaho’s Data Integration Enterprise Editions, please see the
Pentaho Data Integration data sheet.
|