Data is everywhere. Providing a consistent, single version of the truth across all sources of information is one of the biggest challenges faced by IT organizations
today. Pentaho Data Integration delivers powerful Extraction, Transformation and Loading (ETL) capabilities using an innovative, metadata-driven approach.
The ease of use of our graphical, drag-and-drop design increases productivity, and our extensible, standards-based architecture ensures that you will never be
forced to adopt proprietary methodologies into your ETL solution.
Ease of Use
Pentaho Data Integration's metadata-driven approach means you simply specify WHAT you want to do, but not HOW you want to do it. Now administrators can create
complex transformations and jobs in a graphical, drag-and-drop environment without having to write any custom code. Pentaho Data Integration is a
full-featured ETL solution that includes:
- Rich transformation library with over 80 out-of-the-box mapping objects
- Advanced data warehousing support for Slowly Changing and Junk Dimensions
- Enterprise-class performance and scalability
- ERP connectors and data quality plug-ins also available
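To make the WHAT-versus-HOW distinction concrete, the following is a minimal sketch of a metadata-driven pipeline in plain Python. It is not Pentaho Data Integration's API or file format: the step names (`filter_rows`, `rename_field`, `uppercase`), the `PIPELINE` description, and the `run_pipeline` function are all hypothetical, chosen only to illustrate how a transformation can be described as data and executed by a generic engine.

```python
from typing import Any, Callable, Dict, List

Row = Dict[str, Any]
StepConfig = Dict[str, Any]

# The "WHAT": a transformation described purely as metadata (hypothetical format).
PIPELINE: List[StepConfig] = [
    {"step": "filter_rows", "field": "country", "equals": "US"},
    {"step": "rename_field", "old": "cust_name", "new": "customer"},
    {"step": "uppercase", "field": "customer"},
]


def _filter_rows(rows: List[Row], cfg: StepConfig) -> List[Row]:
    # Keep only rows whose field matches the configured value.
    return [r for r in rows if r.get(cfg["field"]) == cfg["equals"]]


def _rename_field(rows: List[Row], cfg: StepConfig) -> List[Row]:
    # Copy each row so the caller's input is never modified.
    renamed = []
    for r in rows:
        r = dict(r)
        if cfg["old"] in r:
            r[cfg["new"]] = r.pop(cfg["old"])
        renamed.append(r)
    return renamed


def _uppercase(rows: List[Row], cfg: StepConfig) -> List[Row]:
    # Build new rows with the configured field upper-cased.
    return [{**r, cfg["field"]: str(r[cfg["field"]]).upper()} for r in rows]


# The "HOW": a library of reusable step implementations, looked up by name.
STEP_LIBRARY: Dict[str, Callable[[List[Row], StepConfig], List[Row]]] = {
    "filter_rows": _filter_rows,
    "rename_field": _rename_field,
    "uppercase": _uppercase,
}


def run_pipeline(rows: List[Row], pipeline: List[StepConfig]) -> List[Row]:
    """Apply each configured step in order and return the resulting rows."""
    for cfg in pipeline:
        rows = STEP_LIBRARY[cfg["step"]](rows, cfg)
    return list(rows)
```

Changing the behavior of this pipeline means editing the `PIPELINE` description rather than the engine, which is the property a graphical designer relies on when it stores transformations as metadata.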
Modern, Standards-based Architecture
Pentaho Data Integration's open, standards-based architecture is a natural fit for any environment or BI solution. Major benefits of the architecture include:
- 100% Java with broad, cross-platform support
- Complete separation of user interface, data, and metadata
- Fully integrated with the Pentaho Open BI Suite providing advanced scheduling, workflow, reporting, and analysis
Enterprise-Class ETL
- Broad out-of-the-box data source support including packaged applications, over 30 open source and proprietary database platforms, flat files, Excel documents, and more
- Extensible architecture makes custom plug-in and connector development a breeze
- Repository-based design that provides easy reuse of transformation components, multi-developer collaboration, and structured management of models, connections, logs, and more
- Enterprise-class performance and scalability with support for massively parallel processing (MPP) through clustered execution of transformations
- Integrated debugger to streamline troubleshooting of data integration processes
The End of 'Build vs. Buy'
One of the most difficult decisions in any data warehousing project is whether to populate your data warehouse manually using custom code or to choose a proprietary
ETL tool like Informatica or Oracle Warehouse Builder.
The 'build' solution is appealing in that there are no up-front costs associated with software licensing and you can build the solution to your exact
specifications. However, businesses today are in a constant state of change, and the ongoing costs to maintain a custom solution often negate the initial
savings. Proprietary ETL offerings will get your project off the ground faster and provide dramatic savings in maintenance costs over time, but often
carry a six-figure price tag just to get started. Pentaho Data Integration delivers the best of both worlds, with no up-front license costs and a significant
reduction in TCO compared to custom-built solutions. An annual subscription providing professional support, certified builds, and IP indemnification is also
available at a fraction of the cost of proprietary offerings.
Common Use Cases
- Data warehouse population with built-in support for slowly changing dimensions, junk dimensions, and more
- Export of databases to text files or other databases
- Import of data into databases from sources ranging from text files to Excel sheets
- Data migration between database applications
- Exploration of data in existing databases (tables, views, etc.)
- Information enrichment by looking up data in various information stores (databases, text files, Excel sheets, and more)
- Data cleaning by applying complex conditions in data transformations
- Application integration
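For readers unfamiliar with the slowly changing dimensions named in the first use case, one common variant is "Type 2" handling: when a tracked attribute changes, the current dimension row is closed and a new versioned row is added, so history is preserved. The sketch below illustrates that idea in plain Python. It is not how Pentaho Data Integration implements it; the function `apply_scd2` and the column names `valid_from`, `valid_to`, and `version` are assumptions made for this example.

```python
from datetime import date
from typing import Any, Dict, List

Row = Dict[str, Any]


def apply_scd2(dimension: List[Row], incoming: Row, key: str,
               tracked: List[str], today: date) -> List[Row]:
    """Return a copy of the dimension with Type 2 history applied for one record."""
    # Work on copies so the caller's table is left untouched.
    result = [dict(row) for row in dimension]

    # The current version of a record is the row that has not been closed yet.
    current = next(
        (r for r in result if r[key] == incoming[key] and r["valid_to"] is None),
        None,
    )

    if current is not None:
        if all(current[col] == incoming[col] for col in tracked):
            return result  # no tracked attribute changed, nothing to do
        current["valid_to"] = today  # close the outdated version

    # Append the new version, numbering versions from 1.
    new_row: Row = {key: incoming[key]}
    new_row.update({col: incoming[col] for col in tracked})
    new_row["valid_from"] = today
    new_row["valid_to"] = None
    new_row["version"] = current["version"] + 1 if current is not None else 1
    result.append(new_row)
    return result
```

A production implementation would typically also generate surrogate keys and run as set-based operations against the warehouse; those details are omitted here.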