Course:
Pentaho Data Integration I
Course Number: PDI2000
Audience: This course is intended for technical users who integrate disparate data sources (including big data sources), build/maintain data models for analysis, and manage BI data/metadata, including: Database Developers, Power Users, Technical Business Analysts, BI Solution Architects, Systems Integrators, and Data Scientists
Level: Introductory – This course is intended for students with database development or administration experience who are new to Pentaho Data Integration
Delivery Method: Public classroom, Instructor-led online, private on-site (please contact us for on-site pricing)
Duration: 4 Days
Public Training Cost: USD $2,600 (4 credits)
Course Placement: This course is the first course in the Database Developer path. Students with prior database development or administration experience who are new to Pentaho Data Integration should take this course.
In the schedule below, click the course title for detailed information on the class, click the provider link for information on the authorized training provider, or click the Register Now link to enroll.
| Location | Language | Provider | Date/Time | Availability |
| Geneva, Switzerland | English | Jun 25, 2013 9:00 AM | Register Now | |
| Tampa, FL | English | Jul 9, 2013 9:00 AM | Register Now | |
| Munich, Germany | German | Jul 9, 2013 9:00 AM | Register Now | |
| Washington, DC | English | Jul 23, 2013 9:00 AM | Register Now | |
| Chicago, IL | English | Aug 20, 2013 9:00 AM | Register Now | |
| Online | English | Sep 17, 2013 9:00 AM | Register Now | |
| Birmingham, UK | English | Oct 1, 2013 9:00 AM | Register Now | |
| Milano, Italy | Italian | Oct 7, 2013 9:00 AM | Register Now | |
| Washington, DC | English | Oct 8, 2013 9:00 AM | Register Now | |
| Online | English | Oct 29, 2013 9:00 AM | Register Now | |
| London, UK | English | Nov 5, 2013 9:00 AM | Register Now | |
| Zurich, Switzerland | English | Nov 12, 2013 9:00 AM | Register Now | |
| Cologne, Germany | German | Nov 19, 2013 9:00 AM | Register Now | |
| Paris,France | French | Nov 26, 2013 9:00 AM | Register Now | |
| Massa, Italy | Italian | Dec 2, 2013 9:00 AM | Register Now | |
| Madrid, Spanish | Spanish | Dec 3, 2013 9:00 AM | Register Now | |
| Online | English | Dec 10, 2013 9:00 AM | Register Now | |
| Birmingham, UK | English | Dec 10, 2013 9:00 AM | Register Now |
Note: All courses provided by Pentaho-EMEA are offered in the Central European time zone. All US courses (provided by Pentaho) are offered in the Eastern time zone. Beginning in July, 2013, all US courses will be offered in the Central time zone.
Course Overview:
With continuous volumes and increased variety and velocity of data, organizations need fast and easy ways to harness data and gain insight from it. However, one of the biggest challenges facing IT organizations today is to provide a consistent, single version of the truth across all sources of information in an analytics-ready format. With powerful data extract, transform and load (ETL) capabilities, an intuitive and rich graphical design environment, and an open and standards-based architecture, Pentaho Data Integration is increasingly the choice over proprietary and homegrown data integration tools.
Pentaho Data Integration provides a full ETL solution, including:
Through a series of lectures and hands-on exercises covering theory, best practices, and design patterns, Pentaho Data Integration for Database Developers provides students the skills they need to maximize the value of data to the organization. This course helps prepare you for the Pentaho Data Integration Certification Exam.
Course Benefits:
Skills Achieved:
At the completion of this course, you should be able to:
Course Requirements:
Students attending classroom courses in the United States are provided with a PC to use during class. Students attending courses outside the US should contact the Authorized Training Provider regarding PC requirements for Pentaho courses.
In general, if your training provider requires you to bring a PC to class, it must meet the following requirements. You can also verify your system against the Compatibility Matrix: List of Supported Products topic in the Pentaho InfoCenter:
Online courses require a broadband Internet connection, the use of a modern Web browser (such as Microsoft Internet Explorer or Mozilla Firefox), and the ability to connect to the WebEx Training Center. For more information on WebEx Training Center requirements, see http://www.webex.com. Online courses use Pentaho’s cloud-based exercise environment. Students are provided access to a virtual machine used to complete the exercises.
For online courses, students are provided with a secured, electronic course manual. Printed manuals are not provided for online courses. When an electronic manual is provided, students are encouraged to print the exercise book before class begins, though this is not required.
Students attending this course on-site should contact their Customer Success Manager for hardware and software requirements. You can also email us at training@pentaho.com for more information regarding on-site training requirements.
Course Agenda:
| Day 1 |
|---|
Course Introduction |
Module 1: Pentaho Data Integration Overview Exercise 1: Introducing Pentaho Data Integration |
Module 2: Inputs and Outputs |
Module 3: Introduction to the Training Data Exercise 2: Inputs and Outputs |
Module 4: Data Warehouse Steps Exercise 3: Data Warehouse Steps |
| Day 2 |
|---|
Module 5: Lookups |
Module 6: Field Transformations, Part 1 Exercise 4: Lookups and Field Transformations |
Module 7: Set Transformations Exercise 5: Set Transformations |
Module 8: Pivot Transformations Exercise 6: Pivot Transformations |
Module 9: Field Transformations, Part 2 |
Module 10: Loading the Time Dimension and the Fact Table Exercise 7: Loading a Fact Table |
| Day 3 |
|---|
Module 11: Introduction to Jobs Exercise 8: Creating a Job |
Module 12: Advanced Job Concepts Exercise 9: Advanced Job Concepts |
Module 13: Common Scripting Uses Exercise 10: Using JavaScript |
Module 14: Dynamic Transformations |
Module 15: Using XML in Pentaho Data Integration Exercise 11: Using XML |
Module 16: Portable Transformations and Jobs Exercise 12: Portable Transformations and Jobs |
| Day 4 |
|---|
Module 17: Logging Exercise 13: Configuring Logging |
Module 18: Error Handling in Transformations Exercise 14: Error Handling in Transformations |
Module 19: ETL Patterns Exercise 15: Calculating Time Between Orders |
(Optional) Module 20: Pentaho Enterprise Repository (Optional) Exercise 16: Pentaho Enterprise Repository |
Module 21: Scheduling and Monitoring Exercise 17: Scheduling and Monitoring |
Module 22: Pre and Post-Processing Exercise 18: Constraint and Index Management |
(Optional) Module 23: Tuning and Administration Topics |
Module 24: Interpreting Runtime Data |
Module 25: Clustering and Partitioning Exercise 19: Clustering and Partitioning |
(Optional) Module 26: Operational Patterns |