Get in Touch

Course Outline

Advanced Transformation Building Blocks

  • Handling complex data types
  • Managing fields, metadata, and dynamic structures
  • Implementing reusable transformation patterns

Parameters, Variables, and Job-Oriented Design

  • Managing runtime variables and scoping
  • Parameterizing transformations
  • Structuring parent-child jobs

Database Integration and Lookup Strategies

  • Utilizing advanced lookup steps
  • Applying effective caching strategies
  • Designing efficient joins

Working with Files, APIs, and External Systems

  • Processing JSON and XML data
  • Invoking REST and SOAP services
  • Executing streaming and batch loads

Error Handling and Data Quality Techniques

  • Capturing and routing errors
  • Applying data validation patterns
  • Conducting auditing and logging

Performance Tuning Essentials

  • Optimizing step design
  • Addressing memory and threading considerations
  • Identifying bottlenecks

Introduction to Repository-Based Development

  • Using the Pentaho repository
  • Managing versions
  • Adopting team collaboration practices

Deployment and Migration Practices

  • Promoting jobs across environments
  • Managing configurations
  • Following operational best practices

Summary and Next Steps

Requirements

  • Foundational understanding of ETL processes
  • Prior experience with Pentaho Data Integration
  • Basic knowledge of data warehousing principles

Target Audience

  • ETL developers
  • Data engineers
  • Technical professionals looking to expand their PDI skills
 21 Hours

Number of participants


Price per participant

Testimonials (2)

Upcoming Courses

Related Categories