Course Outline
Advanced Transformation Building Blocks
- Working with complex data types
- Managing fields, metadata, and dynamic structures
- Reusable transformation patterns
Parameters, Variables, and Job-Oriented Design
- Runtime variables and scoping
- Parameterizing transformations
- Parent-child job structures
Database Integration and Lookup Strategies
- Advanced lookup steps
- Caching strategies
- Efficient join designs
Working with Files, APIs, and External Systems
- Processing JSON and XML
- Calling REST and SOAP services
- Streaming and batch loads
Error Handling and Data Quality Techniques
- Capturing and routing errors
- Data validation patterns
- Auditing and logging
Performance Tuning Essentials
- Optimizing step design
- Memory and threading considerations
- Detecting bottlenecks
Introduction to Repository-Based Development
- Using the Pentaho repository
- Version management
- Team collaboration practices
Deployment and Migration Practices
- Promoting jobs between environments
- Configuration management
- Operational best practices
Summary and Next Steps
Requirements
- An understanding of ETL fundamentals
- Experience with Pentaho Data Integration
- Basic knowledge of data warehousing concepts
Audience
- ETL developers
- Data engineers
- Technical professionals expanding PDI skills
Testimonials (5)
Practical classes, exercises, possibility of applying the discussed solutions in practice.
Agnieszka - Izba Administracji Skarbowej
Course - Platforma analityczna KNIME - szkolenie kompleksowe
Machine Translated
knowledge, exemplary training
Krzysztof Kantorski - Santander
Course - Oracle GoldenGate
Machine Translated
Prepared material. Full professionalism. Very good contact with the trainer. Full engagement and openness to changing the planned training format (very valuable open discussions on the topics we prepared)
Kamil Trebacz - Bank Gospodarstwa Krajowego
Course - Pentaho Data Integration (PDI) - moduł do przetwarzania danych ETL (poziom zaawansowany)
Machine Translated
Very useful in because it helps me understand what we can do with the data in our context. It will also help me
Nicolas NEMORIN - Adecco Groupe France
Course - KNIME Analytics Platform for BI
It's a hands-on session.