Course Outline
Enterprise Architecture and Pipeline Design
- Multi-layer ETL architectures
- Designing modular and reusable components
- Hybrid approaches across systems
Advanced Performance Engineering
- Step-level optimization
- Parallelism and threading strategies
- Monitoring high-load pipelines
Automation, Scripting, and Custom Extensions
- Scripting inside transformations
- Developing custom plugins
- Extending PDI capabilities with Java and JavaScript
Complex Data Processing and Integrations
- Real-time and streaming integrations
- Working with big data platforms
- Advanced file and API processing
Data Governance, Security, and Compliance
- Securing transformations and credentials
- Data lineage and traceability
- Regulatory and compliance considerations
Enterprise Orchestration and Scheduling
- Managing large job networks
- Error recovery and failover design
- Environment-level orchestration
Repository, Version Control, and CI/CD
- Enterprise repository strategies
- Integrating PDI with Git
- Continuous deployment patterns
Deployment, Monitoring, and Production Operations
- Promoting solutions across environments
- Operational tooling and dashboards
- End-to-end production readiness
Summary and Next Steps
Requirements
- An understanding of ETL pipelines and data modeling
- Experience with intermediate-level PDI transformations
- Solid SQL and scripting skills
Audience
- Senior data engineers
- ETL architects
- Professionals managing complex data integration workloads
Testimonials (5)
Practical classes, exercises, possibility of applying the discussed solutions in practice.
Agnieszka - Izba Administracji Skarbowej
Course - Platforma analityczna KNIME - szkolenie kompleksowe
Machine Translated
knowledge, exemplary training
Krzysztof Kantorski - Santander
Course - Oracle GoldenGate
Machine Translated
Prepared material. Full professionalism. Very good contact with the trainer. Full engagement and openness to changing the planned training format (very valuable open discussions on the topics we prepared)
Kamil Trebacz - Bank Gospodarstwa Krajowego
Course - Pentaho Data Integration (PDI) - moduł do przetwarzania danych ETL (poziom zaawansowany)
Machine Translated
Very useful in because it helps me understand what we can do with the data in our context. It will also help me
Nicolas NEMORIN - Adecco Groupe France
Course - KNIME Analytics Platform for BI
It's a hands-on session.