LLMs and Agents in DevOps Workflows Training Course
LLMs and autonomous agent frameworks like AutoGen and CrewAI are redefining how DevOps teams automate tasks such as change tracking, test generation, and alert triage by simulating human-like collaboration and decision-making.
This instructor-led, live training (online or onsite) is aimed at advanced-level engineers who wish to design and implement DevOps automation workflows powered by large language models (LLMs) and multi-agent systems.
By the end of this training, participants will be able to:
- Integrate LLM-based agents into CI/CD workflows for smart automation.
- Automate test generation, commit analysis, and change summaries using agents.
- Coordinate multiple agents for triaging alerts, generating responses, and providing DevOps recommendations.
- Build secure and maintainable agent-powered workflows using open-source frameworks.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Course Outline
Introduction to LLMs and Agent Frameworks
- Overview of large language models in infrastructure automation
- Key concepts in multi-agent workflows
- AutoGen, CrewAI, and LangChain: use cases in DevOps
Setting Up LLM Agents for DevOps Tasks
- Installing AutoGen and configuring agent profiles
- Using OpenAI API and other LLM providers
- Setting up workspaces and CI/CD-compatible environments
Automating Test and Code Quality Workflows
- Prompting LLMs to generate unit and integration tests
- Using agents to enforce linting, commit rules, and code review guidelines
- Automated pull request summarization and tagging
LLM Agents for Alert Handling and Change Detection
- Designing responder agents for pipeline failure alerts
- Analyzing logs and traces using language models
- Proactive detection of high-risk changes or misconfigurations
Multi-Agent Coordination in DevOps
- Role-based agent orchestration (planner, executor, reviewer)
- Agent messaging loops and memory management
- Human-in-the-loop design for critical systems
Security, Governance, and Observability
- Handling data exposure and LLM safety in infrastructure
- Auditing agent actions and restricting scope
- Tracking pipeline behavior and model feedback
Real-World Use Cases and Custom Scenarios
- Designing agent workflows for incident response
- Integrating agents with GitHub Actions, Slack, or Jira
- Best practices for scaling LLM integration in DevOps
Summary and Next Steps
Requirements
- Experience with DevOps tooling and pipeline automation
- Working knowledge of Python and Git-based workflows
- Understanding of LLMs or exposure to prompt engineering
Audience
- Innovation engineers and AI-integrated platform leads
- LLM developers working in DevOps or automation
- DevOps professionals exploring intelligent agent frameworks
Open Training Courses require 5+ participants.
LLMs and Agents in DevOps Workflows Training Course - Booking
LLMs and Agents in DevOps Workflows Training Course - Enquiry
LLMs and Agents in DevOps Workflows - Consultancy Enquiry
Consultancy Enquiry
Testimonials (1)
Trainer responding to questions on the fly.
Adrian
Course - Agentic AI Unleashed: Crafting LLM Applications with AutoGen
Provisional Courses
Related Courses
Interactive AI Agents: AgentCore Memory, Code Interpreter & Browser Tool in Action
14 HoursAgentCore provides memory persistence, a secure code interpreter, and a browser tool that enable AI agents to deliver interactive, dynamic, and context-aware experiences.
This instructor-led, live training (online or onsite) is aimed at intermediate-level to advanced-level technical practitioners who wish to design and deploy AI agents capable of long-term context retention, on-the-fly computation, and direct interaction with web UIs.
By the end of this training, participants will be able to:
- Implement AgentCore memory for stateful, context-aware workflows.
- Leverage the secure code interpreter for dynamic calculations and transformations.
- Integrate the browser tool for real-time data retrieval and UI interaction.
- Design interactive agents for analytics, customer support, and research use cases.
Format of the Course
- Interactive lecture and discussion.
- Hands-on lab exercises with AgentCore memory and tools.
- Case studies in analytics, automation, and customer support scenarios.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Accelerating AI Agent Deployment with AgentCore Runtime & Gateway
14 HoursAgentCore Runtime & Gateway is an AWS service pairing for packaging, deploying, and securely exposing AI agents with streamlined integrations to external systems.
This instructor-led, live training (online or onsite) is aimed at intermediate-level engineering teams who wish to move from agent prototypes to production by mastering the AgentCore Runtime for deployment and the Gateway for secure connectivity and API integration.
By the end of this training, participants will be able to:
- Stand up AgentCore Runtime environments and package agents for deployment.
- Expose agents through Gateway with authenticated, rate-limited endpoints.
- Integrate external tools and APIs into agent workflows using stable contracts.
- Instrument observability, logging, and usage monitoring for production operation.
Format of the Course
- Interactive lecture and discussion.
- Hands-on labs with Runtime deployments and Gateway integrations.
- Practical exercises focused on reliability, security, and rollout.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
AIOps Foundation – Accredited Training
14 HoursAIOps is a rapidly evolving field that addresses the needs of modern, complex IT environments—particularly those operating within cloud architectures. The AIOps Foundation course offers a comprehensive introduction to the concepts, technologies, and practices related to the use of artificial intelligence in IT operations.
The program covers the background of AIOps, its core principles, tools, and the organizational challenges faced by IT teams adopting these approaches.
The training concludes with an exam. Passing it grants the globally recognized AIOps Foundation certification, valid for three years.
Who is it for?
This course is designed for professionals and managers involved in:
IT operations
DevOps and Site Reliability Engineering (SRE)
Cloud architecture
Data analysis and Data Science
Software development
IT security
Product and project management
AIOps in Action: Incident Prediction and Root Cause Automation
14 HoursAIOps (Artificial Intelligence for IT Operations) is increasingly being used to predict incidents before they occur and automate root cause analysis (RCA) to minimize downtime and accelerate resolution.
This instructor-led, live training (online or onsite) is aimed at advanced-level IT professionals who wish to implement predictive analytics, automate remediation, and design intelligent RCA workflows using AIOps tools and machine learning models.
By the end of this training, participants will be able to:
- Build and train ML models to detect patterns leading to system failures.
- Automate RCA workflows based on multi-source log and metric correlation.
- Integrate alerting and remediation processes into existing platforms.
- Deploy and scale intelligent AIOps pipelines in production environments.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
AIOps Fundamentals: Monitoring, Correlation, and Intelligent Alerting
14 HoursAIOps (Artificial Intelligence for IT Operations) is a practice that applies machine learning and analytics to automate and improve IT operations, particularly in the areas of monitoring, incident detection, and response.
This instructor-led, live training (online or onsite) is aimed at intermediate-level IT operations professionals who wish to implement AIOps techniques to correlate metrics and logs, reduce alert noise, and improve observability through intelligent automation.
By the end of this training, participants will be able to:
- Understand the principles and architecture of AIOps platforms.
- Correlate data across logs, metrics, and traces to identify root causes.
- Reduce alert fatigue through intelligent filtering and noise suppression.
- Use open-source or commercial tools to monitor and respond to incidents automatically.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Building an AIOps Pipeline with Open Source Tools
14 HoursAn AIOps pipeline built entirely with open-source tools allows teams to design cost-effective and flexible solutions for observability, anomaly detection, and intelligent alerting in production environments.
This instructor-led, live training (online or onsite) is aimed at advanced-level engineers who wish to build and deploy an end-to-end AIOps pipeline using tools like Prometheus, ELK, Grafana, and custom ML models.
By the end of this training, participants will be able to:
- Design an AIOps architecture using only open-source components.
- Collect and normalize data from logs, metrics, and traces.
- Apply ML models to detect anomalies and predict incidents.
- Automate alerting and remediation using open tooling.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
AutoGen for Enterprise AI Automation
21 HoursAutoGen for Enterprise AI Automation is a hands-on course focused on implementing scalable, intelligent agent systems to automate complex business operations using the AutoGen framework.
This instructor-led, live training (online or onsite) is aimed at intermediate-level to advanced-level AI professionals who wish to deploy multi-agent architectures across enterprise platforms and processes using the AutoGen framework.
By the end of this training, participants will be able to:
- Design and automate enterprise workflows using AutoGen and LLM agents.
- Integrate AutoGen with LangChain for advanced orchestration and context handling.
- Build RAG pipelines and connect enterprise data for contextual automation.
- Connect agents with enterprise platforms like Slack, Jira, and SharePoint.
- Scale and monitor AutoGen deployments in production environments.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Building Fully Managed AI Agents with AgentCore: From Concept to Production
14 HoursAgentCore simplifies the process of building, enhancing, and monitoring fully managed AI agents by providing a unified suite of services tailored for deployment at scale.
This instructor-led, live training (online or onsite) is aimed at beginner-level to intermediate-level practitioners who wish to gain hands-on experience creating production-ready AI agents with AgentCore.
By the end of this training, participants will be able to:
- Understand the core capabilities of AgentCore for AI agent development.
- Design and configure simple AI agents using managed services.
- Integrate workflows to enhance agent functionality.
- Deploy and monitor AI agents for production environments.
Format of the Course
- Interactive lecture and discussion.
- Hands-on labs with AgentCore services.
- Guided exercises from agent concept to deployment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Enterprise Agentic AI with Amazon Bedrock AgentCore
14 HoursAmazon Bedrock AgentCore is an enterprise-ready framework for building, deploying, and scaling AI agents with integrated support for memory, observability, and secure identity management.
This instructor-led, live training (online or onsite) is aimed at intermediate-level to advanced-level engineers and architects who wish to design, secure, and operate agentic AI systems using AWS Bedrock AgentCore.
By the end of this training, participants will be able to:
- Understand the architecture and components of AgentCore.
- Deploy and manage AI agents with Runtime and Gateway.
- Implement persistent memory and stateful interactions.
- Apply identity, observability, and compliance controls.
- Design multi-agent systems for enterprise-scale workflows.
Format of the Course
- Interactive lecture and discussion.
- Hands-on AWS lab sessions with AgentCore.
- Practical exercises with deployment and monitoring scenarios.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Enterprise AIOps with Splunk, Moogsoft, and Dynatrace
14 HoursEnterprise AIOps platforms like Splunk, Moogsoft, and Dynatrace provide powerful capabilities for detecting anomalies, correlating alerts, and automating responses across large-scale IT environments.
This instructor-led, live training (online or onsite) is aimed at intermediate-level enterprise IT teams who wish to integrate AIOps tools into their existing observability stack and operational workflows.
By the end of this training, participants will be able to:
- Configure and integrate Splunk, Moogsoft, and Dynatrace into a unified AIOps architecture.
- Correlate metrics, logs, and events across distributed systems using AI-driven analysis.
- Automate incident detection, prioritization, and response with built-in and custom workflows.
- Optimize performance, reduce MTTR, and improve operational efficiency at enterprise scale.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Implementing AIOps with Prometheus, Grafana, and ML
14 HoursPrometheus and Grafana are widely adopted tools for observability in modern infrastructure, while machine learning enhances these tools with predictive and intelligent insights to automate operations decisions.
This instructor-led, live training (online or onsite) is aimed at intermediate-level observability professionals who wish to modernize their monitoring infrastructure by integrating AIOps practices using Prometheus, Grafana, and ML techniques.
By the end of this training, participants will be able to:
- Configure Prometheus and Grafana for observability across systems and services.
- Collect, store, and visualize high-quality time series data.
- Apply machine learning models for anomaly detection and forecasting.
- Build intelligent alerting rules based on predictive insights.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Securing AI Agents: Identity, Observability, and Compliance with AgentCore
14 HoursAgentCore provides built-in identity, observability, and compliance features that enable organizations to deploy AI agents responsibly in enterprise environments.
This instructor-led, live training (online or onsite) is aimed at advanced-level practitioners who wish to design and operate secure, auditable, and compliant AI agent systems using Amazon Bedrock AgentCore.
By the end of this training, participants will be able to:
- Implement enterprise identity and permissioning models for agents.
- Enable observability through structured logging, metrics, and tracing.
- Apply compliance controls to align with regulatory frameworks.
- Audit agent activity and maintain secure session-level controls.
Format of the Course
- Interactive lecture and discussion.
- Hands-on labs with AWS security and monitoring tools.
- Case studies in regulated enterprise environments.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Building LLM Agent Systems with AutoGen
21 HoursBuilding LLM Agent Systems with AutoGen is a hands-on course focused on developing multi-agent systems using Microsoft’s AutoGen framework for large language models (LLMs).
This instructor-led, live training (online or onsite) is aimed at intermediate-level AI and automation professionals who wish to design, implement, and orchestrate multi-agent systems using AutoGen with Python and LLMs.
By the end of this training, participants will be able to:
- Design multi-agent architectures using the AutoGen framework.
- Configure agent roles, capabilities, and coordination behaviors.
- Use function-calling and memory handling for agent interactions.
- Build and test Python-based LLM agent workflows for real use cases.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Agentic AI Unleashed: Crafting LLM Applications with AutoGen
7 HoursThis 1-day workshop, designed for developers, data scientists, and AI enthusiasts, will help you understand and harness the power of agentic AI systems using AutoGen v0.4.
Through a mix of hands-on exercises and practical demonstrations, you’ll learn how to build, manage, and deploy multi-agent applications powered by Large Language Models (LLMs).
By the end of the course, you'll gain a solid foundation in AutoGen’s layered architecture, master asynchronous communication between agents, and explore real-world use cases and best practices for developing scalable and intelligent LLM-driven applications.
Next-Gen Multi-Agent Systems Using Amazon Bedrock AgentCore
14 HoursAmazon Bedrock AgentCore provides the infrastructure to design, orchestrate, and manage multi-agent systems, enabling collaboration, specialization, and advanced automation patterns across multiple AI-driven entities.
This instructor-led, live training (online or onsite) is aimed at advanced-level practitioners who wish to design, deploy, and optimize next-generation multi-agent systems using AgentCore’s orchestration and governance features.
By the end of this training, participants will be able to:
- Understand the principles and architectures of multi-agent systems.
- Design orchestration strategies for agent collaboration.
- Implement communication and negotiation protocols between agents.
- Deploy scalable, enterprise-ready multi-agent solutions using AgentCore.
Format of the Course
- Interactive lecture and discussion.
- Hands-on labs with AgentCore orchestration features.
- Case studies of enterprise multi-agent applications.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.