LLMs and Agents in DevOps Workflows Training Course
Large Language Models (LLMs) and autonomous agent frameworks such as AutoGen and CrewAI are transforming the way DevOps teams automate tasks like change tracking, test generation, and alert triage by mimicking human-like collaboration and decision-making.
This instructor-led, live training (available online or onsite) targets advanced-level engineers who aim to design and implement DevOps automation workflows driven by LLMs and multi-agent systems.
Upon completion of this training, participants will be able to:
- Integrate LLM-based agents into CI/CD workflows for intelligent automation.
- Automate test generation, commit analysis, and change summaries using agents.
- Coordinate multiple agents to triage alerts, generate responses, and provide DevOps recommendations.
- Develop secure and maintainable agent-powered workflows using open-source frameworks.
Format of the Course
- Interactive lecture and discussion.
- Extensive exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Course Outline
Introduction to LLMs and Agent Frameworks
- Overview of large language models in infrastructure automation
- Key concepts in multi-agent workflows
- AutoGen, CrewAI, and LangChain: use cases in DevOps
Setting Up LLM Agents for DevOps Tasks
- Installing AutoGen and configuring agent profiles
- Using OpenAI API and other LLM providers
- Setting up workspaces and CI/CD-compatible environments
Automating Test and Code Quality Workflows
- Prompting LLMs to generate unit and integration tests
- Using agents to enforce linting, commit rules, and code review guidelines
- Automated pull request summarization and tagging
LLM Agents for Alert Handling and Change Detection
- Designing responder agents for pipeline failure alerts
- Analyzing logs and traces using language models
- Proactive detection of high-risk changes or misconfigurations
Multi-Agent Coordination in DevOps
- Role-based agent orchestration (planner, executor, reviewer)
- Agent messaging loops and memory management
- Human-in-the-loop design for critical systems
Security, Governance, and Observability
- Handling data exposure and LLM safety in infrastructure
- Auditing agent actions and restricting scope
- Tracking pipeline behavior and model feedback
Real-World Use Cases and Custom Scenarios
- Designing agent workflows for incident response
- Integrating agents with GitHub Actions, Slack, or Jira
- Best practices for scaling LLM integration in DevOps
Summary and Next Steps
Requirements
- Experience with DevOps tooling and pipeline automation
- Working knowledge of Python and Git-based workflows
- Understanding of LLMs or exposure to prompt engineering
Audience
- Innovation engineers and AI-integrated platform leads
- LLM developers working in DevOps or automation
- DevOps professionals exploring intelligent agent frameworks
Open Training Courses require 5+ participants.
LLMs and Agents in DevOps Workflows Training Course - Booking
LLMs and Agents in DevOps Workflows Training Course - Enquiry
LLMs and Agents in DevOps Workflows - Consultancy Enquiry
Upcoming Courses
Related Courses
Agentic Development with Gemini 3 and Google Antigravity
21 HoursGoogle Antigravity serves as an agentic development environment for creating autonomous agents that plan, reason, code, and act via Gemini 3's multimodal capabilities.
This instructor-led, live training (available online or onsite) targets advanced-level technical professionals looking to design, build, and deploy autonomous agents using Gemini 3 and the Antigravity environment.
Upon completing this training, participants will be able to:
- Construct autonomous workflows leveraging Gemini 3 for reasoning, planning, and execution.
- Create agents in Antigravity capable of analysing tasks, writing code, and interacting with tools.
- Integrate Gemini-driven agents with enterprise systems and APIs.
- Enhance agent behaviour, safety, and reliability within complex environments.
Course Format
- Expert demonstrations combined with interactive discussions.
- Hands-on experimentation with autonomous agent development.
- Practical implementation using Antigravity, Gemini 3, and supporting cloud tools.
Course Customization Options
- If your team requires domain-specific agent behaviours or custom integrations, please contact us to tailor the program.
Advanced Antigravity: Feedback Loops, Learning & Long-Term Agent Memory
14 HoursGoogle Antigravity serves as a sophisticated framework designed for experimenting with long-lived agents and emergent interactive behaviors.
This instructor-led training session, available either online or on-site, targets advanced-level professionals eager to design, analyze, and optimize agents that can retain memories, improve via feedback, and evolve across extended operational periods.
After completing this course, participants will acquire the ability to:
- Architect long-term memory structures to ensure agent persistence.
- Deploy effective feedback loops to guide agent behavior.
- Assess learning progressions and monitor model drift.
- Integrate memory mechanisms into complex multi-agent ecosystems.
Course Format
- Expert-led discussions combined with technical demonstrations.
- Practical exploration through structured design challenges.
- Application of concepts within simulated agent environments.
Customization Options
- Should your organization require tailored content or case-specific examples, please reach out to us to customize this training.
Advanced Mastra Integrations: APIs, Tools, Enterprise Data & External Systems
21 HoursMastra is a framework designed to facilitate deep integration between AI agents, APIs, enterprise applications, and external data systems.
This instructor-led live training, available both online and onsite, targets intermediate-level engineers looking to build reliable, secure, and scalable integrations between Mastra agents and the wider enterprise ecosystem.
Upon completing this training, participants will be equipped to:
- Implement API-driven integrations between Mastra agents and external services.
- Connect enterprise data systems and tools to automated agent workflows.
- Apply secure data exchange and authentication best practices.
- Design integration layers that are scalable, maintainable, and production ready.
Course Format
- Interactive lectures and discussions.
- Hands-on integration engineering and API exercises.
- Live-lab implementation using real-world enterprise scenarios.
Customization Options
- Custom API scenarios, enterprise system mappings, or data-integration workshops are available upon request.
AIOps Foundation – Accredited Training
35 HoursAIOps is a rapidly developing discipline designed to meet the demands of contemporary, intricate IT landscapes, particularly those leveraging cloud infrastructure. The AIOps Foundation course provides a thorough overview of the fundamental concepts, technologies, and best practices associated with applying artificial intelligence to IT operations.
The curriculum explores the origins of AIOps, its foundational principles, essential tools, and the organizational hurdles that IT teams encounter when adopting these methodologies.
The training program concludes with an examination. Successfully passing this exam awards the globally recognized AIOps Foundation certification, which remains valid for three years.
Who is this course for?
This program is tailored for professionals and managers engaged in:
IT operations
DevOps and Site Reliability Engineering (SRE)
Cloud architecture
Data analysis and Data Science
Software development
IT security
Product and project management
AIOps in Action: Incident Prediction and Root Cause Automation
14 HoursAIOps (Artificial Intelligence for IT Operations) is gaining traction as a method to anticipate incidents before they happen and automate root cause analysis (RCA), thereby reducing downtime and speeding up resolution times.
This instructor-led training, available either online or onsite, targets advanced IT professionals looking to implement predictive analytics, automate remediation processes, and design intelligent RCA workflows using AIOps tools and machine learning models.
Upon completion of this training, participants will be equipped to:
- Construct and train ML models to identify patterns indicative of system failures.
- Automate RCA workflows by correlating logs and metrics from multiple sources.
- Integrate alerting and remediation processes into current platforms.
- Deploy and scale intelligent AIOps pipelines within production environments.
Course Format
- Interactive lectures and discussions.
- Ample exercises and practical sessions.
- Hands-on implementation in a live-lab environment.
Customization Options
- To request customized training for this course, please contact us to make arrangements.
AIOps Fundamentals: Monitoring, Correlation, and Intelligent Alerting
14 HoursAIOps (Artificial Intelligence for IT Operations) is a practice that leverages machine learning and analytics to automate and enhance IT operations, particularly in the areas of monitoring, incident detection, and response.
This instructor-led, live training (online or onsite) is aimed at intermediate-level IT operations professionals who wish to implement AIOps techniques to correlate metrics and logs, reduce alert noise, and improve observability through intelligent automation.
By the end of this training, participants will be able to:
- Understand the principles and architecture of AIOps platforms.
- Correlate data across logs, metrics, and traces to identify root causes.
- Reduce alert fatigue through intelligent filtering and noise suppression.
- Use open-source or commercial tools to monitor and respond to incidents automatically.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Building an AIOps Pipeline with Open Source Tools
14 HoursAn AIOps pipeline developed entirely with open-source tools empowers teams to create cost-effective and adaptable solutions for monitoring, anomaly detection, and intelligent alerting within production environments.
This instructor-led live training (available online or onsite) targets advanced-level engineers who aim to build and deploy a comprehensive AIOps pipeline utilizing tools such as Prometheus, ELK, Grafana, and custom ML models.
Upon completion of this training, participants will be able to:
- Design an AIOps architecture using exclusively open-source components.
- Gather and standardize data from logs, metrics, and traces.
- Implement ML models to identify anomalies and forecast incidents.
- Automate alerting and remediation processes using open tooling.
Course Format
- Interactive lectures and discussions.
- Extensive exercises and practical practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request customized training for this course, please contact us to arrange.
Antigravity for Developers: Building Agent-First Applications
21 HoursAntigravity is a development platform designed to build AI-driven, agent-first applications.
This instructor-led, live training (online or onsite) is aimed at intermediate-level developers who wish to create real-world applications using autonomous AI agents within the Antigravity environment.
After completing this training, participants will be equipped to:
- Develop applications that rely on autonomous and coordinated AI agents.
- Use the Antigravity IDE, editor, terminal, and browser for end-to-end development.
- Manage multi-agent workflows with the Agent Manager.
- Integrate agent capabilities into production-grade software systems.
Format of the Course
- Blended presentations with in-depth demonstrations.
- Extensive hands-on practice and guided exercises.
- Real implementation work inside the Antigravity live environment.
Course Customization Options
- For tailored content aligned with your development stack, please contact us to arrange a customized version of this training.
Getting Started with Antigravity: An Introduction to Agent-First IDEs
14 HoursGoogle Antigravity is an agent-centric development environment designed to optimize engineering workflows through intelligent automation.
This instructor-led, live training (available online or onsite) targets beginner-level professionals looking to explore the fundamentals of Antigravity and understand how agent-driven coding environments can boost productivity.
After completing this training, participants will be able to:
- Install and set up Google Antigravity.
- Navigate and comprehend both the Editor View and Manager View.
- Collaborate effectively with agents to automate basic development tasks.
- Utilize Antigravity to generate, refine, and manage project files.
Course Format
- Instructor-led explanations accompanied by real-time demonstrations.
- Guided exercises focusing on hands-on interaction with agents.
- Practical exploration of core Antigravity features within a controlled lab environment.
Course Customization Options
- If you need a tailored version of this training, please contact us to arrange a customized program.
Antigravity for Web Automation & Browser-Based Tasks
21 HoursGoogle Antigravity serves as a platform for developing agents capable of interacting with web applications, browser environments, and multi-surface workflows.
This instructor-led, live training (available online or onsite) is designed for intermediate-level professionals who wish to build, automate, and test browser-based workflows using Google Antigravity.
Upon completion of the training, participants will be able to:
- Create agents that interact with web applications in a browser surface.
- Automate end-to-end workflows across browser contexts.
- Validate and troubleshoot agent behavior in UI-driven environments.
- Implement cross-surface automation strategies using Antigravity.
Format of the Course
- Guided instruction supported by demonstrations.
- Practical, hands-on activities and scenario-based exercises.
- Implementation of agent workflows in an interactive lab environment.
Course Customization Options
- For customized training requirements, please contact us to tailor the course to your objectives.
Enterprise AIOps with Splunk, Moogsoft, and Dynatrace
14 HoursEnterprise-grade AIOps platforms such as Splunk, Moogsoft, and Dynatrace deliver robust capabilities for identifying anomalies, correlating alerts, and automating responses across extensive IT environments.
This instructor-led, live training (available online or onsite) targets intermediate-level enterprise IT teams aiming to integrate AIOps solutions into their existing observability frameworks and operational processes.
Upon completing this training, participants will be capable of:
- Configuring and integrating Splunk, Moogsoft, and Dynatrace into a cohesive AIOps architecture.
- Correlating metrics, logs, and events across distributed systems through AI-driven analysis.
- Automating incident detection, prioritization, and response using both built-in and custom workflows.
- Optimizing performance, reducing Mean Time to Resolution (MTTR), and enhancing operational efficiency at an enterprise scale.
Course Format
- Interactive lectures and discussions.
- Extensive exercises and practical application.
- Hands-on implementation within a live-lab environment.
Customization Options
- To arrange a customized training session for this course, please contact us.
Implementing AIOps with Prometheus, Grafana, and ML
14 HoursPrometheus and Grafana are widely adopted tools for observability in modern infrastructure, while machine learning enhances these tools with predictive and intelligent insights to automate operations decisions.
This instructor-led, live training (online or onsite) is aimed at intermediate-level observability professionals who wish to modernize their monitoring infrastructure by integrating AIOps practices using Prometheus, Grafana, and ML techniques.
By the end of this training, participants will be able to:
- Configure Prometheus and Grafana for observability across systems and services.
- Collect, store, and visualize high-quality time series data.
- Apply machine learning models for anomaly detection and forecasting.
- Build intelligent alerting rules based on predictive insights.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
AI Agent Development with Mastra
14 HoursThis instructor-led, live training session (available online or onsite) is designed for intermediate software developers and engineering teams looking to build scalable, observable AI systems using Mastra.
By the end of this training, participants will be able to:
- Comprehend Mastra’s architecture and its integration with Large Language Models (LLMs) and external APIs.
- Architect and implement AI agents and workflows using TypeScript.
- Leverage Mastra’s observability and memory capabilities to monitor and enhance agent performance.
- Deploy production-grade AI applications by exploiting the full potential of Mastra’s framework features.
Managing Agent Workflows in Google Antigravity: Orchestration, Planning and Artifacts
14 HoursGoogle Antigravity serves as an agent-centric development platform designed to orchestrate, supervise, and coordinate AI-driven coding and automation processes.
This instructor-led training, available online or onsite, targets intermediate-level professionals eager to design, manage, and refine multi-agent workflows within the Google Antigravity environment.
Upon completing this course, participants will be equipped with the skills to:
- Configure agent responsibilities and orchestration pipelines using the Manager interface.
- Generate and interpret Antigravity artifacts, such as task lists, plans, logs, and browser recordings.
- Apply verification strategies to maintain transparency and auditability in agent actions.
- Enhance multi-agent collaboration for complex development and operational tasks.
Course Format
- Guided presentations alongside practical demonstrations.
- Scenario-based exercises addressing real-world workflow challenges.
- Hands-on experimentation within a live Antigravity workspace.
Course Customization Options
- For a customized version of this course, please contact us to discuss your specific needs.
Testing & Verifying Agent-Driven Code: Quality Assurance in Antigravity
14 HoursAntigravity is a framework that embodies advanced, agent-driven development workflows.
This instructor-led, live training, available either online or onsite, is designed for intermediate to advanced professionals seeking to verify, validate, and secure the output generated by AI agents operating in Antigravity-driven environments.
Upon completion of this training, participants will be able to:
- Evaluate the accuracy and safety of code artifacts produced by agents.
- Employ structured techniques to verify tasks executed by agents.
- Analyze browser recordings and effectively trace agent activities.
- Apply quality assurance and security principles to ensure the reliability of agent-driven workflows.
Course Format
- Instructor-guided technical briefings and discussions.
- Practical exercises focused on verifying real-world agent workflows.
- Hands-on testing and validation within a controlled lab environment.
Course Customization Options
- Scenarios, workflows, and testing examples can be adapted upon request.