MLOps Training Documentation

Complete study guide for MLOps infrastructure and deployment

Study Path Overview

This documentation follows a hybrid structure:

  • docs/ (you are here) - Conceptual learning and theory
  • module-X/ - Hands-on labs and practice code

Quick Start

  1. Choose your module below
  2. Read the conceptual documentation in docs/module-X/
  3. Practice with labs in module-X/ folder
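
The pairing between docs/ and module-X/ in the steps above can be sketched as a small shell helper. This is only a sketch: it assumes each topic's guide is a README.md under docs/module-NN/<topic>/ (the pattern used in the example study workflow in this guide) and that the lab sits at module-NN/<topic>/. The function name `study_paths` is hypothetical and not part of the repository.

```bash
# Hypothetical helper: print the guide and lab paths for one module topic.
# Assumes the layout docs/module-NN/<topic>/README.md and module-NN/<topic>/.
study_paths() {
  module="$1"   # two-digit module number, e.g. 01
  topic="$2"    # topic folder name, e.g. git
  printf 'guide: docs/module-%s/%s/README.md\n' "$module" "$topic"
  printf 'lab:   module-%s/%s/\n' "$module" "$topic"
}

study_paths 01 git
```

Run from the repository root, the printed guide path can be passed to `cat` and the lab path to `cd`.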

Module 1: Infrastructure & Prerequisites

Goal: Master Git, AWS, Kubernetes, and Terraform fundamentals for MLOps

Study Path

| Order | Topic | Description | Lab Location |
|-------|-------|-------------|--------------|
| 1 | Git for Teams | Version control, branching strategies, and team collaboration | module-01/git/ |
| 2 | AWS Cloud Services | Cloud services, security, networking, and AI/ML | module-01/aws/ |
| 3 | Kubernetes | Container orchestration for production workloads | module-01/k8s/ |
| 4 | Terraform | Infrastructure as Code fundamentals | module-01/terraform/ |

Module 1 Documentation

Git for Teams:

AWS Cloud Services:

  • AWS Overview Guide - Complete AWS CLF-C02 reference
    • Cloud Concepts & Security
    • Core Services (Compute, Storage, Database, Networking, Analytics)
    • AI/ML Services
    • Deployment Methods
    • Billing & Pricing
    • LocalStack Practice Guides

Kubernetes:

Terraform:

Lab Locations

| Lab | Description | Location |
|-----|-------------|----------|
| Git for Teams | Git practice exercises and examples | module-01/git/ |
| LocalStack | AWS services practice locally | module-01/aws/localstack/ |
| Kubernetes | K8s hands-on practice | module-01/k8s/ |
| Terraform Basics | Infrastructure as Code fundamentals | module-01/terraform/basics/ |
| Terraform Examples | Example configurations | module-01/terraform/examples/ |
| Terraform Exercises | Practice exercises | module-01/terraform/exercises/ |

Module 2: Model Deployment

Coming soon - Batch API deployment with FastAPI

Lab Location: module-02/batch-api/


Module 3: Deployment and Operation

Goal: Implement automated testing, CI/CD pipelines, and monitoring

Study Path

| Order | Topic | Description | Lab Location |
|-------|-------|-------------|--------------|
| 1 | Testing | Unit, integration, and contract testing | module-03/testing/ |
| 2 | CI/CD | GitHub Actions workflows and pipelines | module-03/cicd/ |
| 3 | Monitoring & Observability | Grafana LGTM+P stack | module-03/monitoring/ |

Module 3 Documentation

Testing:

CI/CD:

Monitoring & Observability:

Lab Locations

| Lab | Description | Location |
|-----|-------------|----------|
| Testing | Unit, integration, contract testing | module-03/testing/ |
| CI/CD | GitHub Actions workflows | module-03/cicd/github-actions/ |
| Monitoring | Grafana LGTM+P stack demo | module-03/monitoring/ |

Study Tips

For Each Module

  1. Read first - Start with the conceptual guide in docs/
  2. Practice second - Run the lab exercises in module-X/
  3. Experiment - Modify configurations and observe changes
  4. Review - Re-read documentation with practical context

For Hands-on Skills

  1. Complete all lab exercises - Don't skip!
  2. Break things intentionally - Learn to troubleshoot
  3. Build variations - Modify exercises to solve new problems
  4. Document your learnings - Keep notes

Example Study Workflow

```bash
# 1. Read the conceptual guide (Git for Teams), starting from the repository root
cat docs/module-01/git/README.md

# 2. Navigate to the lab
cd module-01/git

# 3. Practice Git workflows
# Create a practice repository, branches, merges, etc.

# 4. Read the AWS guide (the path is now relative to module-01/git)
cat ../../docs/module-01/aws/README.md

# 5. Navigate to the lab
cd ../aws/localstack

# 6. Start the lab environment
docker compose up -d

# 7. Practice the exercises
aws --endpoint-url=http://localhost:4566 s3 mb s3://my-bucket

# 8. Clean up (-v also removes the lab's volumes)
docker compose down -v
```
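
Retyping the endpoint flag on every exercise command is easy to get wrong, so a small wrapper may help. The sketch below assumes LocalStack listens on http://localhost:4566, as in the workflow above. The names `aws_local` and `LOCALSTACK_ENDPOINT` are hypothetical and are not defined by the labs.

```bash
# Hypothetical wrapper so the LocalStack endpoint is not retyped on every call.
# Assumes LocalStack listens on http://localhost:4566 unless overridden.
LOCALSTACK_ENDPOINT="${LOCALSTACK_ENDPOINT:-http://localhost:4566}"

aws_local() {
  # Forward all arguments to the AWS CLI, pointing it at LocalStack.
  aws --endpoint-url="$LOCALSTACK_ENDPOINT" "$@"
}

# Example usage (requires the lab environment to be running):
#   aws_local s3 mb s3://my-bucket
#   aws_local s3 ls
```

The AWS CLI generally expects credentials and a region to be configured even when it talks to a local endpoint. Placeholder values are commonly used for this, but check the lab's own setup notes before relying on that.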

Progress Tracking

Track your progress by checking off completed modules:

Module 1: Infrastructure & Prerequisites

  • [ ] Git for Teams (Basics, Branching Strategies, Collaboration)
  • [ ] AWS Cloud Services (Core Services, Security, AI/ML)
  • [ ] Kubernetes (Core Objects, Workloads, Storage, Networking)
  • [ ] Terraform Basics
  • [ ] LocalStack Practice Labs

Module 2: Model Deployment

  • [ ] Batch API with FastAPI
  • [ ] Model Deployment Patterns

Module 3: Deployment and Operation

  • [ ] Testing (Unit, Integration, Contract)
  • [ ] CI/CD Pipelines (GitHub Actions)
  • [ ] Monitoring & Observability (Grafana LGTM+P)
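
If the checklist above is copied into a personal notes file, progress can be tallied from the command line. This is a minimal sketch, assuming open items contain `[ ]` and completed items are marked `[x]`. The function name `count_progress` and the file name in the usage comment are hypothetical.

```bash
# Hypothetical progress counter for a checklist saved in a local notes file.
# Assumes open items contain "[ ]" and completed items contain "[x]".
count_progress() {
  file="$1"
  # grep -c prints 0 but exits non-zero when nothing matches, hence "|| true".
  done_count=$(grep -c '\[x\]' "$file" || true)
  open_count=$(grep -c '\[ \]' "$file" || true)
  printf '%s of %s items complete\n' "$done_count" "$((done_count + open_count))"
}

# Example usage with a hypothetical notes file:
#   count_progress my-progress.md
```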

Last Updated: January 2026

Released under the MIT License.