AI Guide to the Galaxy Episode 2: Running Local LLMs with Docker Model Runner

In this episode of AI Guide to the Galaxy, Principal Engineer Jacob Howard joins Oleg for a deep dive into Docker Model Runner, the open-source tool that brings large language models (LLMs) to your local Docker CE and Docker Desktop environments.

We cover:
✅ How to install and run Docker Model Runner on Docker CE
✅ GPU vs. CPU support, container-based architecture & runtime logic
✅ Running LLMs in CI environments like GitHub Actions
✅ Benchmarking performance on lightweight setups (e.g. 1 CPU, 16GB RAM)
✅ Selecting model sizes and quantizations for your hardware
✅ Using Model Runner in Kubernetes and Google Cloud Run
✅ Debugging with logs, Docker Desktop's new request inspector, and OpenAI API compatibility
✅ Upcoming features: vLLM backend support, multimodal inference, and more

Whether you're building agentic applications, experimenting with LLMs on a Raspberry Pi, or deploying production-scale AI in Kubernetes, this episode will guide you through it.

🧠 Mentioned Tools & Concepts:
- Docker CE & Docker Desktop
- Docker Model Plugin
- llama.cpp
- Quantized LLMs & VRAM sizing
- OpenAI-compatible APIs
- OCI model artifacts
- Kubernetes YAML for AI workloads

🔗 GitHub Repos:
model-runner: https://github.com/docker/model-runner
model-cli: https://github.com/docker/model-cli
model-distribution: https://github.com/docker/model-distribution


👉 Don’t forget to like, subscribe, and hit the bell to stay in the loop for upcoming episodes!

#docker #ai #podcast #dockermodelrunner #llm #genai #softwaredevelopment #devops #aidevelopment #dockerpodcast
