Recent posts

Building an LLM Chat Application

2 minute read

We walk through building a modern AI chat application that supports both OpenAI and local LLM models, with Kubernetes deployment and GPU acceleration.