llom2600 15 hours ago

I've been involved w/ cloud infrastructure for the past few years (mostly around optimizing cold start for large models, distributed systems, etc). Haven't went too deep into building applications with LLMs. Over the past week I tried building with BAML/MCP Servers etc. Thought I'd share what I learned in this repo - would love some thoughts on the architecture.