lYttNavMJN
r/ChatGPTCoding • AdditionalWeb107
archgw (0.3.20) - Sometimes a small release is a big one: ~500 MB of Python deps gutted out.
archgw (a models-native sidecar proxy for AI agents) offered two capabilities that required loading small LLMs in memory: guardrails to prevent jailbreak attempts, and function-calling for routing requests to the right downstream tool or agent.
Voice: OpenAI TTS
Transcript
Host A Welcome back to Exploring Next! Today we're looking at reddit.com/r/ChatGPTCoding/s/lYttNavMJN.
Host B Yeah, this one caught our eye because it's the archgw 0.3.20 release post from AdditionalWeb107 on r/ChatGPTCoding, and the pitch is that sometimes a small release is a big one: about 500 MB of Python deps gutted out.
Host A So the big idea is that the small LLMs behind archgw's guardrails and function-calling are loaded as a separate out-of-process server via ollama/llama.cpp, which are built in C++/Go.
Host B What stood out to me is the goal behind it: you can build AI agents in any language or framework and offload the plumbing work in AI (like agent routing/hand-off, guardrails, zero-code logs and traces, and a unified API for all LLMs) to a durable piece of infrastructure, deployed as a sidecar.
Host A If you're curious, give the original a read: https://www.reddit.com/r/ChatGPTCoding/s/lYttNavMJN.
Host B And let us know what you try next!