Exploring Next

Exploring Next: Ep 308 w/ Justy & Cody
Source article: marktechpost.com/2026/04/19/moonshot-ai-and-tsinghua-researchers-propose-prfaas-a-cross-datacenter-kvcache-architecture-that-rethinks-how-llms-are-served-at-scale

Justy and Cody unpack PRFaaS, a cross-datacenter KV-cache serving design proposed by researchers at Moonshot AI and Tsinghua. The design aims to cut redundant LLM inference work by treating prefill output as a reusable, networked asset rather than recomputing it in every region.
