Exploring Next
Exploring Next — Ep 308 w/ Justy & Cody — marktechpost.com/2026/04/19/moonshot-ai-and-tsinghua-researchers-propose-prfaas-a-cross-datacenter-kvcache-architecture-that-rethinks-how-llms-are-served-at-scale
Justy and Cody unpack PRFaaS, a cross-datacenter KV-cache serving design from Moonshot AI and Tsinghua that aims to cut wasted LLM inference work by treating the KV caches produced by prefills as reusable assets shared over the network instead of recomputing them in every region.