Submitted by Hancheng Ye 4 KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems Duke Center for Computational Evolutionary Intelligence (CEI) 159 2
2 FlashSVD: Memory-Efficient Inference with Streaming for Low-Rank Models Duke Center for Computational Evolutionary Intelligence (CEI)