Tag

memory optimization

1 articles

VideoMLA cuts video KV cache memory 92.7%

Research/May 29

VideoMLA cuts video KV cache memory 92.7%

VideoMLA compresses video diffusion KV caches with a shared low-rank latent and cuts per-token memory 92.7%.