arXiv cs.CV · Papers

SSM Meets Video Diffusion Models: Efficient Long-Term Video Generation with Structured State Spaces

arXiv:2403.07711v5 (Announce Type: replace)

Abstract: Given the remarkable achievements in image generation through diffusion models, the research community has shown increasing interest in extending these models to video generation. Recent diffusion models for video generation have predominantly utilized attention layer