Two-phase implementation: Phase 1 generates real, temporally coherent video on CPU using the correct AnimateDiff architecture. Phase 2 accelerates spatial denoising on Blackhole hardware using the ...