In this one I combined two ControlNets (HED and depth) with an Initial Denoise Strength of 0.7. The result now looks a lot like me and stays quite close to the original sequence.
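For a rough feel of what a denoise strength of 0.7 means, here is a minimal sketch. It assumes the tool follows the common img2img convention, where the strength sets what fraction of the sampling schedule is actually run on the noised input frame. The function name `img2img_steps` and the step count of 50 are hypothetical, chosen only for illustration; the tool used for this sequence may behave differently.

```python
def img2img_steps(num_steps: int, denoise_strength: float) -> tuple[int, int]:
    """Return (skipped_steps, executed_steps) for an img2img run.

    Assumption: the common convention where a strength of 1.0 runs the
    full schedule (input frame is ignored) and 0.0 runs no steps at all
    (input frame is returned essentially unchanged).
    """
    if not 0.0 <= denoise_strength <= 1.0:
        raise ValueError("denoise_strength must be in [0, 1]")
    # Number of denoising steps that are actually executed.
    executed = min(int(num_steps * denoise_strength), num_steps)
    # Early steps that are skipped because the input frame stands in for them.
    skipped = num_steps - executed
    return skipped, executed


# Hypothetical example: 50 sampling steps at the strength used in this post.
skipped, executed = img2img_steps(num_steps=50, denoise_strength=0.7)
print(f"skipped={skipped}, executed={executed}")
```

Under that assumption, 0.7 leaves the model plenty of room to repaint each frame, while the HED and depth ControlNets hold the edges and the spatial layout close to the source footage.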
StableDiffusion ControlNets hed depth Multi-frame Video