
Voice replacement is getting faster to train, but it actually seems to be getting worse at identifying pitch and key.

There's still an issue with reverb/echo and doubled vocals. The only way I was able to make this passable was to find pre-separated vocals, and even then it struggled with pitch drift, so I had to rerecord parts of it.

Still, I trained these in so-vits-svc for about 2 hours each on a 3080 Ti. I spent more time producing it than the AI needed to completely replace someone's voice with someone else's.

Combining these with deepfakes/wav2lip can give some damn good results. If anyone wants some guidance on the process for voice replacement, I can certainly share anything I've picked up along the way.

this post was submitted on 12 Jun 2023