New features and feature requests #87

C0untFloyd · 2023-08-12T16:43:02Z

C0untFloyd
Aug 12, 2023
Maintainer

Here I would like to collect and show planned features for roop unleashed.
My wishlist so far would be:

More configuration for blending faces (bbox manipulation, blur, erode etc). Get rid of pesky hairlines!
Other text masking options, perhaps adding slow but perfect masking for still images (Grounding Dino, SAM)
Cleanup and improve the processing workflow
Rewrite enhancer code. Especially codeformer re-invents the wheel currently with it's own face detection etc.
Improve the extras tab to have basic but handy video cutting functions, perhaps batch text-masking if so desired
Investigate on hair or complete zero-shot head traversal. One can dream...
Have a plugin system to support other deepfake models e.g. Ghost
Processing only parts of a video

lysxelapsed · 2023-08-13T12:12:50Z

lysxelapsed
Aug 13, 2023

Investigate on hair or complete zero-shot head traversal. One can dream...

not sure what you mean by that exactly, but sounds interesting ;-) can you elaborate further? excited about further config-options for blending etc., too!

well, you know what's on @anxiousottergames ' and my wishlist.. ;-p but like I already said: you focus on what you deem necessary, first. it's coming along nicely :-)

2 replies

C0untFloyd Aug 13, 2023
Maintainer Author

not sure what you mean by that exactly, but sounds interesting ;-) can you elaborate further?

With Deepface Lab you can swap a whole head but this needs a lot of training. Having the possibility to transfer at least hairstyles would be great I think...I toyed around with this but it's unusable slow.

lysxelapsed Aug 13, 2023

bummer.. maybe something faster will show up eventually.

Fijitrix · 2023-08-20T20:30:59Z

Fijitrix
Aug 20, 2023

Sounds good to me. I'm not sure if you had them listed in order of importance but I personally feel like even the order of the planned features listed is perfect.

0 replies

JeetGuhaThakurta · 2023-08-21T11:23:54Z

JeetGuhaThakurta
Aug 21, 2023

With Roop development now shut down, this is the only place where we can expect improvements on this nice tool. After a lot of tests, I figured out (at least within my test results) that the biggest flip side is the eye movement. It is totally missing in a video swap. The model seems to be looking straight forward, irrespective of the actions. Hence, the model looks like visually impaired or blind.

This is not noticeable in a still image swap because it is a still image, there is no action. But in a video, this lack of eye movement is very easily noticeable. You can tell within 4-5 seconds that the video is fake. Above all of the other enhancements discussed, I think we first need to focus on how to copy the eye-movements of the target to the output while we swap the face from the source.

Basically, the face will come from the source, we need to detect the eyeballs, copy the eyeball direction from the target (not using the source eyeball directions). That will add life to our end product. This technology is already there because Adobe Photoshop can do this. We can change eye direction in a photo using Photoshop. We have to do that same thing with the output image after detecting the direction from the target frames. But I am not sure if there is an open source project already available for this. A close match would be Gaze Correction, but it has limited features. It may be used as a start point.

0 replies

C0untFloyd · 2023-08-21T14:59:30Z

C0untFloyd
Aug 21, 2023
Maintainer Author

That's why I linked https://github.com/ai-forever/ghost which especially cared about gaze and emotions. From my limited testing it did this better than the rest but the face mask was lacking or rather looked too artificial.

3 replies

JeetGuhaThakurta Aug 21, 2023

What I am doing right now is, I am using multiple source images having various angles. This is not difficult to do, if we have a source video and we extract several source frames before the swapper is called from main. Then in the swapper, I am selecting the best match (from the source image set) based upon the face angle of the target frame. This is giving a better result, at least there are eye movements. But these movements are not fully realistic, because they are not entirely driven by the target. Some half-way solution until we discover a better approach. I will look into Ghost and see if we can borrow just the eye tracking part from there and add a new processor between the swapper and the enhancer to rectify the gaze. Thanks for the leads!

aripsam Sep 19, 2023

Can you share the code for this? Would love to try

C0untFloyd Sep 22, 2023
Maintainer Author

based upon the face angle of the target frame

This is actually quite a good idea, perhaps even better than the blending of several images.

patientx · 2023-08-21T16:27:27Z

patientx
Aug 21, 2023

can we get a fps changer (probably doable with ffmpeg even without reencoding) this is useful for some high resolution videos that are 60fps and because of this takes a lot of time but can be 30fps cutting processing time in half , maybe processing every second frame instead of every frame ?

Also like in stable diffusion extension can we select faces by number they appear on screen ? (1 and 3 for example out of 4 faces)

Both of these are especially useful for 180vr videos where a persons face appear two times but since they are in different angles app doesnt recognize both as the same almost always. This is not a problem if only one person is in the video just select all faces BUT if there are more than one person on screen we either need to change all faces or just one persons face on one side of the video.

0 replies

C0untFloyd · 2023-09-22T09:07:01Z

C0untFloyd
Sep 22, 2023
Maintainer Author

I tested Fast Segment-Anything yesterday and am quite disappointed. It's faster than the current Clip2Seg but surprisingly very bad when using low-resolution inputs. If it can identify objects, the resulting masks are way better though. I don't know if it's worth the hassle.

0 replies

lost-in-emotions · 2023-09-26T21:43:46Z

lost-in-emotions
Sep 26, 2023

Suggestion on the Side: What about an Upload Source for an Audio File (mp3) for "directly" a new Lip Sync? -> Overwrits the Lip Sync of the Target Video and Change it with the "Manuellextra located Audio File.

:-)

3 replies

C0untFloyd Sep 27, 2023
Maintainer Author

Hm...uh...I don't know. With an audiofile of the exact same size this is very easy of course but I don't want to open a can of worms, because of different audio formats, weird sizes, special ffmpeg handling. I'd rather leaves this to other great, specialized tools like this

lost-in-emotions Sep 27, 2023

but isnt it lossless-cut only a editing programm?

you can limit the usage data of the audio files in roop. f.e. so only mp3 is allowed, or something.

sadtalker is really nice, but only can lipsync to a single file/frame. i made my cosuin a birthday present with his face on a famous movie scene, where the actor sing a song. i used another song and cut it together in after effects. but the lip sync was - of course wrong (on the old/original audiofile. so here it would be perfekt if i could adjust the lipsync with an audio file of my desire

C0untFloyd Sep 27, 2023
Maintainer Author

but isnt it lossless-cut only a editing programm?

Yes and adding a custom audio track is editing. From its description:
Combine arbitrary tracks from multiple files (ex. add music or subtitle track to a video file)
Remove unneeded tracks
👉 Replace or re-encode only some tracks 👈
Extract all tracks from a file (extract video, audio, subtitle, attachments and other tracks from one file into separate files)

The idea with your cousin is really neat but I won't do it.

sordidloam · 2023-09-27T20:55:49Z

sordidloam
Sep 27, 2023

Controlnet?Sent from my iPhoneOn Aug 21, 2023, at 6:24 AM, JeetGuhaThakurta ***@***.***> wrote: With Roop development now shut down, this is the only place where we can expect improvements on this nice tool. After a lot of tests, I figured out (at least within my test results) that the biggest flip side is the eye movement. It is totally missing in a video swap. The model seems to be looking straight forward, irrespective of the actions. Hence, the model looks like visually impaired or blind. This is not noticeable in a still image swap because it is a still image, there is no action. But in a video, this lack of eye movement is very easily noticeable. You can tell within 4-5 seconds that the video is fake. Above all of the other enhancements discussed, I think we first need to focus on how to copy the eye-movements of the target to the output while we swap the face from the source. Basically, the face will come from the source, we need to detect the eyeballs, copy the eyeball direction from the target (not using the source eyeball directions). That will add life to our end product. This technology is already there because Adobe Photoshop can do this. We can change eye direction in a photo using Photoshop. We have to do that same thing with the output image after detecting the direction from the target frames. But I am not sure if there is an open source project already available for this. A close match would be Gaze Correction, but it has limited features. It may be used as a start point. —Reply to this email directly, view it on GitHub, or unsubscribe.You are receiving this because you are subscribed to this thread.Message ID: ***@***.***>

1 reply

C0untFloyd Sep 29, 2023
Maintainer Author

I don't know how or why ControlNet should be added to a face swapper? This is no diffuser and no inpainting so far...

marlonio · 2023-11-07T14:02:25Z

marlonio
Nov 7, 2023

More configuration for blending faces (bbox manipulation, blur, erode etc). Get rid of pesky hairlines!

I'm searching for the possibility to erode and blur the face mask but on the last roop-unleashed (3.3.4) I can't find the options, is that possible? I'm asking that 'cause i'm still obtain a slight ghost-box around my swapped face and i want to improve the results blurring the box a bit.

0 replies

Tolga077 · 2023-11-09T00:02:13Z

Tolga077
Nov 9, 2023

Since I had a potato computer, I integrated gpen 256 thinking it was better than nothing. But I couldn't get the Original/Enhanced image blend ratio part to work correctly. It only works between 0.90 and 1. When it goes below that, it adds strange blurs and distorts the image. what would be the reason ?

1 reply

lysxelapsed Nov 11, 2023

did you just replace the 512 model with the 256 version, or have you made changes to the code? my guess would be, that (at the very least) you have to adjust Enhance_GPEN.py at line 23 (model_path = resolve_relative_path('../models/GPEN-BFR-512.onnx')) and 34 (temp_frame = cv2.resize(temp_frame, (512, 512), cv2.INTER_CUBIC)). can't say if that's the only thing you'd have to adjust (probably not 😆) - you'd have to ask @C0untFloyd (still on vacation)

eobard1991 · 2023-11-16T13:01:15Z

eobard1991
Nov 16, 2023

@C0untFloyd quick question. I noticed the TEMP folder gets really chunky after extended use and keeps facesets duplicated from the UI. Can i periodically delete the contents of the temp folder?

1 reply

lysxelapsed Nov 20, 2023

yes you can, there's even a button for it in the settings tab.

C0untFloyd · 2024-01-05T10:10:50Z

C0untFloyd
Jan 5, 2024
Maintainer Author

Transfered here from the discussion started by @aripsam

Since this version is pretty stable and has been running well for a while now, I'm just wondering if there are any plans to add any more features? Anything in the pipeline?

Some features I can think of -

offset face sides, like how you have top and bottom
mask specific face areas like nose or lips (is kinda handled with the clip but wondering if there's anything new)
adjustable blending settings
saw something in the dev branch around face rotation to improve quality
for colab, would be nice to point to a path on gdrive to access the target file and save output file

1 reply

C0untFloyd Jan 5, 2024
Maintainer Author

Well, it's not dead yet but admittedly development slowed down a lot because of the holidays, a lot of real life work and I still did not fully recover from my leg injury. I merged the PR for auto rotation into the dev-branch and would like to release this into main but before that I would like to improve this to rotate every face instead of the whole image. This isn't quickly done, as the PR is huge and I want to get it right. I also integrated a VR mode for swapping faces in VR images/movies but this too needs some more work to cope with fisheye distortion etc.
Then there is the new iteration feature, where the same face is blended multiple times together to restore even more of the face identity. Up to now I did not get it to work and it gives me headaches.
2 more additional features I would like to have:

example images to see how new images in a faceset influence the swapping result. Preferably the example images should be loaded from a folder so you can supply your own.
a way to save and load a complete "swap set" containing all source and destination faces

About your requested features, the first 3 were probably inspired by Rope? 😄
When I implemented the face offsets I could have easily integrated the sides as well, This however would clutter the GUI even more and from my experiments it's mostly unnecessary + doesn't work very well. I checked out the latest Rope version some days ago and I really wasn't able to do a simple face swap with it because I couldn't find my way through the billions of settings there! I'm trying to prevent feature creep myself, although I'm not always successful.
Face parsing was requested a lot over time but what is the use case? I mean if e.g. half of the face is occluded then the face detection would fail in the first place and without face there's no face parsing. The good thing about clip is you're not limited to face parts.
Blending can already be adjusted but perhaps you're talking about blur amount and erosion? Similar to the previous, easy to include but hard to get right with the limited web framework.
Colab: several people were reporting that it's not working anymore, didn't try myself though. Useful suggestion, which could be added to the settings tab.

lost-in-emotions · 2024-01-05T17:22:33Z

lost-in-emotions
Jan 5, 2024

i also have a suggestion, maybe its caused by its limitation?
when i render portraits in SDXL with a face aprx. 50% of the image,
and then i use a face swap the result is very bad, very pixelated.
maybe its limited by the resolution?

1 reply

C0untFloyd Jan 5, 2024
Maintainer Author

Insightface Swapper Resolution. Please see the 2nd issue to ever grace this repo 😄 >>--> #2

C0untFloyd · 2024-01-05T22:21:57Z

C0untFloyd
Jan 5, 2024
Maintainer Author

Btw. I forked a new promising hairstyle transfer repo today, which would be a great addition to roop unleashed once it is working:
https://github.com/C0untFloyd/HairCLIPv2_UI

I can't get it to work so far though, not in Colab and not on my machine. Perhaps one of you can enlighten me, the problem is the dynamic compilation of python modules which I can't seem to get right. Also there is a lot of hopefully unnecessary Jupyter stuff in it, this would have to be removed for our own local needs.

0 replies

New features and feature requests #87

C0untFloyd Aug 12, 2023 Maintainer

Replies: 14 comments · 13 replies

C0untFloyd Aug 13, 2023 Maintainer Author

C0untFloyd Aug 21, 2023 Maintainer Author

C0untFloyd Sep 22, 2023 Maintainer Author

C0untFloyd Sep 22, 2023 Maintainer Author

C0untFloyd Sep 27, 2023 Maintainer Author

C0untFloyd Sep 27, 2023 Maintainer Author

C0untFloyd Sep 29, 2023 Maintainer Author

C0untFloyd Jan 5, 2024 Maintainer Author

C0untFloyd Jan 5, 2024 Maintainer Author

C0untFloyd Jan 5, 2024 Maintainer Author

C0untFloyd Jan 5, 2024 Maintainer Author

C0untFloyd
Aug 12, 2023
Maintainer

Replies: 14 comments 13 replies

C0untFloyd Aug 13, 2023
Maintainer Author

C0untFloyd
Aug 21, 2023
Maintainer Author

C0untFloyd Sep 22, 2023
Maintainer Author

C0untFloyd
Sep 22, 2023
Maintainer Author

C0untFloyd Sep 27, 2023
Maintainer Author

C0untFloyd Sep 27, 2023
Maintainer Author

C0untFloyd Sep 29, 2023
Maintainer Author

C0untFloyd
Jan 5, 2024
Maintainer Author

C0untFloyd Jan 5, 2024
Maintainer Author

C0untFloyd Jan 5, 2024
Maintainer Author

C0untFloyd
Jan 5, 2024
Maintainer Author