You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Agree, but I'm not sure Apple's frameworks and libraries provide a way to work that directly with the latent space. Someone else who knows more than I do on this will hopefully give you a more definitive answer.
The BSRGAN module is ready to go. It is a drop-in replacement for the RealESRGAN that is used now, bu the UI needs a way to let the user select which to use, and that is on the to-do list.
@stuartjmoore offered to work on UI-centered upgrades. The heavy coding for this issue is already done, thanks to @vzsg with his build of BSRGAN that can directly substitute for the current RealESRGAN. (I renamed the BSRGAN file to RealESRGAN and copied it into the project, and it built and ran fine.)
So from here, implementing it means adding a new drop-down in Settings to let the user choose between the two upscalers (and maybe more in the future) and changing the actual code to allow the upscaler to be used to be set from the Settings. There is the question of whether this belongs here in Mochi or in ml-stable-diffusion, but I don't see the RealESRGAN model in their repo, so I assume it was added as part of assembling Mochi itself.
Running Latest Version
What do you want Mochi Diffusion to do?
While RealESRGAN is a common upscaler to use. I do like to have the option to utilize other models like BSRGAN for example.
Why do you think this should be added?
Theres a couple coreml versions here. They all render pretty quickly!
The text was updated successfully, but these errors were encountered: