Whisper GUI 0.1 (Patreon)
Content
Edit: Fixed the link with the missing license.txt, thanks Anthony for figuring it out.
Hi all, took a few slower days after the news eve, to try and relax my mind a little, still been working, but more limited and with more limited access to the internet.
Now I'm start going full steam again, to start I'm releasing a new GUI, this one is a lot simpler than Stable, it will not consume much of my time, in fact this first version have most of the stuff it will have in the final build.
Link:
https://drive.google.com/file/d/1gRXwPVtw9jL1J7cqUV9ruHHKlzrQzH-D/view?usp=share_link
Mirror:
https://grisk.itch.io/whisper-gui
Whisper is a AI by OpenAI that:
Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification.
https://openai.com/blog/whisper/
https://github.com/openai/whisper
That is, it can generate subtitles for videos and audios on multiple languages. It also allow to translate that subtitle to English after if you like.
Here are two examples I did:
https://streamable.com/bzjkcp
https://streamable.com/o17pts
There are multiple models to select, it will download them if you don't have it already.
If you have more than 10Vram, you will always want to use Large-V2
If not, use the larger model you can. If your input is in english, use the ".en" version.
Tomorrow I will release this version on Itchio, it will also be easier to download it from their servers.
This GUI I plan to always keep it free on itchio with the latest update.
That is because I think that Whisper can really help people with Auditory disability and people from other countries that need to learn from a foreign country (once the global translation is working on it)
About Stable Diffusion:
The next version should be ready this week, with a few new options and bug fixes. Now I will start answering some comments that been waiting for a reply for to long, sorry about that.