I downloaded an off line version of the new VQGAN+CLIP type AI that generates images from text input.
it will generate thumbnail or approx 300x300 sized images that clear up to something useful in about 30 mins or less even on a GTX 1070.
Here is some examples i generated.
Text input = Tall old fisherman’s shack in a coastal bay with rough ocean waves and an old row boat.
Cliffside rocks over grown with ivy and moss and a sea mist rolling in.
Some song lyrics input “looking down from Ethereal Skies
Silent crystalline tears I cry
For all must say their last goodbye -
Input text “imagining the unimaginable”
On a modern RTX card with tensor cores i think it might run at something like 300X the speed if it uses low precision compute.
Future improved versions might become really useful for generating larger and better images.
There is an online version up on google colab but its really a pain in the ass, it takes a long time to setup executing each codeblock one by one and for the non subscription users its limited to antiquated telsa K80 cards from the year 2014 and you can only use them for a few hrs a day until it blocks you…
Really not worth bothering with if you have your own PC.
Search for a program called clip app from here https://grisk.itch.io/clip-app
a very simple app version that just works.