I kind of don’t see all the grandeur in those results, I’m sorry, I must be some kind of dumb-dumb. What I see is a lot of foot sliding and generally poorly resolved foot movement, especially in faster motions like running.
These kinds of movements (including the aforementioned problems) I can get for free from the CMU mocap library right now. What do I need machine learning for?
This is pretty basic research to find out whether it works at all.
The novelty of their work is using a diffusion model to show the flexibility of the approach. They show it is overall quite stable and allows unconditional generation as well as various conditional generation tasks, like text-to-motion and action-to-motion (they also discuss motion in-betweening).
There are other projects closer to production-ready that usually incorporate better, more physically plausible data; better, more consistent rigs; and special treatment of contacts (like ground contact) to prevent sliding.
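To make the "special treatment of contacts" concrete: a common cleanup pass detects frames where a foot is near the ground and barely moving, then pins its horizontal position until the foot lifts again. This is a minimal toy sketch of that idea (not the method of any particular project; the thresholds `GROUND_EPS` and `VEL_EPS` are assumed tuning parameters):

```python
GROUND_EPS = 0.02   # assumed: max height (m) to count as ground contact
VEL_EPS = 0.01      # assumed: max per-frame horizontal travel while planted

def pin_foot_contacts(traj):
    """traj: list of (x, y, z) foot positions per frame, y = height.
    Returns a new trajectory with horizontal sliding removed during contact."""
    out = []
    anchor = None   # (x, z) where the current contact started
    prev_in = None  # previous *input* frame, for velocity estimation
    for x, y, z in traj:
        moving = (prev_in is not None and
                  abs(x - prev_in[0]) + abs(z - prev_in[2]) > VEL_EPS)
        px, pz = x, z
        if y <= GROUND_EPS and not moving:
            if anchor is None:
                anchor = (x, z)   # touchdown: remember the spot
            px, pz = anchor       # pin horizontal position while planted
        else:
            anchor = None         # foot lifted (or fast): release the pin
        out.append((px, y, pz))
        prev_in = (x, y, z)
    return out
```

For example, a foot that drifts a few millimetres per frame while "planted" gets snapped back to its touchdown position, which is exactly the sliding artifact visible in the results above. Real pipelines do this per foot joint and blend the correction back through IK rather than hard-snapping.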
Naturally, this paper doesn’t produce the best-quality results, but that wasn’t its goal either.
So all of this is image-type AI stuff, it seems. Are there any good/free audio applications?
Basically, what I’d like is a text-to-speech system that creates a natural-sounding voice track, ideally with various accents and inflections, etc., so it sounds like a normal person talking/acting rather than a robot or Stephen Hawking.
I guess that shows some potential, but yeah, nothing really usable for the average person yet.
Putting aside the bad lip-sync, etc., the actual voice isn’t bad, assuming that was text-to-speech and not uploaded audio. The free trial works for some basic audio tests, but there’s no real control over the speech (what to emphasize, pause on, or stretch out), and it’s more geared towards video output. The pricing could also be an issue over time.
Sorry, but if you feed it some renders of basic geometry with enough perspective and visual cues, you can not only art-direct the AI, but also use the result together with the original scene as a modeling reference.
Unfortunately I don’t have that much time to bring a project from start to finish (maybe in the coming months),
but for me this is, imho, a biggie. I’m really surprised how well Stable Diffusion keeps to the perspective input.
Not as real-time 3D graphics, if that was the case. But because they used a motion-control camera and car platform, it would be possible to take the pre-planned camera and platform motions into Blender and render the background animation as a video for a virtual-studio LED screen. Anyway, in this particular case the action is planned and timed precisely beforehand, so it doesn’t require an Unreal Engine setup that tracks a manually operated camera in real time during the actual shoot. Unreal Engine or other software could then be used just for video playback.
So the same result is doable in Blender for the background video.
Edit: Based on the behind-the-scenes part of the video, they tracked and recorded handheld camera motion beforehand for the animatic. The camera motion for the actual shoot is most likely not exactly the same as the handheld version, but cleaned up and modified afterwards, because the motion-control camera needed to move very precisely into the car through the windows, and the windows weren’t there when the handheld camera work for the animatic was recorded.