I was thinking about this before and I thought of having a 3d physics engine tha...

zone411 · on Nov 17, 2022

This has been a thing for a while. For example, here are a couple random papers from 2017: https://openaccess.thecvf.com/content_cvpr_2017/html/Varol_L..., https://openaccess.thecvf.com/content_ICCV_2017_workshops/w2... or a newer one about deformable objects: https://arxiv.org/abs/2107.08898.

You can even use a robot to manipulate things in real life to create synthetic data for a neural net.

ilaksh · on Nov 17, 2022

Yes, I think the latest advances in ML and everything else will help to create those traditional simulations.

But I also think that for an AI to be really effective at, for example, answering questions about a video, it would basically need to run compressed versions of those simulations in a spatial-temporal-abstract latent space. That should be a better model than the textual space alone.