LLaMA-Mesh represents text and 3D meshes in a single format by writing the numerical values of a mesh's vertex coordinates and face definitions as plain text.
The model is trained end-to-end on interleaved text and 3D data.
This offers two key advantages:
- leveraging spatial knowledge already embedded in LLMs, derived from textual sources like 3D tutorials
- enabling conversational 3D generation and mesh understanding
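
To make the plain-text mesh representation concrete, here is a minimal sketch of how a mesh could be serialized to text and parsed back. The OBJ-style `v`/`f` line prefixes, the integer quantization into 64 bins, and the function names `mesh_to_text` and `text_to_mesh` are assumptions for illustration; they are not taken from the LLaMA-Mesh code.

```python
from typing import List, Tuple

Vertex = Tuple[float, float, float]
Face = Tuple[int, int, int]  # 0-based vertex indices


def mesh_to_text(vertices: List[Vertex], faces: List[Face], bins: int = 64) -> str:
    """Serialize a mesh as OBJ-style plain text (hypothetical helper).

    Coordinates are rescaled to integers in [0, bins - 1] so each value
    is a short string; the bin count is an assumed parameter.
    """
    lo = min(c for v in vertices for c in v)
    hi = max(c for v in vertices for c in v)
    scale = (bins - 1) / (hi - lo) if hi > lo else 0.0

    lines = []
    for v in vertices:
        q = [round((c - lo) * scale) for c in v]
        lines.append("v {} {} {}".format(*q))
    for f in faces:
        # OBJ face indices are 1-based.
        lines.append("f {} {} {}".format(*(i + 1 for i in f)))
    return "\n".join(lines)


def text_to_mesh(text: str):
    """Parse the text produced by mesh_to_text back into vertices and faces."""
    vertices, faces = [], []
    for line in text.splitlines():
        parts = line.split()
        if not parts:
            continue
        if parts[0] == "v":
            vertices.append(tuple(int(p) for p in parts[1:4]))
        elif parts[0] == "f":
            # Convert back to 0-based indices.
            faces.append(tuple(int(p) - 1 for p in parts[1:4]))
    return vertices, faces


# Usage: a tetrahedron with four vertices and four triangular faces.
tetra_vertices = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)]
tetra_faces = [(0, 1, 2), (0, 1, 3), (0, 2, 3), (1, 2, 3)]
tetra_text = mesh_to_text(tetra_vertices, tetra_faces)
print(tetra_text)
```

Because the result is ordinary text, it can be placed in the same token stream as a prompt or a reply, which is what makes the interleaved text and 3D training data possible.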
NVIDIA LLaMA-Mesh project:
Blender add-on MeshGen, built on LLaMA-Mesh: