Fastest way to make a 3dim numpy array?

Well, I’ve got the bones of my numpy cloth engine in place. It’s working surprisingly well. The speed is amazing. For the sake of processing really large meshes/really fine grids, I’m looking at multithreading and compiling.

I’m new to this part of programming so any help is appreciated.

Anyone out there know how to make numba work with blender?