Virtual manufacturing environments need complex and accurate 3D human- computer interaction. One main problem of current virtual environments (VEs) is the heavy loads of the users both on cognitive aspect and motor operational aspect. The way of solving this problem is to augment machine’s cognitive capability. This paper first time put forward an idea of intent-driven VE software construction. It investigates multimodal intent delivery and intent understanding. A multimodal based intent-driven