Can I run the HDP Sandbox VM on our server for company-wide use with a huge amount of data?
Or is there an enterprise version that should be installed for that?
The HDP Sandbox is a single-node cluster meant for exploring, learning, and testing various components. Its purpose is not to provide an enterprise-level environment.
The Sandbox is a straightforward, pre-configured, learning environment that contains the latest developments from Apache Hadoop, specifically the Hortonworks Data Platform (HDP). The Sandbox comes packaged in a virtual environment that can run in the cloud or on your personal machine. The Sandbox allows you to learn and explore HDP on your own.
For processing huge amounts of data, it is better to create a multi-node cluster. The following doc is a good place to start: https://docs.hortonworks.com/HDPDocuments/Ambari-188.8.131.52/bk_ambari-installation/content/ch_Deploy_an...