It could be many things.
1. What volume of data is under consideration in the Hive queries?
2. What file format is the data stored in?
3. How was the data prepared and loaded (sorting, partitioning, etc.)?
isn't enough information in your question to really give anyone a
single answer which will help you. You may have to explore a bit and
provide more details...
Yes, a single node has limitations.
It's not that it is intentionally deteriorating the performance, but
just that the system is designed for scaling through parallelism, and
you have just a single node, so you are limiting the abilities of the
software to scale (if that is what is needed)
Sandbox is meant for
tutorials and exploration of simple capabilities on small data. If you
want to try the actual HDP software on real data, you can install a
small multi-node cluster using the HDP installation processes documented