Support Questions

younggeun_park · ‎09-08-2016

Hi, All

I run spark-sql queries with a thrift server.

I know that if multiple sql queries are submitted through the thrift server each query would be run sequentially.

If many users want to query the table on a spark cluster over yarn at the same time, how these requested queries could be run concurrently?

The requested query do not update the table and just query

I have an idea that because a thrift server has dedicated executor cluster if multiple thrift servers are used multiple queries could be processed concurrently.

Is there any idea about this situation?

Thanks in advance.

Park.

myoung · ‎09-08-2016

@Young-Geun Park

Have you taken a look at http://spark.apache.org/docs/1.6.2/job-scheduling.html? Also, if you start the thrift server in yarn-client mode, you should be able to take advantage of YARN resource scheduling and queues.

https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.4.0/bk_installing_manually_book/content/startin...

View solution in original post

myoung · ‎09-08-2016

@Young-Geun Park

Have you taken a look at http://spark.apache.org/docs/1.6.2/job-scheduling.html? Also, if you start the thrift server in yarn-client mode, you should be able to take advantage of YARN resource scheduling and queues.

https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.4.0/bk_installing_manually_book/content/startin...

Cloudera Community

Support Questions

How multiple users run spark-sql query concurrently?

Spark (PySpark) to extract from SQL Server

Enable DoAs option Hive to allow users to runs que...

Apache NiFi user authentication + creation of mult...

JSON to SQL using Spark

Web interface for querying Spark SQL ?

Hive query to check mathematical values from multi...

Inability to execute multiple queries from the sam...

Run multiple queries on Hive / Phoenix?

Insert Into Multiple Partitions with one Query

user not found while running hive query from kerb...