Support Questions

younggeun_park · ‎09-08-2016

Hi, All

I run spark-sql queries with a thrift server.

I know that if multiple sql queries are submitted through the thrift server each query would be run sequentially.

If many users want to query the table on a spark cluster over yarn at the same time, how these requested queries could be run concurrently?

The requested query do not update the table and just query

I have an idea that because a thrift server has dedicated executor cluster if multiple thrift servers are used multiple queries could be processed concurrently.

Is there any idea about this situation?

Thanks in advance.

Park.

myoung · ‎09-08-2016

@Young-Geun Park

Have you taken a look at http://spark.apache.org/docs/1.6.2/job-scheduling.html? Also, if you start the thrift server in yarn-client mode, you should be able to take advantage of YARN resource scheduling and queues.

https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.4.0/bk_installing_manually_book/content/startin...

View solution in original post

myoung · ‎09-08-2016

@Young-Geun Park

Have you taken a look at http://spark.apache.org/docs/1.6.2/job-scheduling.html? Also, if you start the thrift server in yarn-client mode, you should be able to take advantage of YARN resource scheduling and queues.

https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.4.0/bk_installing_manually_book/content/startin...

Cloudera Community

Support Questions

How multiple users run spark-sql query concurrently?

Kafka producer running into multiple org.apache.ka...

Spark (PySpark) to extract from SQL Server

SPARK SQL query to modify values

Enable DoAs option Hive to allow users to runs que...

How to run spark sql in parallel?

JSON to SQL using Spark

Web interface for querying Spark SQL ?

Hive query to check mathematical values from multi...

Run multiple queries on Hive / Phoenix?

Insert Into Multiple Partitions with one Query