About mbigelow

VidyaSargur · ‎02-02-2023

@45, as this is an older post, you would have a better chance of receiving a resolution by starting a new thread. This will also be an opportunity to provide details specific to your environment that could aid others in assisting you with a more accurate answer to your question. You can link this thread as a reference in your new post.

Nandinin · ‎03-17-2021

This error shows up if you have selected Sentry/Ranger as dependencies but not checked true for the below config (i.e. did not enable Kerberos) kerberos.auth.enable

BigSpace · ‎10-01-2020

Hbase stores data as a sorted map by keys. HBase is considered a persistent, multidimensional, sorted map, where each cell is indexed by a row key and column key (family and qualifier). A rowkey, which is immutable and uniquely defines a row, usually spans multiple HFiles. Rowkeys are treated as byte arrays (byte[]) and are stored in a sorted order in the multi-dimensional sorted map. If you look for a row_key, Hbase is able to identify the node where this data is present. Hadoop runs its computation on the same node where the key is present and hence the performance with technologies like Spark is really good. This is called data localization.

HadoopHelp · ‎02-12-2020

Hi . any solution you found for same . i having the same issue the accessing the hive through python. Thanks HadoopHelp

AKR · ‎01-05-2020

Hi, This parameter spark.executor.memory (or) spark.yarn.executor.memoryOverhead can be set in Spark submit command or you can set it Advanced configurations. Thanks AKR

ABHIMAN · ‎12-04-2019

Do you have a documentation for this

raff0z · ‎07-25-2019

is a bit late but i post the solution that worked for me. the problem was the hostnames, impala with kerberos wants the hostnames in lowercase.

zuoseven · ‎07-04-2019

did you fixed it?

maziyar · ‎06-14-2019

The Spark 2 now is the only Spark that is supported by CDH 6.x so I am not sure you will get any reply here. Is there any reason you are still in Spark 1.6.x?

Lambzee · ‎06-06-2019

yarn logs -applicationId <application master ID> should help. It occurs typically due improper container memory allocation and physical memory availability on the cluster.

Online	Offline
Last Visited	‎03-25-2019 05:55 PM

Member Since	‎08-16-2016 08:51 PM
Last Visited	‎03-25-2019 05:55 PM
Posts	642
Kudos received	129

Cloudera Community

Re: Configuring the HDFS superuser in Kerberos

Re: Hive process crash

Re: Upgrade from CDH 5.11 Express to Enterprise

Re: Adding user to Cloudera Manager using REST AP...

Re: Running in non-interactive mode, and data appe...

Re: Error Connecting to Impala via HA Proxy Node

Re: Apache Kafka Failing To Start For First Time

Re: Hbase Vs MySQL database ( Hadoop ) vs (Convent...

Re: Hive sasl and python 3.5

Re: spark.yarn.executor.memoryOverhead

Re: Unattended/Fully automated deployment of a clu...

Re: Impala Catalog Server and Impala Daemons faili...

Re: Installation failed connection refused

Re: Spark 2.2 and Livy

Re: Map jobs are failing with exit code 143