Member since
07-01-2015
460
Posts
78
Kudos Received
43
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
1346 | 11-26-2019 11:47 PM | |
1304 | 11-25-2019 11:44 AM | |
9481 | 08-07-2019 12:48 AM | |
2183 | 04-17-2019 03:09 AM | |
3497 | 02-18-2019 12:23 AM |
08-23-2018
01:56 AM
Hi I had a similar problem but not related to the Query size, but the query list. There were queries executed on the cluster and not returned by the API. I have found out that I need to timefilter and offeset to parse all the paginated data.
... View more
08-23-2018
01:50 AM
This just says you dont have a permission with mapred user to that directory. Without knowing any details, one of the solution could be to add mapred user to a supergroup group on every worker node.
... View more
08-22-2018
09:24 AM
Edit: adding query timeout does not affect this behaviour: Configured Hue to 30sec timeout, but the query is waiting to be closed for more than 2 minutes... This is directly from the Query profile: Query Options (set by configuration): MEM_LIMIT=419430400,QUERY_TIMEOUT_S=30
Query Options (set by configuration and planner): MEM_LIMIT=419430400,QUERY_TIMEOUT_S=30,MT_DOP=0
... View more
08-22-2018
08:56 AM
One more observation: during the query "fetch time" the query on Impala daemons is reported as: "waiting to be closed" But has a state=FINISHED, First row fetched, Scan progress 100%. So my additional question is why is Impala not closing automatically the queries when the state is in "FINISHED"? Is this a configurable behaviour? Thanks!
... View more
08-22-2018
08:25 AM
Hi, in CDH 5.13 some queries have a very high client fetch time (executed from Hue) This one example here produced just 10 records, but was running for more than 3 hours. I think Hue does not close the fetch procedure, and Impala Daemon thinks the client will fetch for more.. Even though this does not make sense, Impala Daemon "knows" that 100% of records are sent to the client so why does not cancel or close it? Here are the selected query stats: Query Type: QUERY
Query State: FINISHED
Start Time: Aug 22, 2018 9:11:03 AM
End Time: Aug 22, 2018 12:26:40 PM
Duration: 3h, 15m
Rows Produced: 10
Admission Result: Admitted (queued)
Admission Wait Time: 5ms
Bytes Streamed: 353 B
Client Fetch Wait Time: 3.3h
Client Fetch Wait Time Percentage: 100
Connected User: hue/xxx
Estimated per Node Peak Memory: 32.0 MiB
File Formats: PARQUET/SNAPPY
HDFS Average Scan Range: 1.3 KiB
HDFS Bytes Read: 1.3 KiB
HDFS Bytes Read From Cache: 0 B
HDFS Bytes Read From Cache Percentage: 0
HDFS Local Bytes Read: 1.3 KiB
HDFS Local Bytes Read Percentage: 100
HDFS Remote Bytes Read: 0 B
HDFS Remote Bytes Read Percentage: 0
HDFS Scanner Average Read Throughput: 0 B/s
HDFS Short Circuit Bytes Read: 1.3 KiB
HDFS Short Circuit Bytes Read Percentage: 100
Impala Version: impalad version 2.10.0-cdh5.13.3 RELEASE (build 15a453e15865344e75ce0fc6c4c760696d50f626)
Out of Memory: false
Per Node Peak Memory Usage: 197.1 KiB
Planning Wait Time: 1ms
Planning Wait Time Percentage: 0
Pool: root.pool1
Query Status: OK
Session ID: 9647e779051c0b0b:302f01f9698839ba
Session Type: HIVESERVER2
Statistics Corrupt: false
Statistics Missing: true
Threads: CPU Time: 13ms
Threads: CPU Time Percentage: 78
Threads: Network Receive Wait Time: 0ms
Threads: Network Receive Wait Time Percentage: 0
Threads: Network Send Wait Time: 1ms
Threads: Network Send Wait Time Percentage: 11
Threads: Storage Wait Time: 1ms
Threads: Storage Wait Time Percentage: 11
I have couple of questions: - is this a problem on Impala or in Hue side? - the impala has idle_session_timeout=7200 configured. Why did not closed the IDaemon the session after 2 hours of inactivity? - is this hanging query occupying a "slot" in resource pools - affecting Max Running Queries in Impala admission control? (My observation is yes, just want to be sure) Thanks!
... View more
Labels:
- Labels:
-
Apache Impala
-
Cloudera Hue
08-03-2018
10:08 AM
The problem was that the response contained an unicode string, and client.py from commnon tried to cast it to str. The hotfix: awk 'NR==230{print " if objClass in [str]:\n return objClass(obj.encode(\"utf-8\"))"}1' /usr/lib/python2.7/site-packages/cloudera/director/common/client.py | sudo tee /usr/lib/python2.7/site-packages/cloudera/director/common/client.py
... View more
08-03-2018
03:47 AM
Hi, I am using cm_api to get the information about the newly deployed cluster from ClouderaDirector. My code was working until yesterday, and now sudenly it gives me this error: CDH Environment CDHTEST Environment Traceback (most recent call last): File "/tmp/get_cluster_info2.py", line 45, in <module> ( ci, clName ) = getClusterInstance( client ) File "/tmp/get_cluster_info2.py", line 42, in getClusterInstance ci = c.get( cdhenvName , cdhdepName , clName ) File "/usr/lib/python2.7/site-packages/cloudera/director/latest/ClustersApi.py", line 312, in get responseObject = self.apiClient.deserialize(response, 'cloudera.director.latest.models.Cluster') File "/usr/lib/python2.7/site-packages/cloudera/director/common/client.py", line 264, in deserialize subClass)) File "/usr/lib/python2.7/site-packages/cloudera/director/common/client.py", line 268, in deserialize attrType)) File "/usr/lib/python2.7/site-packages/cloudera/director/common/client.py", line 268, in deserialize attrType)) File "/usr/lib/python2.7/site-packages/cloudera/director/common/client.py", line 264, in deserialize subClass)) File "/usr/lib/python2.7/site-packages/cloudera/director/common/client.py", line 230, in deserialize return objClass(obj) UnicodeEncodeError: 'ascii' codec can't encode characters in position 5442-5445: ordinal not in range(128) cm_api version 19.1.1 cloudera director verison 2.8.0 The code: import re
from cloudera.director.latest import ClustersApi
from cloudera.director.latest import EnvironmentsApi
from cloudera.director.latest import DeploymentsApi
from cloudera.director.common.client import ApiClient
from cloudera.director.latest import AuthenticationApi
from cloudera.director.latest.models import Login
from cm_api.api_client import ApiResource, ApiException, API_CURRENT_VERSION
import ssl
from socket import socket
from subprocess import Popen, PIPE
from time import sleep
username='admin'
password='admin'
client = ApiClient('http://localhost:7189', tls_enabled=False, cafile=None, hostname_verification_enabled=True)
auth = AuthenticationApi(client)
auth.login(Login(username=username, password=password))
def getClusterInstance( apiclient ):
cdhenv=EnvironmentsApi(apiclient)
cdhenvName = cdhenv.list()[0]
print('CDH Environment '+cdhenvName)
cdhdep=DeploymentsApi(apiclient)
cdhdepName = cdhdep.list( cdhenvName )[0]
c=ClustersApi(apiclient)
clName = c.list( cdhenvName, cdhdepName )[0]
ci = c.get( cdhenvName , cdhdepName , clName )
return ( ci, clName )
( ci, clName ) = getClusterInstance( client )
-> Here it fails
... View more
Labels:
08-03-2018
12:06 AM
I resolved the issue, the problem was: - the listing of hvm instance types has to be in one row separated by comma (I dont know why the escape on new lines did not worked) Here is the working configuration: cat << EOF | sudo tee -a /var/lib/cloudera-director-plugins/aws-provider-1.5.0/etc/aws-plugin.confx
virtualizationMappings {
customMappingsPath: ec2.customvirtualizationmappings.properties
}
EOF
cat << EOF | sudo tee /var/lib/cloudera-director-plugins/aws-provider-1.5.0/etc/ec2.customvirtualizationmappings.propertiesx
hvm=c5.large,c5.xlarge,c5.2xlarge,c5.4xlarge,c5.9xlarge,c5.18xlarge, m5.large,m5.xlarge,m5.2xlarge,m5.4xlarge,m5.12xlarge,m5.24xlarge,c5d.large,c5d.xlarge,c5d.2xlarge,c5d.4xlarge,c5d.9xlarge,c5d.18xlarge,r5.large,r5.xlarge,r5.2xlarge,r5.4xlarge,r5.12xlarge,r5.24xlarge
EOF
cat << EOF | sudo tee -a /var/lib/cloudera-director-plugins/aws-provider-1.5.0/etc/aws-plugin.confx
ephemeralDeviceMappings {
customMappingsPath: ec2.customephemeraldevicemappings.properties
}
EOF
cat << EOF | sudo tee /var/lib/cloudera-director-plugins/aws-provider-1.5.0/etc/ec2.customephemeraldevicemappings.propertiesx
c5d.large=1
c5d.xlarge=1
c5d.2xlarge=1
c5d.4xlarge=1
c5d.9xlarge=1
c5d.18xlarge=2
EOF
... View more
08-01-2018
02:18 AM
I did configure the c5d instance types, but it is still failing: Process logs can be found at /root/.cloudera-director/logs/application.log
Plugins will be loaded from /var/lib/cloudera-director-plugins
Java HotSpot(TM) 64-Bit Server VM warning: ignoring option MaxPermSize=256M; support was removed in 8.0
Cloudera Director 2.8.0 initializing ...
Connecting to http://localhost:7189
Current user roles: [ROLE_READONLY, ROLE_ADMIN]
Configuration file passes all validation checks.
Creating a new environment...
Creating external database servers if configured...
Creating a new Cloudera Manager...
Unexpected internal error (see logs): {
"validationErrors" : [ "Incompatible AMI virtualization type. Instance type c5d.2xlarge does not support hvm virtualization type of AMI ami-3548444c." ],
"validationWarnings" : [ ],
"conditions" : [ {
"scope" : "aws.com.cloudera.director.aws.ec2.EC2Provider.template",
"key" : "image",
"exceptionInfo" : {
"@class" : "com.cloudera.launchpad.common.error.ErrorInfo",
"code" : [ "com.cloudera.launchpad.pluggable.common.ProviderErrorCode", "PROVIDER_EXCEPTION" ],
"properties" : {
"message" : "Incompatible AMI virtualization type. Instance type c5d.2xlarge does not support hvm virtualization type of AMI ami-3548444c."
},
"causes" : [ ],
"message" : "Incompatible AMI virtualization type. Instance type c5d.2xlarge does not support hvm virtualization type of AMI ami-3548444c."
},
"message" : "Incompatible AMI virtualization type. Instance type c5d.2xlarge does not support hvm virtualization type of AMI ami-3548444c."
} ],
"formatWarnings" : [ ]
}
[centos@ip-10-197-19-68 ~]$
[centos@ip-10-197-19-68 ~]$ cat /var/lib/cloudera-director-plugins/aws-provider-1.5.0/etc/ec2.customvirtualizationmappings.properties
hvm=c5d.xlarge,c5d.2xlarge,c5.large,m5.xlarge
[centos@ip-10-197-19-68 ~]$ cat /var/lib/cloudera-director-plugins/aws-provider-1.5.0/etc/ec2.ephemeraldevicemappings.properties
# (c) Copyright 2015 Cloudera, Inc.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# Comprehensive list of EC2 ephemeral device mappings at:
# http://docs.aws.amazon.com/AWSEC2/latest/UserGuide/InstanceStorage.html#StorageOnInstanceTypes
c5d.2xlarge=1
[centos@ip-10-197-19-68 ~]$
... View more