Member since
03-20-2016
21
Posts
5
Kudos Received
0
Solutions
06-02-2023
08:59 AM
how can I check for the largest file in a certain tenant in hadoop?
... View more
04-17-2016
06:59 PM
@Benjamin Leonhardi - This was indeed part of the reason. Thank you very much for your help!
... View more
03-29-2016
09:21 AM
Thanks a lot Benjamin - I did realise after posting the above that I needed a UDF to use with the rank function on its own. It's working now so thank you.
... View more
12-27-2016
12:47 PM
Hi all,
Has anyone a workaround for this problem ? I have exactly the same case.
I have similar issues on the Sandbox 2.5 (VirtualBox-5.1.12-112440-Win - HDP_2.5_virtualbox).
I killed
the jobs with putty as root : yarn application -kill
application_1482410373661_0002 but they are still visible on Ambari.
[root@sandbox ~]# yarn application -kill application_1482410373661_0002
16/12/24 12:26:40 INFO impl.TimelineClientImpl: Timeline service address: http://sandbox.hortonworks.com:8188/ws/v1/timeline/
16/12/24 12:26:40 INFO client.RMProxy: Connecting to ResourceManager at sandbox.hortonworks.com/172.17.0.2:8050
16/12/24 12:26:40 INFO client.AHSProxy: Connecting to Application History server at sandbox.hortonworks.com/172.17.0.2:10200
16/12/24 12:26:44 WARN retry.RetryInvocationHandler: Exception while invoking ApplicationClientProtocolPBClientImpl.getApplicationReport over null. Not retrying because try once and fail.
org.apache.hadoop.yarn.exceptions.ApplicationNotFoundException: Application with id 'application_1482410373661_0002' doesn't exist in RM.
I've found an issue corresponding :
Tez
client keeps trying to talk to RM even if RM does not know about the
application
https://issues.apache.org/jira/browse/TEZ-3156
This patch should be included as it was fixed for version 0.7.1
In the log (Ambary query) I can read 993 time :
INFO
: Map 1: 0/1 Reducer 2: 0/2
The query is the proposed in the tutorial : (
http://fr.hortonworks.com/hadoop-tutorial/hello-world-an-introduction-to-hadoop-hcatalog-hive-and-pig/#section_4)
<code>SELECT truckid, avg(mpg) avgmpg FROM truck_mileage GROUP BY truckid;
Any idea how to clear the history and restart without the running state ? Thanks in advance
... View more
03-21-2016
07:04 PM
1 Kudo
Ah - understood now. This worked! Thank you 🙂
... View more