Member since
12-17-2017
13
Posts
1
Kudos Received
0
Solutions
03-07-2018
05:35 PM
You can use the Big SQL Sandbox: https://www.ibm.com/us-en/marketplace/big-sql Choose the option to try the free edition. Thanks.
01-14-2018
04:10 PM
@Mohammed Syam The error occurs when writing a file to HDFS: 'File /tmp/all_2-1515942428 could only be replicated to 0 nodes instead of minReplication (=1). There are 1 datanode(s) running and no node(s) are excluded in this operation.' Your HDFS space might be full (you only have 1 datanode), or your user might not have write permission on /tmp or /tmp/tpch-generate. Run hadoop dfsadmin -report to check the HDFS status and hdfs dfs -ls /tmp/* to check permissions. https://hadoop.apache.org/docs/r1.2.1/commands_manual.html#dfsadmin
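If reading the report by eye is awkward, the remaining space can be pulled out with a few lines of Python. This is only a sketch: it assumes the report text contains a summary line of the form "DFS Remaining: <bytes> (<human-readable size>)", and the function name and the sample text below are hypothetical, not taken from the cluster in question.

```python
import re


def parse_dfs_remaining(report_text):
    """Return the first 'DFS Remaining' byte count in the report, or None.

    Assumes a line such as 'DFS Remaining: 29642543104 (27.61 GB)'.
    """
    match = re.search(r"^DFS Remaining:\s*(\d+)", report_text, re.MULTILINE)
    return int(match.group(1)) if match else None


# Hypothetical sample text, made up only to show the assumed line format.
sample_report = (
    "Configured Capacity: 44716605440 (41.65 GB)\n"
    "DFS Remaining: 0 (0 B)\n"
    "DFS Used%: 100.00%\n"
)

remaining = parse_dfs_remaining(sample_report)
if remaining == 0:
    print("HDFS appears to be full; free some space before writing to /tmp.")
```

To use it on a real cluster, save the output of the report command to a file and pass its contents to parse_dfs_remaining. A result of None means the assumed line was not found, so check the raw report instead.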
01-12-2018
11:21 AM
@Jay Kumar SenSharma I followed your instructions, and here is what I got. I'm more of a UI person and not very familiar with command-line systems. The problem now is that when I connect on port 2222, I can't see the folder that contains the project. Would you please help with this? I appreciate the help.
[root@sandbox ~]# pwd
/root
[root@sandbox ~]# ls
anaconda-ks.cfg Desktop initial-setup-ks.cfg Public testbench
apache-maven-3.0.5 Documents Music start_scripts Videos
apache-maven-3.0.5-bin.tar.gz Downloads Pictures Templates
[root@sandbox ~]# ssh root@localhost -p 2222
root@localhost's password:
Last login: Fri Jan 12 11:17:06 2018 from 172.17.0.1
[root@sandbox ~]# pwd
/root
[root@sandbox ~]# ls
anaconda-ks.cfg build.out install.log sandbox.info start_hbase.sh
blueprint.json hdp install.log.syslog start_ambari.sh
[root@sandbox ~]# which hive
/usr/bin/hive
[root@sandbox ~]# echo $PATH
/usr/lib64/qt-3.3/bin:/usr/lib/jvm/java/bin:/usr/local/sbin:/usr/local/bin:/sbin:/bin:/usr/sbin:/usr/bin:/usr/hdp/current/falcon-client/bin:/usr/hdp/current/hadoop-mapreduce-historyserver/bin:/usr/hdp/current/oozie-client/bin:/usr/hdp/current/falcon-server/bin:/usr/hdp/current/hadoop-yarn-client/bin:/usr/hdp/current/oozie-server/bin:/usr/hdp/current/flume-client/bin:/usr/hdp/current/hadoop-yarn-nodemanager/bin:/usr/hdp/current/pig-client/bin:/usr/hdp/current/flume-server/bin:/usr/hdp/current/hadoop-yarn-resourcemanager/bin:/usr/hdp/current/slider-client/bin:/usr/hdp/current/hadoop-client/bin:/usr/hdp/current/hadoop-yarn-timelineserver/bin:/usr/hdp/current/sqoop-client/bin:/usr/hdp/current/hadoop-hdfs-client/bin:/usr/hdp/current/hbase-client/bin:/usr/hdp/current/sqoop-server/bin:/usr/hdp/current/hadoop-hdfs-datanode/bin:/usr/hdp/current/hbase-master/bin:/usr/hdp/current/storm-client/bin:/usr/hdp/current/hadoop-hdfs-journalnode/bin:/usr/hdp/current/hbase-regionserver/bin:/usr/hdp/current/storm-nimbus/bin:/usr/hdp/current/hadoop-hdfs-namenode/bin:/usr/hdp/current/hive-client/bin:/usr/hdp/current/storm-supervisor/bin:/usr/hdp/current/hadoop-hdfs-nfs3/bin:/usr/hdp/current/hive-metastore/bin:/usr/hdp/current/zookeeper-client/bin:/usr/hdp/current/hadoop-hdfs-portmap/bin:/usr/hdp/current/hive-server2/bin:/usr/hdp/current/zookeeper-server/bin:/usr/hdp/current/hadoop-hdfs-secondarynamenode/bin:/usr/hdp/current/hive-webhcat/bin:/usr/hdp/current/hadoop-mapreduce-client/bin:/usr/hdp/current/knox-server/bin:/usr/hdp/current/hadoop-client/sbin:/usr/hdp/current/hadoop-hdfs-nfs3/sbin:/usr/hdp/current/hadoop-yarn-client/sbin:/usr/hdp/current/hadoop-hdfs-client/sbin:/usr/hdp/current/hadoop-hdfs-portmap/sbin:/usr/hdp/current/hadoop-yarn-nodemanager/sbin:/usr/hdp/current/hadoop-hdfs-datanode/sbin:/usr/hdp/current/hadoop-hdfs-secondarynamenode/sbin:/usr/hdp/current/hadoop-yarn-resourcemanager/sbin:/usr/hdp/current/hadoop-hdfs-journalnode/sbin:/usr/hdp/current/hadoop-mapreduce-client/sbin:/usr/hdp/current/hadoop-yarn-timelineserver/sbin:/usr/hdp/current/hadoop-hdfs-namenode/sbin:/usr/hdp/current/hadoop-mapreduce-historyserver/sbin:/usr/hdp/current/hive-webhcat/sbin:/root/bin
12-24-2017
06:06 PM
1 Kudo
@Mohammed Syam If you want to compare the response times of select queries:
1. The Ambari Hive View does not display the execution time of a query, and its Logs tab shows no logs for the query either. The timings of these select queries cannot be compared programmatically, because no MapReduce job is initialized for them, and without an application ID there is nothing to compare.
2. Another way to get at least the execution time of a select query is to run it from the Hive command line (not from the Ambari Hive View). Open Hive from the command line and execute the select query. Once the query finishes, Hive shows at the bottom of the results how many rows were selected, along with the response time.
Example: hive# select * from text_table;
+----------------+----------------+------------------+----------------+------------------------+--+
| text_table.id | text_table.dt | text_table.name | text_table.ts | text_table.dtts |
+----------------+----------------+------------------+----------------+------------------------+--+
| 1 | 2017-10-10 | hcc | 12:00:00 | 2017-10-10 12:00:00.0 |
| 1 | 2017-10-11 | foo | 12:00:00 | 2017-10-11 12:00:00.0 |
| 1 | 2017-10-12 | foo | 12:00:00 | 2017-10-12 12:00:00.0 |
| 2 | 2017-11-10 | bar | 09:00:00 | 2017-12-23 09:00:00.0 |
+----------------+----------------+------------------+----------------+------------------------+--+
4 rows selected (0.105 seconds)
The last line of the snippet above shows that 4 rows were selected and that the execution time was 0.105 seconds. When you run a query from the Hive command line, Hive prints the number of rows returned and the execution time as the last line of the results. The Ambari Hive View does not display these statistics. The only way to compare select queries is to execute them from the Hive command line and compare the timings manually.
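If comparing by hand gets tedious, the trailer line could be read by a small script once the command-line output has been saved. The following is a minimal sketch, assuming the client prints a trailer in the form "N rows selected (S seconds)" as in the output above; the function names are hypothetical.

```python
import re

# Matches trailer lines such as "4 rows selected (0.105 seconds)".
TRAILER = re.compile(r"(\d+) rows? selected \((\d+(?:\.\d+)?) seconds\)")


def parse_trailer(output):
    """Return (row_count, seconds) from the last trailer in output, or None."""
    matches = TRAILER.findall(output)
    if not matches:
        return None
    rows, seconds = matches[-1]
    return int(rows), float(seconds)


def time_difference(output_a, output_b):
    """Return seconds(output_a) - seconds(output_b), or None if either is missing."""
    a, b = parse_trailer(output_a), parse_trailer(output_b)
    if a is None or b is None:
        return None
    return a[1] - b[1]


# The trailer line taken from the example output above.
print(parse_trailer("4 rows selected (0.105 seconds)"))
```

A negative result from time_difference means the first run was faster. A single run says little about a query, so it is probably worth repeating each query a few times before drawing conclusions.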
12-26-2017
10:11 AM
Why don't you refer to the following link? https://github.com/hortonworks/hive-testbench
12-19-2017
02:44 PM
1 Kudo
I have used Microsoft PolyBase with HDP.
HAWQ is available and supported by Pivotal.
SparkSQL comes with HDP and is fully supported.
JethroData is a major partner and runs well on HDP.
Apache Drill works with HDP, but is not supported.
TeraData is a partner and works with HDP.
Shark is old and deprecated.
Stinger is a name for Hive.
Presto works with HDP.
Progress has drivers for HDP.
IBM BigSQL is well integrated with HDP.
Hortonworks HDP is standard, unmodified Apache Hadoop, Apache Hive, Apache Phoenix, and Apache Spark SQL, and can run anything that works with regular Hadoop. The built-in engines are Apache Hive, Apache Phoenix, and Apache Spark SQL. BigSQL and Jethro are well-integrated partners. Apache Calcite is everywhere.