Member since
09-29-2015
155
Posts
205
Kudos Received
18
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
8362 | 02-17-2017 12:38 PM | |
1334 | 11-15-2016 03:56 PM | |
1874 | 11-11-2016 05:27 PM | |
15357 | 11-11-2016 12:16 AM | |
3076 | 11-10-2016 06:15 PM |
10-31-2016
11:03 PM
HI @azeltov, I am trying to install R-studio on Hortonworks sandbox 2.5, running through the exception in verify installation step: initctl: Unable to connect to Upstart: Failed to connect to socket /com/ubuntu/upstart: Connection refused I have tried starting, stopping rstudio server, it shows the same message. PS: Since it is a docker container, 8787 port is not opened so I have configured /etc/rstudio/rserver.conf to use port 9000.
... View more
09-06-2016
12:36 PM
Hi, looks like simple error: I see s3a in your exception, but I think s3 or s3n should be there.
... View more
11-07-2016
03:28 PM
you shouldn't be seeing this on HDP2.5; everything needed to talk to S3A is on the classpath for spark already (we have done a lot of work on S3a performance for this release) Is the job actually failing, or is it just warning you that it couldn't create the s3a filesystem, but carrying on?
... View more
03-31-2016
06:58 PM
On the exam you should always use Ambari when possible, especially for tasks like enabling NameNode HA.
... View more
03-22-2017
04:27 AM
Hi Artem, I'm currently stuck in a particular use case where in I'm trying to access Hive Table data using spark.read.jdbc as shown below: export SPARK_MAJOR_VERSION=2 spark-shell import org.apache.spark.sql.{DataFrame, Row,SparkSession} val connectionProperties = new java.util.Properties() val hiveQuery = "(SELECT * from hive_table limit 10) tmp" val hiveResult = spark.read.jdbc("jdbc:hive2://hiveServerHostname:10000/hiveDBName;user=hive;password=hive", hiveQuery, connectionProperties).collect() But when I check for the results in hiveResult it's just empty. Could you please suggest what's going on here? I know we can access Hive tables using HiveSesssion and I've successfully tried that but is it possible to run hive queries and access Hive data using the above method?
... View more
04-01-2016
06:08 PM
1 Kudo
@eorgadn You should wrap the geoDistance functions as hive UDF’s it will be a lot friendlier for most people that will want to use it in hive.
... View more
03-08-2016
04:32 PM
2 Kudos
@Artem Ervits your suggestion worked. This is what i ran to get it to run on my sandbox : yum install -y numpy
... View more
04-25-2016
07:19 PM
Ancil, I have question regarding: hive.tez.container.size is multiple of yarn.scheduler.minimum-allocation-mb, why so? if yarn.scheduler.maximum-allocation-mb = 24GB, yarn.scheduler.minimum-allocation-mb = 4GB, hive.tez.container.size=5B, would not Yarn smart enough to assign 5GB to a container to satisfy tez needs? Thanks, Richard
... View more
08-24-2016
01:39 PM
@Alexander is there a full list of these hdi scripts available? If not how did you discover the ones above?
... View more