Member since
05-19-2017
25
Posts
1
Kudos Received
0
Solutions
09-03-2019
05:26 PM
Hi @Shelton I followed your suggestion and modified HOSTS file in the C:\Windows\System32\drivers\etc on my Windows 7 machine where the VirtualBox/Sandbox is installed, it still prompts me the 502 error. Here are the hosts file: 127.0.0.1 localhost ::1 localhost 127.0.0.1 sandbox.hortonworks.com sandbox I used the Windows7 built-in Resource Monitor to inspect which application is using 8080, and it shows VirtualBox: Since VirtualBox is where the sandbox resides, there is no way for me to terminate it. How do I then access Ambari from localhost? Thank you very much if you can kindly help me to sort it out at your earliest convenience.
... View more
01-20-2019
02:29 PM
Is it possible to create a flow in NiFi that after all the data ingestion and ETL complete, the pipeline triggers a Jupyter Notebook directly to kick off the Data Science model on the completed dataset? Thank you.
... View more
Labels:
- Labels:
-
Apache NiFi
12-02-2018
03:40 PM
Impala rejected the change of:
ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.OpenCSVSerde'
... View more
12-02-2018
03:09 PM
doesn't work here, full script is as below:
CREATE TABLE sr.sr2013 (
creation_date STRING,
status STRING,
first_3_chars_of_postal_code STRING,
intersection_street_1 STRING,
intersection_street_2 STRING,
ward STRING,
service_request_type STRING,
division STRING,
section STRING )
ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
WITH SERDEPROPERTIES (
'colelction.delim'='\u0002',
'mapkey.delim'='\u0003',
'serialization.format'=',',
'field.delim'=',',
'skip.header.line.count'='1',
'quoteChar'= "\"") ;
... View more
07-11-2018
02:23 PM
Thank you @Krish E, did you sort it out now? I am having the same issue. What is your table's size?
... View more
02-12-2018
07:22 PM
I am doing some practices, for some questions, there could be various solutions, for example, I can use RDD operations to do some filtering, sorting, and grouping; with DataFrame and SparkSQL, it is even easier to me to get the same result. My question is will there be a requirement in the exam that some questions must be resolved using RDD, not DataFrame+SparkSQL. or vice versa? Thank you.
... View more
02-02-2018
08:20 PM
I am not able to list any sqoop jobs despite I run the following command during the sqoop job running time: sqoop job --list Here is a screenshot: Thank you.
... View more
Labels:
- Labels:
-
Apache Sqoop
01-30-2018
12:54 PM
1 Kudo
Thanks Mr. Jervis, There have been a lots complaint about the cluster/VM performance, do you have any update from the certification department, I know you have been talking to them about those complaints.
... View more