1973
Posts
1225
Kudos Received
124
Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| 1913 | 04-03-2024 06:39 AM | |
| 3011 | 01-12-2024 08:19 AM | |
| 1642 | 12-07-2023 01:49 PM | |
| 2419 | 08-02-2023 07:30 AM | |
| 3360 | 03-29-2023 01:22 PM |
02-27-2018
09:34 PM
yes by default. You can change ports as well in ambari https://nifi.apache.org/docs/nifi-docs/html/administration-guide.html https://docs.hortonworks.com/HDPDocuments/HDF3/HDF-3.1.0/bk_installing-nifi/content/ch02s04.html or it could be 8443
... View more
02-27-2018
08:44 PM
1 Kudo
This is for people preparing to attend my talk on Deep Learning at DataWorks Summit Berling 2018 (https://dataworkssummit.com/berlin-2018/#agenda) on Thursday April 19, 2018 at 11:50AM Berlin time. See: https://community.hortonworks.com/content/kbentry/174399/apache-deep-learning-101-using-apache-mxnet-on-apa.html To do proper analytics and provide fast SQL access to our inception data generated by Apache MXNet from our images, we need to land it into Apache Hive Transactional tables. We will use the Apache NiFi PutHiveStreaming processor to insert data into our ACID table at a rapid rate. This only works if you create a transactional table with Apache ORC, see the DDL below. You must also be running a new version of HDP 2.6+ that has ACID turned on. Tip: In HDP 2.6.4, you will need to create and work with Apache Hive ACID tables with Hive. Not sql in Apache Zeppelin, since that is Apache Spark. jdbc(hive) is Apache Hive. See the configuration below to hive CBO and TEZ enabled as well. Ambari View of Hive SQL DDL %jdbc(hive)
CREATE TABLE `inception`(
uuid STRING, top1pct STRING, top1 STRING, top2pct STRING, top2 STRING, top3pct STRING, top3 STRING, top4pct STRING, top4 STRING, top5pct STRING, top5 STRING, imagefilename STRING,
runtime STRING)
CLUSTERED BY ( top1)
INTO 3 BUCKETS
ROW FORMAT SERDE
'org.apache.hadoop.hive.ql.io.orc.OrcSerde'
STORED AS INPUTFORMAT
'org.apache.hadoop.hive.ql.io.orc.OrcInputFormat'
OUTPUTFORMAT
'org.apache.hadoop.hive.ql.io.orc.OrcOutputFormat'
TBLPROPERTIES ( 'transactional'='true')
%jdbc(hive)
select * from inception The PutHiveStreaming processor requires that you have a table that is bucketed, uses Apache ORC and you have permissions. See the example above for a table DDL to use. You also need ACID and LLAP enabled on your Apache Hive cluster. Details for PutHiveStreaming Processor An Example Apache MXNet to Hive Streaming View The Hive View 2.0 of the Data Apache Zeppelin Table DDL and Query
... View more
Labels:
02-27-2018
06:59 PM
The table needs to be created properly with permissions Example Table DDL CREATE TABLE `inception`(
uuid STRING, top1pct STRING, top1 STRING, top2pct STRING, top2 STRING, top3pct STRING, top3 STRING, top4pct STRING, top4 STRING, top5pct STRING, top5 STRING, imagefilename STRING,
runtime STRING)
CLUSTERED BY ( top1)
INTO 3 BUCKETS
ROW FORMAT SERDE
'org.apache.hadoop.hive.ql.io.orc.OrcSerde'
STORED AS INPUTFORMAT
'org.apache.hadoop.hive.ql.io.orc.OrcInputFormat'
OUTPUTFORMAT
'org.apache.hadoop.hive.ql.io.orc.OrcOutputFormat'
TBLPROPERTIES ( 'transactional'='true')
... View more
02-27-2018
06:48 PM
In order for Hive Streaming to work the following has to be in place:
Table is stored as ORC Transactional Property is set to “True” The Table is Bucketed
... View more
02-27-2018
06:46 PM
The Nifi user may not have permissions to the /apps/hive/warehouse directory What is it's permissions? What is your Hive local scheme? Does NiFi users have HDFS read/write permissions?
... View more
02-27-2018
06:14 PM
Is your cluster kerberized? Can you access Hive from the command line of that machine? That article is a bit old. can you post any log messages. The HDF version of NiFi is configured just for this and the addition of ambari makes everything easier. https://docs.hortonworks.com/HDPDocuments/HDF3/HDF-3.1.0/bk_installing-hdf-on-hdp/content/upgrading_ambari.html
... View more
02-27-2018
05:29 PM
Ignore unmatched fields for both attributes check the names quote column identifiers -true translate field names true
... View more
02-27-2018
05:05 PM
see https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.6.4/bk_data-access/content/ch_hbase_bar.html
... View more
02-27-2018
04:56 PM
See this: https://community.hortonworks.com/questions/45426/convertjsontosql-problemsjsontosql-issues.html Could be this: What is the "Translate Field Names" property of ConvertJSONtoSQL set to? If your columns are identical to the json field names, be sure to set that property to false. None of the fields are matching by name: https://github.com/apache/nifi/blob/master/nifi-nar-bundles/nifi-standard-bundle/nifi-standard-processors/src/main/java/org/apache/nifi/processors/standard/PutDatabaseRecord.java I am thinking it's case sensitivity issues. This is a difficult name and not supported by AVRO: _of_progression varchar(255), If you have any weird or non-AVRO safe names, try my Attributename cleaner https://community.hortonworks.com/articles/154760/generating-avro-schemas-and-ensuring-field-names-m.html
... View more
02-27-2018
04:43 PM
include template, screen shot, example CSV
... View more