Member since: 02-01-2019
Posts: 650
Kudos Received: 143
Solutions: 117
My Accepted Solutions
Title | Views | Posted
---|---|---
 | 1654 | 04-01-2019 09:53 AM
 | 923 | 04-01-2019 09:34 AM
 | 3663 | 01-28-2019 03:50 PM
 | 872 | 11-08-2018 09:26 AM
 | 2453 | 11-08-2018 08:55 AM
04-04-2019
03:56 PM
@geniusbaibai Starting with HDP 3.0, Spark cannot directly access Hive; you'd need to use HWC (Hive Warehouse Connector): https://community.hortonworks.com/articles/223626/integrating-apache-hive-with-apache-spark-hive-war.html
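For illustration, a minimal pyspark sketch of reading through HWC (assuming the HWC jar and Python module are set up and the HiveServer2 JDBC URL is configured per the linked article; the database and table names are placeholders):
from pyspark_llap import HiveWarehouseSession
# build an HWC session on top of the existing 'spark' session (pyspark shell)
hive = HiveWarehouseSession.session(spark).build()
hive.showDatabases().show()
# queries run through HiveServer2 Interactive rather than against the metastore directly
hive.executeQuery("SELECT * FROM some_db.some_table LIMIT 10").show()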
04-01-2019
09:53 AM
1 Kudo
@Michael Bronson, permission issue 🙂 Either run this command as the hdfs user or change the ownership of /benchmarks/TestDFSIO to root (e.g. hdfs dfs -chown -R root /benchmarks/TestDFSIO). The error: java.io.IOException: Permission denied: user=root, access=WRITE, inode="/benchmarks/TestDFSIO/io_control/in_file_test_io_0":hdfs:hdfs:drwxr-xr-x
04-01-2019
09:50 AM
1 Kudo
@ram sriram the error says "java.lang.OutOfMemoryError: Java heap space". Do evaluate your driver/executor memory settings and increase them accordingly.
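As a hedged sketch, executor memory can be raised when building the session; the values below are placeholders, and driver memory usually has to be set at launch time (e.g. spark-submit --driver-memory 4g) before the driver JVM starts:
from pyspark.sql import SparkSession
# example values only; tune to your workload and cluster capacity
spark = (SparkSession.builder
         .appName("heap-tuning-example")
         .config("spark.executor.memory", "4g")
         .config("spark.executor.instances", "4")
         .getOrCreate())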
04-01-2019
09:49 AM
@Jordy Andreas Looks like you don't have the Flume Twitter jar in Flume's classpath. You can get the jar from https://github.com/cloudera/cdh-twitter-example and place it in Flume's classpath.
04-01-2019
09:34 AM
@Sampath Kumar, please refer to this article: https://community.hortonworks.com/articles/217295/ambari-270-how-to-reset-ambari-admin-password-from.html
02-07-2019
09:10 AM
Create Kafka topic
/usr/hdp/current/kafka-broker/bin/kafka-topics.sh --create --zookeeper `hostname`:2181 --replication-factor 1 --partitions 1 --topic kafka_hive_topic
Create the Hive table (update the Kafka broker hostname below).
CREATE EXTERNAL TABLE kafka_hive_table
(`Country Name` string, `Language` string, `_id` struct<`$oid`:string>)
STORED BY 'org.apache.hadoop.hive.kafka.KafkaStorageHandler'
TBLPROPERTIES
("kafka.topic" = "kafka_hive_topic", "kafka.bootstrap.servers"="c2114-node2.labs.com:6667");
Download the sample JSON data.
wget -O countries.json https://github.com/ozlerhakan/mongodb-json-files/blob/master/datasets/countries.json?raw=true
Produce data into the Kafka topic.
cat countries.json | /usr/hdp/current/kafka-broker/bin/kafka-console-producer.sh --broker-list c2114-node2.labs.com:6667 --topic kafka_hive_topic
Describe the table (to see the additional Kafka-specific columns).
describe kafka_hive_table;
+---------------+----------------------+--------------------+
| col_name | data_type | comment |
+---------------+----------------------+--------------------+
| country name | string | from deserializer |
| language | string | from deserializer |
| _id | struct<$oid:string> | from deserializer |
| __key | binary | from deserializer |
| __partition | int | from deserializer |
| __offset | bigint | from deserializer |
| __timestamp | bigint | from deserializer |
+---------------+----------------------+--------------------+
Run some sample queries.
SELECT count(*) from kafka_hive_table;
+--------+
| _c0 |
+--------+
| 21640 |
+--------+
SELECT `__partition`, max(`__offset`), CURRENT_TIMESTAMP FROM kafka_hive_table GROUP BY `__partition`, CURRENT_TIMESTAMP;
+--------------+--------+--------------------------+
| __partition | _c1 | _c2 |
+--------------+--------+--------------------------+
| 0 | 21639 | 2019-02-07 08:49:50.918 |
+--------------+--------+--------------------------+
select * from kafka_hive_table limit 10;
+--------------------------------+----------------------------+--------------------------------------+-------------------------+-------------------------------+----------------------------+-------------------------------+
| kafka_hive_table.country name | kafka_hive_table.language | kafka_hive_table._id | kafka_hive_table.__key | kafka_hive_table.__partition | kafka_hive_table.__offset | kafka_hive_table.__timestamp |
+--------------------------------+----------------------------+--------------------------------------+-------------------------+-------------------------------+----------------------------+-------------------------------+
| Afrika | af | {"$oid":"55a0f1d420a4d760b5fbdbd6"} | NULL | 0 | 0 | 1549529251002 |
| Oseanië | af | {"$oid":"55a0f1d420a4d760b5fbdbd7"} | NULL | 0 | 1 | 1549529251010 |
| Suid-Amerika | af | {"$oid":"55a0f1d420a4d760b5fbdbd8"} | NULL | 0 | 2 | 1549529251010 |
| Wêreld | af | {"$oid":"55a0f1d420a4d760b5fbdbd9"} | NULL | 0 | 3 | 1549529251011 |
| አፍሪካ | am | {"$oid":"55a0f1d420a4d760b5fbdbda"} | NULL | 0 | 4 | 1549529251011 |
| ኦሽኒያ | am | {"$oid":"55a0f1d420a4d760b5fbdbdb"} | NULL | 0 | 5 | 1549529251011 |
| ዓለም | am | {"$oid":"55a0f1d420a4d760b5fbdbdc"} | NULL | 0 | 6 | 1549529251011 |
| ደቡባዊ አሜሪካ | am | {"$oid":"55a0f1d420a4d760b5fbdbdd"} | NULL | 0 | 7 | 1549529251011 |
| أمريكا الجنوبية | ar | {"$oid":"55a0f1d420a4d760b5fbdbde"} | NULL | 0 | 8 | 1549529251011 |
| أمريكا الشمالية | ar | {"$oid":"55a0f1d420a4d760b5fbdbdf"} | NULL | 0 | 9 | 1549529251011 |
+--------------------------------+----------------------------+--------------------------------------+-------------------------+-------------------------------+----------------------------+-------------------------------+
01-28-2019
04:33 PM
Seems to be the same script that I mentioned above, isn't it?
01-28-2019
03:50 PM
1 Kudo
@Marcel-Jan Krijgsman Do run the /usr/hdp/current/atlas-server/hook-bin/import-hive.sh utility, which imports the existing Hive tables into Atlas.
01-22-2019
06:29 PM
Good one @Jagatheesh Ramakrishnan
01-11-2019
09:23 AM
@rajendra you can use the below SQL statement against the metastore database:
mysql -u root -e "use hive; SELECT NAME, TBL_NAME FROM DBS as a, TBLS as b where a.DB_ID=b.DB_ID;" > tables.txt
tables.txt will then contain the list of all tables.
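Alternatively, a sketch using the Spark catalog API from a pyspark shell (assuming Hive support is enabled for the session) avoids querying the metastore database directly:
# print every table in every database known to the catalog
for db in spark.catalog.listDatabases():
    for tbl in spark.catalog.listTables(db.name):
        print(db.name, tbl.name)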
01-10-2019
11:30 AM
@natus, In HDP 2.x you only need to set ("spark.sql.hive.llap", "true") and create a Spark session. Refer to the examples below:
https://github.com/hortonworks-spark/spark-llap/blob/branch-2.3/examples/src/main/python/spark_llap_dsl.py
https://github.com/hortonworks-spark/spark-llap/blob/branch-2.3/examples/src/main/python/spark_llap_sql.py
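For reference, a minimal sketch along the lines of those examples (the app name and query are placeholders):
from pyspark.sql import SparkSession
# enable LLAP so that spark.sql() reads are routed through the LLAP daemons
spark = (SparkSession.builder
         .appName("llap-example")
         .config("spark.sql.hive.llap", "true")
         .enableHiveSupport()
         .getOrCreate())
spark.sql("SHOW DATABASES").show()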
01-10-2019
08:44 AM
@Hamza Khribi Structured Streaming is supported in HDP 3.0+, with the exception of continuous processing, an experimental streaming execution mode that is not currently supported. Prior to HDP 3.0 it was a Technical Preview. Ref: https://docs.hortonworks.com/HDPDocuments/HDP3/HDP-3.1.0/spark-overview/content/analyzing_data_with_apache_spark.html
01-10-2019
08:33 AM
@natus Please refer to this article: https://community.hortonworks.com/articles/72454/apache-spark-fine-grain-security-with-llap-test-dr.html HiveWarehouseBuilder is only available from HDP 3.x onwards. Hope this helps.
01-10-2019
08:10 AM
@A C, the separator for --driver-class-path should be ":" instead of "," (e.g. --driver-class-path /path/one.jar:/path/two.jar). Do update the command and rerun the pyspark shell. Let me know if this helps.
12-30-2018
12:28 PM
@Bmwer Bmwer An Oozie Spark action uses the same resources as the spark-submit command; additionally, Oozie runs a launcher job which internally submits the job. You may want to compare both runs, see where exactly the job is taking time, and try to mitigate that.
12-30-2018
06:38 AM
@hema moger, Do accept this answer and close this thread if it helped in addressing your query.
12-30-2018
06:37 AM
@Nilesh Do accept this answer and close this thread if it helped in addressing your query.
12-28-2018
02:54 PM
@Anjali Shevadkar There isn't Spark + Ranger support as of now; if you want to apply Ranger policies you'd need to use Spark LLAP. Ref:
https://hortonworks.com/blog/sparksql-ranger-llap-via-spark-thrift-server-bi-scenarios-provide-row-column-level-security-masking/
https://community.hortonworks.com/articles/72454/apache-spark-fine-grain-security-with-llap-test-dr.html
Hope this helps.
12-28-2018
02:52 PM
1 Kudo
@Teja sai tarun Your client must be able to connect to the ZooKeeper servers (irrespective of which network); only then can you connect and query through PQS.
12-15-2018
04:25 PM
@Michael Mester My HDP-3.1.0.0-78 cluster shows the right Kafka version, 2.0.0. Looks like something went wrong with your installation; do verify the repos which are configured.
12-15-2018
03:55 PM
@Nilesh, please refer to https://issues.apache.org/jira/browse/HDFS-107. TL;DR: the option to format datanode dirs was not added because it is risky and would cause data loss if misused. Hope this helps.
11-16-2018
06:53 PM
1 Kudo
@Igor Grinkin This doc has all the required ports listed: https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.6.5/bk_reference/content/reference_chap2.html
11-15-2018
04:16 AM
@Mayank Bhatt do add the "--protocol https --port 8443" options if Ambari is running on HTTPS.
11-13-2018
06:06 AM
@hema moger Here is sample pyspark code to convert CSV to JSON (assumes the pyspark shell, where 'spark' already exists):
# read the CSV file, using its header row as column names
df = spark.read.format("csv").option("header", "true").load("file:///tmp/sample.csv")
# collapse to one partition and write each row out as a JSON line
df.repartition(1).toJSON(use_unicode=True).saveAsTextFile("file:///tmp/sample_out")
Hope this helps.
11-09-2018
12:44 PM
@Clément Dumont I just tried it; looks like you have one jar from Apache which is not recognising the security.protocol property. Below is what I used with the consumer.py you provided. You can download the dependent jars from http://repo.hortonworks.com/content/repositories/releases/org/apache/spark
/usr/hdp/2.6.4.0-91/spark2/bin/spark-submit \
  --files spark_jaas.conf,kafka.service.keytab \
  --conf "spark.executor.extraJavaOptions=-Djava.security.auth.login.config=spark_jaas.conf" \
  --conf "spark.driver.extraJavaOptions=-Djava.security.auth.login.config=spark_jaas.conf" \
  --jars spark-streaming_2.11-2.2.0.2.6.4.0-91.jar,spark-streaming-kafka-0-8-assembly_2.11-2.2.0.2.6.4.0-91.jar,spark-streaming-kafka-0-10_2.11-2.2.0.2.6.4.0-91.jar \
  consumer.py
Hope this helps.
11-08-2018
09:47 AM
@Clément Dumont You also need to pass the keytab in the '--files' option so that the JAAS conf can use that keytab and connect to Kafka.
11-08-2018
09:26 AM
1 Kudo
@vasu arikatla AFAIK distcp still uses MapReduce in HDP 3.0.
11-08-2018
08:55 AM
@Mujeeb This error is due to a bug: "/usr/hdp/current/hive-server2/conf_llap//hive-env.sh: line 43: [: !=: unary operator expected" Please refer to: https://community.hortonworks.com/content/supportkb/225891/errorusrhdp3000-1634hiveconfhive-envsh-line-50-una.html (the line numbers in the article differ from those in your error; adjust accordingly). Hope this helps.
11-08-2018
08:50 AM
@Harjit Singh Ranger cannot perform these tasks. Apache Falcon is the component that can do this work (however, Falcon is deprecated as of HDP 2.6.5 and DLM provides these features now). Ref:
https://falcon.apache.org/FalconDocumentation.html#Retention
https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.6.5/bk_data-movement-and-integration/content/ch_config_features_properties.html
Hope this helps.
11-08-2018
08:41 AM
@Shantanu Sharma If you found this answer addressed your question, please take a moment to log in and click the "accept" link on the answer.