Member since: 06-07-2016
Posts: 923
Kudos Received: 322
Solutions: 115
My Accepted Solutions
Views | Posted
---|---
3989 | 10-18-2017 10:19 PM
4253 | 10-18-2017 09:51 PM
14627 | 09-21-2017 01:35 PM
1769 | 08-04-2017 02:00 PM
2356 | 07-31-2017 03:02 PM
05-02-2017
02:34 PM
@heta desai Thanks. If the answer helped, could you please accept it?
05-02-2017
02:15 PM
@Zack Riesland You can use the Ambari REST API to update the YARN Capacity Scheduler queues. An example is given at the following link: https://community.hortonworks.com/questions/33578/api-to-manage-yarn-capacity-queue.html
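To make the idea concrete, here is a minimal sketch of the request body such an update might carry. It assumes the commonly documented Ambari pattern of setting a new desired_config of type capacity-scheduler on the cluster resource; the queue names, capacities, and tag below are hypothetical, and as far as I know the properties map replaces the whole config type, so in practice you would send the full set of capacity-scheduler properties rather than just the ones you changed.

```python
import json
import time


def build_capacity_scheduler_update(properties, tag=None):
    """Build a JSON body that sets a new capacity-scheduler config version."""
    # Each config version needs its own tag; a timestamp-based tag is one
    # simple way to keep it unique (an assumption, not a requirement).
    if tag is None:
        tag = "version{}".format(int(time.time() * 1000))
    return {
        "Clusters": {
            "desired_config": {
                "type": "capacity-scheduler",
                "tag": tag,
                "properties": properties,
            }
        }
    }


# Hypothetical queue layout: two queues splitting the cluster 70/30.
queue_properties = {
    "yarn.scheduler.capacity.root.queues": "default,etl",
    "yarn.scheduler.capacity.root.default.capacity": "70",
    "yarn.scheduler.capacity.root.etl.capacity": "30",
}

body = build_capacity_scheduler_update(queue_properties, tag="version-example")
print(json.dumps(body, indent=2))
```

The body would then be sent with an authenticated PUT to /api/v1/clusters/<cluster-name> on your Ambari server, with the X-Requested-By header set. The queues still need to be refreshed afterwards for YARN to pick up the change.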
05-02-2017
01:50 PM
@Tech Gig Yes. If you look at the directory structure in Zookeeper as described at the link below, you will have a clearer idea. But your understanding is right: https://cwiki.apache.org/confluence/display/KAFKA/Kafka+data+structures+in+Zookeeper
05-01-2017
06:44 PM
1 Kudo
@Tech Gig No, they are not created under Zookeeper, but Kafka uses Zookeeper for state management and for tracking Kafka topics and partitions. For example, consumers record the offset of the record they have read so they know which record to read next. I'd recommend the following deck (slide 10) to see how this works: https://www.slideshare.net/rahuldausa/apache-kafka-16727853?next_slideshow=1 and slide 15 of the following deck to understand how Kafka uses Zookeeper: https://www.slideshare.net/rahuldausa/introduction-to-kafka-and-zookeeper
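As a rough illustration of what "state in Zookeeper" looks like, the sketch below builds a few of the znode paths described on the Kafka wiki page "Kafka data structures in Zookeeper". The helper names, topic, and group are hypothetical, and the consumer offset path applies to the older Zookeeper-based consumer; newer consumers keep offsets in Kafka itself.

```python
def broker_path(broker_id):
    """Znode where a live broker registers itself."""
    return "/brokers/ids/{}".format(broker_id)


def partition_state_path(topic, partition):
    """Znode holding the leader and in-sync replica state of one partition."""
    return "/brokers/topics/{}/partitions/{}/state".format(topic, partition)


def consumer_offset_path(group, topic, partition):
    """Znode where an old-style consumer group stores its offset for a partition."""
    return "/consumers/{}/offsets/{}/{}".format(group, topic, partition)


# Hypothetical topic "clicks" with one partition, read by group "reporting".
for path in (
    broker_path(0),
    partition_state_path("clicks", 0),
    consumer_offset_path("reporting", "clicks", 0),
):
    print(path)
```

Note that only metadata lives at these paths. The messages themselves are stored in log files on the brokers, not in Zookeeper.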
04-30-2017
08:27 PM
@heta desai I think the database option should be "--hcatalog-database"; otherwise the default database is used, which is the behavior you are seeing. Check the following document for HCatalog integration with Sqoop: https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.3.2/bk_dataintegration/content/sqoop-hcatalog-integration.html
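For reference, a minimal sketch of how the option fits into an import command. It assembles the argument list in Python so each flag is easy to see; the JDBC URL, table names, and database name are hypothetical placeholders, and a real job will likely need credentials and other options as well.

```python
import shlex


def build_sqoop_hcatalog_import(jdbc_url, source_table, hcat_database, hcat_table):
    """Assemble a sqoop import command that targets a specific HCatalog database."""
    return [
        "sqoop", "import",
        "--connect", jdbc_url,
        "--table", source_table,
        # Without this option Sqoop falls back to the "default" database.
        "--hcatalog-database", hcat_database,
        "--hcatalog-table", hcat_table,
    ]


cmd = build_sqoop_hcatalog_import(
    "jdbc:mysql://db.example.com/sales",  # hypothetical source database
    "orders",
    "staging",
    "orders",
)
print(" ".join(shlex.quote(part) for part in cmd))
```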
04-29-2017
03:26 AM
@Abhijeet Rajput A numeric column is preferred, which is what you are already using. With a numeric column you don't run into case-sensitivity issues (for example, your database sorting records in a case-insensitive way). Do you have a column that is unique but not the primary key? Even distribution is important because otherwise your Sqoop job can be skewed. The number of mappers definitely matters if you have slots available: more mappers means more parallelism and a faster job. See the following link if you haven't already: http://stackoverflow.com/questions/37206232/sqoop-import-composite-primary-key-and-textual-primary-key
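To show why even distribution matters, here is a small illustration of the idea, not Sqoop's actual splitter code. It assumes the usual behavior where the minimum and maximum of the split column are divided into one equal-width range per mapper, and then counts how many rows land in each range. The key values are made up.

```python
def split_ranges(lo, hi, num_mappers):
    """Cut the half-open interval [lo, hi) into num_mappers equal-width ranges."""
    width = (hi - lo) / num_mappers
    bounds = [lo + int(i * width) for i in range(num_mappers)] + [hi]
    return list(zip(bounds[:-1], bounds[1:]))


def rows_per_mapper(keys, ranges):
    """Count how many keys fall into each mapper's range."""
    return [sum(1 for k in keys if start <= k < end) for start, end in ranges]


ranges = split_ranges(0, 100, 4)

# Evenly distributed keys: every mapper gets the same share of rows.
even_keys = list(range(100))
even_load = rows_per_mapper(even_keys, ranges)

# Skewed keys: almost all values are clustered at the low end of the range.
skewed_keys = [i % 20 for i in range(96)] + [30, 60, 80, 99]
skewed_load = rows_per_mapper(skewed_keys, ranges)

print(ranges)
print(even_load)
print(skewed_load)
```

With the skewed keys, one mapper does nearly all the work while the others sit mostly idle, so adding mappers does not make the job faster.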
04-28-2017
11:03 PM
@Karan Alang I agree. I tried it and was able to reproduce your issue when I left out the semicolon. I wish beeline would complain, but apparently it doesn't. Did this resolve your issue?
04-28-2017
09:50 PM
@Karan Alang Try removing the single quotes from the URL. If that doesn't help, add default as the database you are connecting to. You can change your database later.
04-28-2017
05:37 AM
@Abhijeet Rajput Sqoop should load data in UTF-8 by default. Run the following: get db cfg for db_name, and check the value of Database_code_set. Also, in your mapred-site.xml, can you please try adding the following to mapreduce.map.java.opts: -Ddb2.jcc.charsetDecoderEncoder=3
04-26-2017
09:00 PM
@Bala Vignesh N V Once you delete data, you lose all copies. Most of the time, the purpose of deletion is to reclaim capacity. So what do you do when you accidentally delete data? That's exactly why we have DR clusters or backups in other places. As for retrieving a single copy, as in this case, either use the process in the link I shared or, in the extreme case where you delete something and realize it right away, shut down everything and use forensic software.