About mqureshi

mqureshi · ‎05-02-2017

@heta desai Thanks. If the answer helped, can you please accept it.

mqureshi · ‎05-02-2017

@Zack Riesland You can use ambari REST API to update YARN capacity scheduler queues. An example is given in the following link: https://community.hortonworks.com/questions/33578/api-to-manage-yarn-capacity-queue.html

mqureshi · ‎05-02-2017

@Tech Gig Yes. If you see the directory structure in Zookeeper as described below, you will have a more clear idea. But your understanding is right: https://cwiki.apache.org/confluence/display/KAFKA/Kafka+data+structures+in+Zookeeper

mqureshi · ‎05-01-2017

@Tech Gig No, they are not created under Zookeeper but Zookeeper is used by Kafka for state management, kafka topics and partitions. For example, consumers mark the offset for the record they have read to know what record they will be reading next. I'd recommend following deck (slide 10) to see how this works: https://www.slideshare.net/rahuldausa/apache-kafka-16727853?next_slideshow=1 and slide 15 of following deck to understand how Zookeeper is used by Kafka: https://www.slideshare.net/rahuldausa/introduction-to-kafka-and-zookeeper

mqureshi · ‎04-30-2017

@heta desai I think the database option should be "--hcatagol-database" otherwise the default database is used which is what the behavior you are seeing. Checkt he following document for hcatalog integration with sqoop. https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.3.2/bk_dataintegration/content/sqoop-hcatalog-integration.html

mqureshi · ‎04-29-2017

@Abhijeet Rajput Numeric is preferred which you are already doing. You don't run into case sensitivity issues (your database sorting records in case insensitive way for example). Do you have a column which is unique but not primary key? Even distribution is important because otherwise your sqoop job can be skewed. Number of mappers definitely matter if you have slots available. More mappers, more parallelism, faster job. See the following link if you haven't already: http://stackoverflow.com/questions/37206232/sqoop-import-composite-primary-key-and-textual-primary-key

mqureshi · ‎04-28-2017

@Karan AlangI agree. I tried and was duplicate your issue when I don't have a semi colon. I wish beeline would complain but apparently it doesn't. Did this resolve your issue?

mqureshi · ‎04-28-2017

@Karan Alang Try removing single quotes from the url. If that doesn't help add default database as the db you are connecting to. You can later change your db.

mqureshi · ‎04-28-2017

@Abhijeet Rajput Sqoop should load data in UTF-8 by default. run the following get db cfg for db_name and see the value for Database_code_set. In your mapred-site.xml, can you please try adding the following for mapreduce.map.java.opts: -Ddb2.jcc.charsetDecoderEncoder=3

mqureshi · ‎04-26-2017

@Bala Vignesh N V Once you delete data, you lose all copies. Purpose of deletion majority of the time is reclaiming capacity. Now, what to do when you accidentally delete data? That's exactly why we have DR clusters or some backups in other places. As for retrieving single copy, like in this case is to use the process in the link I shared or in the extreme cases when you delete something and I realize it right away, then shutdown everything and use a Forensic software.

Online	Offline
Last Visited	‎10-31-2017 03:17 AM

Member Since	‎06-07-2016 09:05 AM
Last Visited	‎10-31-2017 03:17 AM
Posts	923
Kudos received	310

Cloudera Community

Re: YARN recommended configuration

Re: How to resolve for NULL values when they are c...

Re: Why is spark has better speed than Hadoop

Re: Is it possible to assign Hadoop queues to Hado...

Re: Kafka NiFi HDF Installation

Re: sqoop export syntax to export hive table to Sq...

Re: CLI command(s) to update YARN Capacity Schedul...

Re: Are Kafka Topics created under zookeeper when ...

Re: Are Kafka Topics created under zookeeper when ...

Re: sqoop export syntax to export hive table to Sq...

Re: SQOOP - Split By Key Manual

Re: HiveServer2 is up - Beeline not showing up dat...

Re: HiveServer2 is up - Beeline not showing up dat...

Re: Sqoop import - Special characters

Re: Hive - external table schema got dropped unfor...