Member since: 01-03-2017
Posts: 181
Kudos Received: 44
Solutions: 24
My Accepted Solutions
| Title | Views | Posted |
| --- | --- | --- |
|  | 1872 | 12-02-2018 11:49 PM |
|  | 2511 | 04-13-2018 06:41 AM |
|  | 2076 | 04-06-2018 01:52 AM |
|  | 2373 | 01-07-2018 09:04 PM |
|  | 5733 | 12-20-2017 10:58 PM |
07-09-2021
02:58 AM
Hi @singhvNt, as this is an older post, you would have a better chance of receiving a resolution by starting a new thread. This will also be an opportunity to provide details specific to your environment that could aid others in assisting you with a more accurate answer to your question. You can link this thread as a reference in your new post.
12-29-2020
07:33 AM
The export command below worked for me (the target table is created in MySQL first).

CREATE TABLE departments_export (
  departmentid INT(11),
  department_name VARCHAR(45),
  created_date TIMESTAMP
);

sqoop export --connect jdbc:mysql://<host>:3306/DB --username cloudera --password *** \
  --table departments_export \
  --export-dir '/user/cloudera/departments_new/*' \
  -m 1 \
  --input-fields-terminated-by ','

Sample input: 103,Finance,2020-10-10 10:10:00
12-01-2020
12:55 PM
The following mapping rule is wrong: RULE:[2:$1@$0](rm@MY_REALM)s/.*/rm/. The user for the ResourceManager is not "rm" but "yarn", and "yarn" should be the replacement value. This is the same as for hadoop.security.auth_to_local in the Hadoop/HDFS configuration.
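For reference, a corrected version of the rule, keeping the realm placeholder MY_REALM from the post, would look like this:

RULE:[2:$1@$0](rm@MY_REALM)s/.*/yarn/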
10-02-2020
12:46 AM
Won't it result in a shuffle spill without proper memory configuration in the Spark context?
03-28-2020
08:15 AM
I want a single file in the output that contains all the records from the array.
11-11-2019
07:11 PM
I believe it's a typo. We should use " (double quotes) rather than ' (single quotes) around the header argument, so that the shell expands the environment variable $token:

curl -k -X GET 'https://<nifi-hostname>:9091/nifi-api/flow/status' -H "Authorization: Bearer $token" --compressed
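As a rough sketch of the full flow (the token-request step and the username/password placeholders are assumptions, not part of the original post), the token can be obtained from the NiFi REST API and then passed inside double quotes so the shell expands it:

# Request a JWT from NiFi; endpoint port and credentials here are placeholders
token=$(curl -k -X POST "https://<nifi-hostname>:9091/nifi-api/access/token" \
  -H 'Content-Type: application/x-www-form-urlencoded' \
  -d 'username=<user>&password=<pass>')

# Single quotes would send the literal text $token; double quotes expand the variable
curl -k -X GET "https://<nifi-hostname>:9091/nifi-api/flow/status" \
  -H "Authorization: Bearer $token" --compressed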
08-02-2018
04:59 AM
Thank you for your reply @bkosaraju, but it seems I have no luck with the suggested query. I don't see any difference after submitting both queries. I just found this question (Hive transactional tables are not readable by Spark). According to the JIRA tickets, my situation seems to be caused by exactly the same problem, which still exists in the latest Spark version. Is there any workaround to use a Hive 3.0 table with Spark (with 'transactional' = 'true', which is mandatory for Hive 3.0 as far as I know)? If not, maybe I should roll back to HDP 2.6...
03-06-2018
01:27 AM
Hi @yogesh turkane, as far as I know, we can achieve this in two ways.
- After loading the data, or at scheduled intervals, run "ALTER TABLE <table_name> CONCATENATE" on the table through the SQL API. This will merge all the small ORC files associated with that table. Please note that this is specific to ORC.
- Use a data frame to load the data, then repartition and write it back with overwrite in Spark. The code snippet would be:
val tDf = hiveContext.table("table_name")
tDf.repartition(<num_Files>).write.mode("overwrite").saveAsTable("targetDB.targetTable")
The second option will work with any type of files. Hope this helps!!
03-07-2018
12:40 PM
For me, proxy settings (no matter whether they were set in IntelliJ, SBT.conf, or environment variables) did not work. A couple of considerations that solved this issue (for me at least):
- use SBT 0.13.16 (not newer than that)
- set "Use Auto Import"
Then, no "FAILED DOWNLOADS" messages appear.