Member since 09-18-2015
3274 Posts · 1159 Kudos Received · 426 Solutions
02-16-2016
08:45 PM
I have tested this, and it is the best and easiest way to configure the connection.
02-16-2016
08:06 PM
@Roberto Sancho I had to reduce it from 1024 to 64 MB to make it work. Link
02-16-2016
07:54 PM
@Roberto Sancho I have accepted this answer. Thanks for following up. You had to define the schema as suggested in the reply.
02-16-2016
07:49 PM
1 Kudo
@Roberto Sancho You are correct. I have seen this behavior and had to tune the mapreduce.task.io.sort.mb setting. You may have to test it a few times with different values.
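A minimal sketch of that trial-and-error loop, assuming the failing job is launched from a Hive session (the thread does not say which client is in use). The value 64 is only a starting point, not a recommendation:

```sql
-- Assumption: the failing job is started from this Hive session.
-- Print the value currently in effect.
SET mapreduce.task.io.sort.mb;

-- Try a smaller sort buffer (value is in MB), then rerun the failing statement.
SET mapreduce.task.io.sort.mb=64;
```

A SET issued this way applies only to the current session, so each value can be tested without touching the cluster-wide configuration.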
02-16-2016
07:45 PM
@hoda moradi See the Spark Streaming and Kafka integration guide: http://spark.apache.org/docs/latest/streaming-kafka-integration.html It includes Java code.
02-16-2016
07:42 PM
@Deepak Girdher

1. "I figured out that once the data is loaded into the external table, only then will the file appear in HDFS, as create table just stores the metadata and the data is not present in HDFS."

Incorrect: the data resides in HDFS. HCatalog stores the metadata.

2. "My root was not having write access to /tmp. When I tried to load the data I got the error. That is how it stuck in my mind that there could be some permission issue. 🙂"

You should never point a table to /tmp. The permission issue occurs because you are trying to read other users' directories.

"Once I gave the path for which my root had the access, i.e. /user/root in my case, which I created and loaded the data into the table, I could see the file and its contents."

That's correct.
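As a rough illustration of the working setup described above, here is a sketch of an external table that lives under the user's own HDFS directory rather than /tmp. The table name, subdirectory, staging path, and delimiter are hypothetical; only /user/root comes from the post:

```sql
-- Hypothetical table, paths, and delimiter, for illustration only.
CREATE EXTERNAL TABLE names_ext (name STRING)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
STORED AS TEXTFILE
LOCATION '/user/root/names_ext';

-- Move a file that is already in HDFS into the table's directory.
LOAD DATA INPATH '/user/root/staging/names.csv' INTO TABLE names_ext;

-- List the table directory from inside the Hive CLI to confirm the file is there.
dfs -ls /user/root/names_ext;

SELECT * FROM names_ext LIMIT 10;
```

Giving the table its own subdirectory means Hive reads only that table's files, and the user running the query owns everything under the location.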
02-16-2016
06:44 PM
@Deepak Girdher Run this:
show create table tablename;
Example:
hive> create external table ns1 (name string) location '/tmp/';
OK
Time taken: 0.597 seconds
hive> show create table ns1;
OK
CREATE EXTERNAL TABLE `ns1`(
`name` string)
ROW FORMAT SERDE
'org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe'
STORED AS INPUTFORMAT
'org.apache.hadoop.mapred.TextInputFormat'
OUTPUTFORMAT
'org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat'
LOCATION
'hdfs://phdns01.cloud.hortonworks.com:8020/tmp'
TBLPROPERTIES (
'COLUMN_STATS_ACCURATE'='false',
'numFiles'='0',
'numRows'='-1',
'rawDataSize'='-1',
'totalSize'='0',
'transient_lastDdlTime'='1455648183')
Time taken: 0.354 seconds, Fetched: 17 row(s)
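If only the table's location and type are of interest, DESCRIBE FORMATTED is another standard Hive statement that reports them. A small sketch, reusing the ns1 table from the example above:

```sql
-- Prints table metadata, including the "Location:" and "Table Type:" rows.
DESCRIBE FORMATTED ns1;
```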
02-16-2016
06:41 PM
@Deepak Girdher You ended up giving the whole of /tmp as your table location. What do you see when you run select * from table; ?