Created on 02-24-2017 12:53 PM - edited 09-16-2022 04:09 AM
Hello,
I have a table that is pointing to a location in HDFS, like this:
# col_name data_type comment
treceived string
unix_tstamp bigint
tryear int
trmonth int
trday int
trhour int
trminute int
trsec int
tdur float
sip string
dip string
sport int
dport int
proto string
flag string
fwd int
stos int
ipkt bigint
ibyt bigint
opkt bigint
obyt bigint
input int
output int
sas int
das int
dtos int
dir int
rip string
# Partition Information
# col_name data_type comment
y int
m int
d int
h int
# Detailed Table Information
Database: spotdb
Owner: spot
CreateTime: Thu Feb 23 16:41:20 CLST 2017
LastAccessTime: UNKNOWN
Protect Mode: None
Retention: 0
Location: hdfs://HDFS-namenode:8020/user/spot/flow/hive
Table Type: EXTERNAL_TABLE
Table Parameters:
EXTERNAL TRUE
avro.schema.literal {\n \"type\": \"record\"\n , \"name\": \"FlowRecord\"\n , \"namespace\" : \"com.cloudera.accelerators.flows.avro\"\n , \"fields\": [\n {\"name\": \"treceived\", \"type\":[\"string\", \"null\"]}\n , {\"name\": \"unix_tstamp\", \"type\":[\"long\", \"null\"]}\n , {\"name\": \"tryear\", \"type\":[\"int\", \"null\"]}\n , {\"name\": \"trmonth\", \"type\":[\"int\", \"null\"]}\n , {\"name\": \"trday\", \"type\":[\"int\", \"null\"]}\n , {\"name\": \"trhour\", \"type\":[\"int\", \"null\"]}\n , {\"name\": \"trminute\", \"type\":[\"int\", \"null\"]}\n , {\"name\": \"trsec\", \"type\":[\"int\", \"null\"]}\n , {\"name\": \"tdur\", \"type\":[\"float\", \"null\"]}\n , {\"name\": \"sip\", \"type\":[\"string\", \"null\"]}\n , {\"name\": \"sport\", \"type\":[\"int\", \"null\"]}\n , {\"name\": \"dip\", \"type\":[\"string\", \"null\"]}\n , {\"name\": \"dport\", \"type\":[\"int\", \"null\"]}\n , {\"name\": \"proto\", \"type\":[\"string\", \"null\"]}\n , {\"name\": \"flag\", \"type\":[\"string\", \"null\"]}\n , {\"name\": \"fwd\", \"type\":[\"int\", \"null\"]}\n , {\"name\": \"stos\", \"type\":[\"int\", \"null\"]}\n , {\"name\": \"ipkt\", \"type\":[\"bigint\", \"null\"]}\n , {\"name\": \"ibytt\", \"type\":[\"bigint\", \"null\"]}\n , {\"name\": \"opkt\", \"type\":[\"bigint\", \"null\"]}\n , {\"name\": \"obyt\", \"type\":[\"bigint\", \"null\"]}\n , {\"name\": \"input\", \"type\":[\"int\", \"null\"]}\n , {\"name\": \"output\", \"type\":[\"int\", \"null\"]}\n , {\"name\": \"sas\", \"type\":[\"int\", \"null\"]}\n , {\"name\": \"das\", \"type\":[\"int\", \"null\"]}\n , {\"name\": \"dtos\", \"type\":[\"int\", \"null\"]}\n , {\"name\": \"dir\", \"type\":[\"int\", \"null\"]}\n , {\"name\": \"rip\", \"type\":[\"string\", \"null\"]}\n ]\n}
transient_lastDdlTime 1487878880
# Storage Information
SerDe Library: org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveSerDe
InputFormat: org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat
OutputFormat: org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat
Compressed: No
Num Buckets: -1
Bucket Columns: []
Sort Columns: []
Storage Desc Params:
field.delim ,
serialization.format ,But when I select that table it says that it's empty. I checked that HDFS loacation and it has parquet files:
sudo -u hdfs hdfs dfs -ls /user/spot/flow/hive/y=2017/m=02/d=23/h=23 Found 12 items -rwxr-xr-x 3 spot supergroup 440 2017-02-23 20:05 /user/spot/flow/hive/y=2017/m=02/d=23/h=23/000000_0 -rwxr-xr-x 3 spot supergroup 440 2017-02-23 20:10 /user/spot/flow/hive/y=2017/m=02/d=23/h=23/000000_0_copy_1 -rwxr-xr-x 3 spot supergroup 440 2017-02-23 20:55 /user/spot/flow/hive/y=2017/m=02/d=23/h=23/000000_0_copy_10 -rwxr-xr-x 3 spot supergroup 440 2017-02-23 21:00 /user/spot/flow/hive/y=2017/m=02/d=23/h=23/000000_0_copy_11 -rwxr-xr-x 3 spot supergroup 440 2017-02-23 20:15 /user/spot/flow/hive/y=2017/m=02/d=23/h=23/000000_0_copy_2 -rwxr-xr-x 3 spot supergroup 440 2017-02-23 20:20 /user/spot/flow/hive/y=2017/m=02/d=23/h=23/000000_0_copy_3 -rwxr-xr-x 3 spot supergroup 440 2017-02-23 20:25 /user/spot/flow/hive/y=2017/m=02/d=23/h=23/000000_0_copy_4 -rwxr-xr-x 3 spot supergroup 440 2017-02-23 20:30 /user/spot/flow/hive/y=2017/m=02/d=23/h=23/000000_0_copy_5 -rwxr-xr-x 3 spot supergroup 440 2017-02-23 20:35 /user/spot/flow/hive/y=2017/m=02/d=23/h=23/000000_0_copy_6 -rwxr-xr-x 3 spot supergroup 440 2017-02-23 20:40 /user/spot/flow/hive/y=2017/m=02/d=23/h=23/000000_0_copy_7 -rwxr-xr-x 3 spot supergroup 440 2017-02-23 20:45 /user/spot/flow/hive/y=2017/m=02/d=23/h=23/000000_0_copy_8 -rwxr-xr-x 3 spot supergroup 440 2017-02-23 20:50 /user/spot/flow/hive/y=2017/m=02/d=23/h=23/000000_0_copy_9
What did i do wrong?
Regards,
Joaquín Silva
Created 02-24-2017 02:32 PM
Created 02-24-2017 01:05 PM
Created 02-24-2017 01:08 PM
This is the result:
show partitions spotdb.flow; OK y=2017/m=02/d=23/h=20 y=2017/m=02/d=23/h=21 y=2017/m=02/d=23/h=22 y=2017/m=02/d=23/h=23 y=2017/m=02/d=24/h=00 y=2017/m=02/d=24/h=01 y=2017/m=02/d=24/h=02 y=2017/m=02/d=24/h=03 y=2017/m=02/d=24/h=04 y=2017/m=02/d=24/h=05 y=2017/m=02/d=24/h=06 y=2017/m=02/d=24/h=07 y=2017/m=02/d=24/h=08 y=2017/m=02/d=24/h=09 y=2017/m=02/d=24/h=10 y=2017/m=02/d=24/h=11 y=2017/m=02/d=24/h=12 y=2017/m=02/d=24/h=13 y=2017/m=02/d=24/h=14 y=2017/m=02/d=24/h=15 y=2017/m=02/d=24/h=16 y=2017/m=02/d=24/h=17 y=2017/m=02/d=24/h=18 y=2017/m=02/d=24/h=19 y=2017/m=02/d=24/h=20
As I can see it recognize the partitions.
Created 02-24-2017 01:19 PM
Created 02-24-2017 01:42 PM
This is the result:
/opt/cloudera/parcels/CDH-5.8.0-1.cdh5.8.0.p0.42/lib/parquet/bin/parquet-tools cat /user/spot/flow/hive/y=2017/m=02/d=24/h=20/000000_0 File /user/spot/flow/hive/y=2017/m=02/d=24/h=20/000000_0 does not exist
But the file exists
sudo -u hdfs hdfs dfs -ls /user/spot/flow/hive/y=2017/m=02/d=24/h=20 Found 12 items -rwxr-xr-x 3 spot supergroup 440 2017-02-24 17:05 /user/spot/flow/hive/y=2017/m=02/d=24/h=20/000000_0 -rwxr-xr-x 3 spot supergroup 440 2017-02-24 17:10 /user/spot/flow/hive/y=2017/m=02/d=24/h=20/000000_0_copy_1 -rwxr-xr-x 3 spot supergroup 440 2017-02-24 17:55 /user/spot/flow/hive/y=2017/m=02/d=24/h=20/000000_0_copy_10 -rwxr-xr-x 3 spot supergroup 440 2017-02-24 18:00 /user/spot/flow/hive/y=2017/m=02/d=24/h=20/000000_0_copy_11 -rwxr-xr-x 3 spot supergroup 440 2017-02-24 17:15 /user/spot/flow/hive/y=2017/m=02/d=24/h=20/000000_0_copy_2 -rwxr-xr-x 3 spot supergroup 440 2017-02-24 17:20 /user/spot/flow/hive/y=2017/m=02/d=24/h=20/000000_0_copy_3 -rwxr-xr-x 3 spot supergroup 440 2017-02-24 17:25 /user/spot/flow/hive/y=2017/m=02/d=24/h=20/000000_0_copy_4 -rwxr-xr-x 3 spot supergroup 440 2017-02-24 17:30 /user/spot/flow/hive/y=2017/m=02/d=24/h=20/000000_0_copy_5 -rwxr-xr-x 3 spot supergroup 440 2017-02-24 17:35 /user/spot/flow/hive/y=2017/m=02/d=24/h=20/000000_0_copy_6 -rwxr-xr-x 3 spot supergroup 440 2017-02-24 17:40 /user/spot/flow/hive/y=2017/m=02/d=24/h=20/000000_0_copy_7 -rwxr-xr-x 3 spot supergroup 440 2017-02-24 17:45 /user/spot/flow/hive/y=2017/m=02/d=24/h=20/000000_0_copy_8 -rwxr-xr-x 3 spot supergroup 440 2017-02-24 17:50 /user/spot/flow/hive/y=2017/m=02/d=24/h=20/000000_0_copy_9
Created 02-24-2017 01:50 PM
Created 02-24-2017 02:32 PM