In folder "A" have many file, I need load into external table Hive:
I need load file 1 into table 1, file 2 into table 2...v.. But I don't want restructuring folder "A".
I think Hive (location) has regex to load special file in a folder. But I can't find it.
I try use "load data inpath" but which remove file source. I don't know which has option don't remove file source.?
Can you help me? I need a solution.
Thanks all. :))
I am not sure if I got you right, but what you want is to create three external hive tables where each of them has exactly 1 file as the data? Or do you need a filename pattern?
If you used 'LOAD DATA INPATH...' as a command it is the expected behaviour that it will move a hdfs path into the hive controlled path of hdfs: https://cwiki.apache.org/confluence/display/Hive/GettingStarted
To create the schema based on a file outside the hive controlled hdfs path you need to use the CREATE EXTERNAL TABLE with the json serde and provide the file name as location.
If you need a regex as the filename pattern, try using WITH SERDEPROPERTIES ( "input.regex" = "*.json") and provide only the path as the location.