Usually If we are using sqoop to import the data from any RDBMS,following is the folder structure we maintain in hdfs
raw_tbl->/data/raw (Landing folder for the initial text data after sqoop import)
source_tbl ->/data/source/(Create source table as ORC by selecting the data from raw tbl)
master_tbl -> /data/publish/<partion> (Create a master table in ORC by creating a parttiion and move the data from the source_tbl)
If you are using spark to save the dataframe,better save the dataframe in ORC format,since it gives better compression than any other format( such as avro ,parquet)