Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here. Want to know more about what has changed? Check out the Community News blog.
1) Import and export data between an external RDBMS and your cluster, including the ability to import specific subsets, change the delimiter and file format of imported data during ingest, and alter the data access pattern or privileges [bold part is not cleared to me]
2) Deduplication and merge data (what do we mean by this?)
3) Tune data for optimal query performance [Have to apply DML in hive ?](what comes in the scope of this items)