I am ingesting multiple tables in hadoop [CSV files],
during the ingestion process i want to combine /find relation between tables [primary key and foreign key] , join them and create single de-normalize table from all the table
In this case i want to find key by algorithm/code so automatically identify key and merge all tables
Is there any paper available ?
of can any one suggest me some steps to find keys and relations
Hi, @Paresh Baldaniya. It's irrelevant to Spark SQL itself. If you use MySQL, the same issue exists there. It's completely up to you. You had better search and find some analytics tools for that.