[TEZ] Are partition, sort and shuffle built in?
- Labels: Apache Hadoop, Apache Tez
Created 12-27-2017 04:37 PM
Hello,
The concept of MapReduce is pretty clear in my mind, but I can't say the same for Tez.
MapReduce performs its work through Map > Partition, Sort, Shuffle > Reduce, and I know each of these phases well.
But for Tez, and more precisely between two vertices (say a Map vertex and a Reduce vertex), how does it work?
Is there a built-in "partition, sort, shuffle" like in MR? Or is it up to us to manage this internal logic (I read a word count example and it seems so, but I prefer to be sure)?
Thanks !
Created 12-28-2017 06:13 AM
The background execution of Tez and MR has many similarities. The differences lie in where the data is placed in order to transform it. Tez uses a DAG to process the data, whereas MR does not.
This link would answer your question. Hope it helps!!
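For readers who want the Map > Partition, Sort, Shuffle > Reduce sequence from the question spelled out, here is a minimal sketch of a word count in plain Python. It is only an illustration of what happens between a producer stage and a consumer stage; it does not use the Tez or MapReduce APIs, and every function name in it (`map_phase`, `partition`, `sort_and_shuffle`, `reduce_phase`, `word_count`) is hypothetical.

```python
from collections import defaultdict


def map_phase(lines):
    """Producer side: emit a (word, 1) pair for every word."""
    for line in lines:
        for word in line.split():
            yield word, 1


def partition(pairs, num_reducers):
    """Assign each pair to one consumer task based on its key.

    A byte sum is used instead of hash() so the result is the same
    on every run (an assumption made for this toy example only).
    """
    buckets = [[] for _ in range(num_reducers)]
    for key, value in pairs:
        index = sum(key.encode("utf-8")) % num_reducers
        buckets[index].append((key, value))
    return buckets


def sort_and_shuffle(buckets):
    """Sort each partition by key and group the values per key,
    which is the shape of the data the consumer side receives."""
    shuffled = []
    for bucket in buckets:
        grouped = defaultdict(list)
        for key, value in sorted(bucket):
            grouped[key].append(value)
        shuffled.append(grouped)
    return shuffled


def reduce_phase(shuffled):
    """Consumer side: sum the grouped values of every key."""
    result = {}
    for grouped in shuffled:
        for key, values in grouped.items():
            result[key] = sum(values)
    return result


def word_count(lines, num_reducers=2):
    pairs = map_phase(lines)
    buckets = partition(pairs, num_reducers)
    return reduce_phase(sort_and_shuffle(buckets))
```

Calling `word_count(["a b a", "b c"])` runs all four steps in order. The point of the sketch is that all occurrences of a key end up in the same partition, sorted and grouped, before the reduce step sees them.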
Created 01-01-2018 11:52 AM
Thanks @Bala Vignesh N V; it helps 🙂
