I built an Oozie workflow that populates two Hive tables. Every day the workflow inserts into two Hive managed tables: one is a plain (unpartitioned) managed table, and the other is a managed table partitioned by date. Over time these tables will grow large.
In your opinion, is there a limit on the maximum size of these tables?
The answer depends on how much retention you're looking at. Hive does scale pretty well for very large numbers of partitions or large files managed under it, but how many years of data are you asking about?