- Subscribe to RSS Feed
- Mark Question as New
- Mark Question as Read
- Float this Question for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
How to get the total number of rows from multiple tables in a database?
- Labels:
-
Apache Hive
Created on ‎09-16-2015 03:18 AM - edited ‎09-16-2022 02:41 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
In my db i have many tables that start with 'bza' in table name.
I wrote all those table names into a text file name as tables.txt
Now using a shell script (count_row.sh) I am reading each line from tables.txt and execuing the hive commnad:
while IFS='' read -r line || [[ -n "$line" ]]; do echo "" echo "" echo "" echo "Counting the table : $line" eval "hive -e 'select count(*) from $line'" done < "$1"
This results in total rows from each table, which I am writting it into a text file rows.txt as
$ ./count_row.sh tables.txt > row.txt
Now my task is I want to count the total number of rows from all the tables by summing the individual result.
Is there any way to achieve that ? Or any alternate way to count the rows from mutliple tables.
Created ‎09-20-2015 07:37 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
./count_row.sh tables.txt | paste -s -d+ - | bc
Ref: http://stackoverflow.com/questions/450799/shell-command-to-sum-integers-one-per-line#comment12469220...
P.s. It may be more efficient to generate a list of queries and run it via a single hive command, cause each command runs a whole new JVM.
Created ‎09-20-2015 07:37 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
./count_row.sh tables.txt | paste -s -d+ - | bc
Ref: http://stackoverflow.com/questions/450799/shell-command-to-sum-integers-one-per-line#comment12469220...
P.s. It may be more efficient to generate a list of queries and run it via a single hive command, cause each command runs a whole new JVM.
Created on ‎09-22-2015 01:38 AM - edited ‎09-22-2015 01:39 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
@Harsh J
By saying "Generate a list of queries" you mean, creating a list of queries and separate by ;(semicolon) in hive command like
hive -e 'query1;query2;...queryN' ?
Created ‎09-22-2015 02:26 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Created ‎09-22-2015 06:17 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Using hive -f a file can be passed, but the main purpose of writing the loop in script is to automate the process. I have to apply the same command for 250 tables.
Created ‎09-22-2015 06:19 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
given the described example. Does the bc command not solve your original
question?
Created ‎09-22-2015 10:05 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Created ‎09-22-2015 10:17 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
similar question can find a solution quicker.
Feel free to post a new thread with any further questions.
Created ‎05-05-2017 06:22 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
It doesn't work for me, here is the screenshot:
What am I missing here?
Thank you very much.
Created ‎07-31-2019 04:26 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Done keyword is missing in your script. In your loop body use done at last.
