Member since
12-01-2017
5
Posts
0
Kudos Received
0
Solutions
12-06-2017
08:49 PM
Hello all! Has anyone successfully connected to Hive via Windows Powershell? We've connected to both ORACLE and MSSQL databases via ODBC connections quite easily in the past, but we're having a hard time finding any documentation/code for connecting to Hive via Windows Powershell. We currently don't have any Azure subscriptions, so Azure Powershell/HdInsight is off the table. Thanks for your time!
... View more
Labels:
- Labels:
-
Apache Hive
12-04-2017
07:08 PM
Thanks for the reply! Unfortunately, that will not work in this case because the names are spelled differently. Example: Table 1 Name = Tyler LeMartin Table 2 Name = Tyler Le-Martin Table 3 Name = Tyler Martin I'd need to join the tables on to each based on the similiarity being >= 95%
... View more
12-04-2017
05:53 PM
Hello! I have been stuck on this issue for a few days and am at a complete standstill as to what I can do next. Here is a description of my problem. There are four tables imported into Hive with multiple data points. Unfortunately, the only join-able key is employee names. The problem with this key is that the names are spelled differently/formatted differently across data sources. I downloaded a fuzzy match UDF, but I'm having a hard time LEFT OUTER JOINING the tables onto each other using the fuzzy match UDF because the results are non-equality. I'm looking for suggestions on how to join these tables onto each other in HIVE without inner joining. I would do this on the server side sql, but these tables are from different data sources/databases. Thanks for the help!
... View more
Labels:
- Labels:
-
Apache Hive