How do I apply a WHERE condition across two datasets?

I want to implement this query using the Spark Dataset API:

select a.*, b.* from a, b where a.col1 = b.col1 or a.col1 = b.col2

I have created two datasets, "a" and "b". How do I apply this condition to them?

Progress so far:

I tried a join with an OR condition, but the job hung. A similar issue is described here: https://community.hortonworks.com/questions/85718/dataframe-join-with-or-condition.html
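
One workaround that is often suggested for this kind of query is to split the OR into two equality joins and combine the results. As far as I understand, a join condition that is an OR of equalities usually cannot be planned as a hash or sort-merge join and tends to fall back to a nested-loop style comparison of every pair of rows, which would explain a job that appears to hang. The plain-Python sketch below is not Spark code; it only illustrates the rewrite on small in-memory rows. The function names and the "id" field are hypothetical, and the assumption is that a row of "b" matching on both col1 and col2 should appear once, as it would in the original query.

```python
from collections import defaultdict


def or_join_nested_loop(a_rows, b_rows):
    """Reference version: compare every row of a with every row of b."""
    return [
        (a, b)
        for a in a_rows
        for b in b_rows
        # SQL-style semantics: a NULL (None) key never matches anything.
        if a["col1"] is not None
        and (a["col1"] == b["col1"] or a["col1"] == b["col2"])
    ]


def or_join_as_two_equi_joins(a_rows, b_rows):
    """Same result, built from two hash lookups instead of a full scan."""
    by_col1 = defaultdict(list)
    by_col2 = defaultdict(list)
    for b in b_rows:
        by_col1[b["col1"]].append(b)
        by_col2[b["col2"]].append(b)

    result = []
    for a in a_rows:
        key = a["col1"]
        if key is None:
            continue  # NULL keys do not join
        # First equality join: a.col1 = b.col1
        result.extend((a, b) for b in by_col1.get(key, []))
        # Second equality join: a.col1 = b.col2, skipping rows that the
        # first join already produced so they are not emitted twice.
        result.extend(
            (a, b) for b in by_col2.get(key, []) if b["col1"] != key
        )
    return result


# Hypothetical sample rows; "id" is only there to make the output readable.
a_rows = [
    {"id": 1, "col1": "x"},
    {"id": 2, "col1": "y"},
    {"id": 3, "col1": None},
]
b_rows = [
    {"id": 10, "col1": "x", "col2": "x"},
    {"id": 11, "col1": "z", "col2": "y"},
    {"id": 12, "col1": None, "col2": None},
]

matched = or_join_as_two_equi_joins(a_rows, b_rows)
print(sorted((a["id"], b["id"]) for a, b in matched))
```

In Spark terms this would correspond to two joins on a single equality each, combined with a union, where the second join filters out rows already matched by the first. Whether that is faster on real data depends on the sizes and key distribution of "a" and "b".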
