Member since
04-27-2016
60
Posts
20
Kudos Received
0
Solutions
11-30-2016
11:05 AM
Hi all, Is possible to create an workflow on Oozie that automatically execute some Hive, Pig and Spark scripts in order to automate my analytics process? Many thanks!
... View more
Labels:
- Labels:
-
Apache Oozie
-
Apache Spark
09-29-2016
06:38 PM
Hi experts,
How can I overwrite an existing file by a new one (data update). Imagine that I've this:
result.map(pair => pair.swap).sortByKey(true).saveAsTextFile("FILE/results")
And Imagine that I want to do this: test.map(pair => pair.swap).sortByKey(false).saveAsTextFile("FILE/results")
How can I overwrite the results of the var result to the results of the val test in same directory?
... View more
Labels:
- Labels:
-
Apache Spark
09-17-2016
11:22 PM
gkeys, many thanks! This was a fantastic answer and cover all of my doubts! 😄 😄
... View more
09-17-2016
08:15 PM
2 Kudos
What is the bigger advantage of using Hadoop instead SQL Server or ODI when we aren't in a Big Data Scenario?
Many thanks!
... View more
Labels:
- Labels:
-
Apache Hadoop
09-04-2016
02:53 PM
Hi experts,
I've this statment in Apache PIG:
...
Count = FOREACH data GENERATE SUM(Field);
...
How can do a IF Statement like this:
IF(SUM(Field) > 10) Store into X;
ELSE
STORE into Y;
Is possible to do this?
Many thanks!
... View more
Labels:
- Labels:
-
Apache Hadoop
-
Apache Pig
08-08-2016
04:09 PM
Hi mqureshi, many thanks for your help 🙂 I will look for good articles/tutorials that show me how to use complex Types in Hive.
Thanks!
... View more
08-08-2016
03:37 PM
Hi,
I have four tables in .csv. All of them can be conected through a fact table (that are in .csv too). I wanna to do some data cleansing to this files and next put them into a Big Table in Have. But in Apache PIG should I've to create a script by table individually, or is better to join in PIG and then aplly some data cleansing in this normalized table?
Thanks!
... View more
Labels:
- Labels:
-
Apache Pig
08-04-2016
10:46 AM
1 Kudo
I was missing some Jar files 🙂
... View more
08-03-2016
08:07 AM
If I use Python inside a file.py in my HDFS I can run Pytho UDFs but with Java I'm getting error... I think I'm not getting all the files
... View more
08-03-2016
08:06 AM
Perfect Lester 🙂 It's exactly what I need!!! 🙂 Many thanks!!!
... View more