Member since
10-06-2016
40
Posts
1
Kudos Received
0
Solutions
03-03-2017
04:22 PM
hi @Maher Hattabi I am seeing a similar question of yours in the link below. Here is one where i answered the question combining any files whether it be csv or txt https://community.hortonworks.com/questions/85230/erge-csv-files-in-one-file.html#answer-85245
... View more
03-02-2017
04:06 PM
A quick hack would be to use scala "substring" http://alvinalexander.com/scala/scala-string-examples-collection-cheat-sheet So what you can do is write a UDF and run the "new_time" column through it and grab upto time stamp you want. For example if you want just "yyyy-MM-dd HH:MM" as seen when you run the "df.show", your sub string code will be new_time.substring(0,15) which will yield "2015-12-06 12:40" pseudo code def getDateTimeSplit = udf((new_time:String) => {
val s = new_time.substring(0,15)
return s
})
... View more
11-08-2018
09:05 AM
You can do that by passing a list of csv files in csv df = sqlContext.read.load("com.databricks.spark.csv").option("header","true").option("inferSchema","true").load(["/tmp/test_1.csv","/tmp/test_2.csv","/tmp/test_3.csv"])
... View more
02-15-2017
11:44 PM
@Maher Hattabi, If it works when connected to your local sandbox, but fails when you try to go to io-dell-svr8:50070, you need to see if your computer can correctly resolve the name io-dell-svr8 and make sure there is no firewall blocking the connection to 50070 on that host. From a cmd prompt, you should be able to 'nslookup io-dell-svr8' to see if the name resolves. Telnet is not installed by default on windows 10, but if you have putty, you can try to connect to port 50070 and see if it connects. If it times out, you have firewalls between you that you will have to have opened. I hope this helps! John
... View more
02-14-2017
09:14 AM
Hi ,same problem here i want to make these inerface hidden and running in background thanks
... View more