Member since: 09-06-2016
Posts: 108
Kudos Received: 36
Solutions: 11
My Accepted Solutions
Title | Views | Posted
---|---|---
 | 2525 | 05-11-2017 07:41 PM
 | 1188 | 05-06-2017 07:36 AM
 | 6630 | 05-05-2017 07:00 PM
 | 2318 | 05-05-2017 06:52 PM
 | 6363 | 05-02-2017 03:56 PM
03-29-2017
06:02 PM
I'm using the email alert processor as per demo instructions.
03-29-2017
10:25 AM
Hi, I've created the demo topology from the SAM docs. When deploying, it fails instantly with the following message: An exception with message [/tmp/b7ea53f1-77ca-495f-b9f7-ed28f3097ac1.jar (No such file or directory)] was thrown while processing request.
03-22-2017
10:25 AM
1 Kudo
Hi, if you remove the ext-2.2 component from '/usr/hdp/current/oozie-server/oozie.war', you should be able to start the service:

zip -d oozie.war 'ext-2.2/*'

Regards, ward
03-19-2017
08:59 PM
Thx @yvora !
03-18-2017
01:32 PM
When running Spark code in Zeppelin via the Livy interpreter, I only see a few containers allocated in YARN. What settings do I need to change to make sure I leverage full cluster capacity? I'm using a cluster created by Hortonworks Data Cloud on AWS.
Labels:
- Apache Spark
- Apache Zeppelin
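Not part of the original post: a sketch of the Zeppelin Livy interpreter properties that typically control executor allocation (the Livy interpreter forwards `livy.spark.*` settings to the Spark session; the values below are illustrative only):

```properties
# Fixed sizing: request a specific number of executors from YARN
livy.spark.executor.instances=10
livy.spark.executor.cores=4
livy.spark.executor.memory=4g

# Or let Spark scale executor count with load instead
livy.spark.dynamicAllocation.enabled=true
livy.spark.dynamicAllocation.minExecutors=2
livy.spark.dynamicAllocation.maxExecutors=20
```

After changing these in Zeppelin's interpreter settings, restart the Livy interpreter so a new session picks them up.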
03-17-2017
09:02 PM
Solved by not having too many partitions for parallelize.
03-16-2017
08:18 PM
The exception seems to happen when nFiles is larger, like 1000, not when it's 10.

spark-submit --master yarn-cluster --class com.cisco.dfsio.test.Runner hdfs:///user/$USER/mantl-apps/benchmarking-apps/spark-test-dfsio-with-dependencies.jar --file data/testdfsio-write --nFiles 1000 --fSize 200000 -m write --log data/testdfsio-write/testHdfsIO-WRITE.log

By the way: this is not my code.
03-16-2017
06:56 PM
When running this small piece of Scala code I get an "org.apache.hadoop.mapred.FileAlreadyExistsException: Output directory hdfs://xxx.eu-west-1.compute.internal:8020/user/cloudbreak/data/testdfsio-write". Below is the piece of code where `saveAsTextFile` is executed. The directory does not exist before running this script. Why is this FileAlreadyExistsException being raised?

// Create a Range and parallelize it, on nFiles partitions.
// The idea is to have a small RDD partitioned on a given number of workers,
// then each worker will generate data to write.
val a = sc.parallelize(1 until config.nFiles + 1, config.nFiles)
val b = a.map(i => {
  // Generate an array of Byte (8 bit) with dimension fSize,
  // fill it up with "0" chars, and make it a string so it can be saved as text.
  // TODO: this approach can still cause memory problems in the executor if the array is too big.
  val x = Array.ofDim[Byte](fSizeBV.value).map(x => "0").mkString("")
  x
})
// Force computation on the RDD
sc.runJob(b, (iter: Iterator[_]) => {})
// Write the output file
val (junk, timeW) = profile {
  b.saveAsTextFile(config.file)
}
Labels:
- Apache Hadoop
- Apache Spark
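Not an answer from the original thread, but one common cause of this exception is a stale output directory left behind by an earlier or retried attempt of the job. A minimal workaround sketch in Scala, reusing the names `sc`, `config.file`, and the RDD `b` from the post above (hypothetical, not tested against this job):

```scala
import org.apache.hadoop.fs.{FileSystem, Path}

// Sketch: remove a stale output directory before writing,
// so saveAsTextFile does not hit FileAlreadyExistsException.
val outPath = new Path(config.file)
val fs = outPath.getFileSystem(sc.hadoopConfiguration)
if (fs.exists(outPath)) fs.delete(outPath, true) // recursive delete
b.saveAsTextFile(config.file)
```

If the directory really never exists before the run, it is worth checking YARN for a failed first attempt of the same application, since a retried attempt will find the directory created by its predecessor.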