About sakitha

sakitha · ‎06-16-2021

[mirrormaker-thread-0] Mirror maker thread failure due to java.lang.RuntimeException: { is an invalid regex. at kafka.utils.TopicFilter.<init>(TopicFilter.scala:38) at kafka.utils.Whitelist.<init>(TopicFilter.scala:46) at kafka.tools.MirrorMaker$ConsumerWrapper$$anonfun$init$2.apply(MirrorMaker.scala:316) at kafka.tools.MirrorMaker$ConsumerWrapper$$anonfun$init$2.apply(MirrorMaker.scala:314) at scala.Option.foreach(Option.scala:257) at kafka.tools.MirrorMaker$ConsumerWrapper.init(MirrorMaker.scala:314) at kafka.tools.MirrorMaker$MirrorMakerThread.run(MirrorMaker.scala:215) please help!

sakitha · ‎05-08-2021

I am building a log analysis planform to monitor spark jobs on a yarn cluster and I want to get a clear idea about spark/yarn logging. I have searched a lot about this and these are the confusions I have. The directory specified in spark.eventLog.dir or spark.history.fs.logDirectory get stored all the application master logs and through log4j.properties in spark conf we can customize those logs ? In default all data nodes output their executor logs to a folder in /var/log/. with log-aggregation enabled you can get those executer logs to the spark.eventLog.dir location as well? I've managed to set up a 3 node virtual hadoop yarn cluster, spark installed in the master node. When I'm running spark in client mode I'm thinking this node becomes the application master node. I'm a beginner to Big data and appreciate any effort to help me out with these confusions.

Online	Offline
Last Visited	‎10-20-2021 09:52 AM

Member Since	‎05-08-2021 11:24 PM
Last Visited	‎10-20-2021 09:52 AM
Posts	2

Cloudera Community

MirrorMaker wont start due to java.lang.RuntimeExc...

Which directory spark applications on yarn output ...