About balakumar_b05

bhagan · ‎05-26-2017

I think if one of the columns in the dataframe is the key of the HBase table, the lookup will be very efficient; that is, if there is a bottleneck, I don't believe the bottleneck will be the lookup.

bwalter1 · ‎02-23-2017

You could also do in the Spark code: import org.apache.log4j.{Level, Logger} def main(args: Array[String]) = { Logger.getRootLogger.setLevel(Level.ERROR) var conf = new SparkConf().setAppName("KafkaToHdfs") val sc = new SparkContext(conf)

Online	Offline
Last Visited	‎05-12-2017 06:27 AM

Member Since	‎02-18-2017 10:55 AM
Last Visited	‎05-12-2017 06:27 AM
Posts	10

Cloudera Community

Re: How to perform lookup operation in spark dataf...

Re: How to override default log4j properties in ya...