About JasonChen

JasonChen · ‎09-03-2015

Another question about Oryx 2. The CSV training data is with Unix timestamp. (1) What's it for ? (2) Does it matter in the unit of seconds or milliseconds ? Thanks. Jason

JasonChen · ‎08-25-2015

Quick check... Does it imply Oryx 2 serving layer can read model from HDFS directly (if the model is big)? Thanks. Jason

JasonChen · ‎08-23-2015

Sean, Several questions about Oryx 2: (1) I know Oryx 2 uses kafka for data pipeline. Does Oryx2 also use Spark Streaming ? (2) Regarding the update and input topics saved with kafka... If the model is big (say, ~50 GB), it occupies kafka mem (and disk) usage.. right ? Is there a way that serving layer getting model from HDFS directly, while speed layer still able to approximate the predictions based on real-time events ? (3) Is the model saved in kafka distributed across the cluster nodes ? Thanks. Jason

JasonChen · ‎08-22-2015

Sean, I tried to run Oryx in a node that in the same LAN as the Hadoop cluster. We tested Oryx 1 fine without problems (we used to have firewall issue. After moving node to the same LAN as Hadoop cluster, it runs fine).... We just start to test Oryx 2, using the same network (that's, no firewall issues). I do have the /etc/hafoop/config in the node I am running Oryx 2. However; I got the following errors when starting Oryx 2 batch layer.. It looks it's looking for cloudera CDH jar files... Any thought? I need to copy the jar files over ? errors: ls: cannot access /opt/cloudera/parcels/CDH/jars/zookeeper-*.jar: No such file or directory ls: cannot access /opt/cloudera/parcels/CDH/jars/spark-assembly-*.jar: No such file or directory Thanks. Jason

Online	Offline
Last Visited	‎03-13-2016 10:21 AM

Member Since	‎08-19-2015 10:25 AM
Last Visited	‎03-13-2016 10:21 AM
Posts	13

Cloudera Community

Re: Overall questions about Oryx 2

Re: Overall questions about Oryx 2

Overall questions about Oryx 2

Re: Run Oryx on a machine that is not part of the ...