Member since: 08-19-2015
Posts: 13
Kudos Received: 0
Solutions: 0
09-03-2015 03:18 PM
Another question about Oryx 2: the CSV training data includes a Unix timestamp. (1) What is it used for? (2) Does it matter whether it is in seconds or milliseconds? Thanks, Jason
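To illustrate why the unit matters in question (2), here is a minimal sketch showing how the same integer reads as two very different dates depending on whether it is taken as seconds or milliseconds. The helper `as_utc` and the sample value are hypothetical and for illustration only; this does not show which unit Oryx 2 expects.

```python
from datetime import datetime, timezone


def as_utc(ts, unit):
    """Interpret an integer Unix timestamp in the given unit ('s' or 'ms')."""
    if unit not in ("s", "ms"):
        raise ValueError("unit must be 's' or 'ms'")
    # Convert milliseconds to seconds before building the datetime.
    seconds = ts / 1000 if unit == "ms" else ts
    return datetime.fromtimestamp(seconds, tz=timezone.utc)


# Hypothetical sample value, not taken from any real training file.
raw = 1441293480

print("read as seconds:     ", as_utc(raw, "s"))   # a date in 2015
print("read as milliseconds:", as_utc(raw, "ms"))  # a date in January 1970
```

If the timestamp is used for anything time-dependent, a mismatch in unit shifts every record by a factor of 1000, so it seems worth confirming the expected unit before generating the CSV.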
08-25-2015 07:08 PM
Quick check: does this imply that the Oryx 2 serving layer can read the model directly from HDFS (if the model is big)? Thanks, Jason
08-23-2015 11:14 AM
Sean,

Several questions about Oryx 2:

(1) I know Oryx 2 uses Kafka for its data pipeline. Does Oryx 2 also use Spark Streaming?
(2) Regarding the update and input topics stored in Kafka: if the model is big (say, ~50 GB), it takes up Kafka memory (and disk), right? Is there a way for the serving layer to get the model from HDFS directly, while the speed layer can still approximate predictions based on real-time events?
(3) Is the model stored in Kafka distributed across the cluster nodes?

Thanks,
Jason
Labels: Apache Kafka, Apache Spark, HDFS
08-22-2015 07:19 PM
Sean,

I tried to run Oryx on a node in the same LAN as the Hadoop cluster. We tested Oryx 1 without problems (we used to have a firewall issue; after moving the node to the same LAN as the Hadoop cluster, it runs fine). We have just started testing Oryx 2 on the same network (that is, no firewall issues). I do have /etc/hadoop/config on the node where I am running Oryx 2. However, I got the following errors when starting the Oryx 2 batch layer. It looks like it is looking for Cloudera CDH jar files. Any thoughts? Do I need to copy the jar files over?

Errors:
ls: cannot access /opt/cloudera/parcels/CDH/jars/zookeeper-*.jar: No such file or directory
ls: cannot access /opt/cloudera/parcels/CDH/jars/spark-assembly-*.jar: No such file or directory

Thanks,
Jason
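The two `ls` errors mean that the jar patterns match no files on this node. A minimal sketch, assuming Python is available on the node, for checking which of the patterns from the error output have no matches. The helper name `missing_patterns` is hypothetical and not part of Oryx; the two paths are copied from the errors.

```python
import glob

# Patterns copied verbatim from the error output.
JAR_PATTERNS = [
    "/opt/cloudera/parcels/CDH/jars/zookeeper-*.jar",
    "/opt/cloudera/parcels/CDH/jars/spark-assembly-*.jar",
]


def missing_patterns(patterns):
    """Return the glob patterns that match no file on this machine."""
    return [p for p in patterns if not glob.glob(p)]


if __name__ == "__main__":
    missing = missing_patterns(JAR_PATTERNS)
    for pattern in missing:
        print("no match:", pattern)
    if not missing:
        print("all jar patterns matched at least one file")
```

If both patterns are reported as missing, the node likely has the Hadoop configuration but not the CDH parcel itself under /opt/cloudera/parcels/CDH.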