Created on 10-04-2017 07:49 PM - edited 09-16-2022 08:47 AM
Sean,
I recall there is a local computation mode in Oryx 1.x
This is useful for us to play ALS with smaller data set and see how it performs without going into all the Spark, Kafka setup.
However; I cannot locate the releases (or git source). Where can I find it ?
Thanks.
Created 10-04-2017 08:08 PM
Oh, I forgot, we have made many obsolete repos in github.com/cloudera private. I can still see it but of course you can't.
Here's a tarball of the final release:
https://drive.google.com/open?id=0B_hfrkaWlLi4MVlxQWVJaVd0ZGs
If there's any significant demand, I could revive the repo in my personal account
Created 10-04-2017 07:55 PM
That implementation is obsolete at this point, I'd say, but sure you're welcome to go dig it out. It worked well.
The releases and source are still on the 1.x project site:
Created 10-04-2017 08:03 PM
Sean,
Thanks for the quick reply.
That links show GitHub 404.
Perhaps it's not public available any more.
Can you fix it ?
Thanks.
Created 10-04-2017 08:08 PM
Oh, I forgot, we have made many obsolete repos in github.com/cloudera private. I can still see it but of course you can't.
Here's a tarball of the final release:
https://drive.google.com/open?id=0B_hfrkaWlLi4MVlxQWVJaVd0ZGs
If there's any significant demand, I could revive the repo in my personal account
Created 10-04-2017 08:19 PM
Sean,
Got it and downloaded it.
It would be nice if you can revive the repo. I used Oryx 1.0 before and it worked very well.
If you cannot make the repo in short term, can you pass along the latest release jar files ?
Thanks.
Created 10-06-2017 12:05 AM
Are you looking for the .jar files that were produced as part of the release? those are still in the repo and will stay there indefinitely as far as I know, just because it could be part of people's builds:
https://repository.cloudera.com/artifactory/cloudera-repos/com/cloudera/oryx/
Created 10-06-2017 10:53 AM
I maven built from sources of Oryx 1 and ran well.
Good to know the link, though.
Thanks
Created 10-24-2017 04:06 PM
Sean,
Two more questions about Oryx 1:
(1)
It has "model.local-data=false"
I set it to false to use HDFS
I am thinking "model.instance-dir=..." will point to HDFS.
My question is whether it supports S3? Something like "model.instance-dir=s3://bucket/..."
(2) Is it running with Java 7 as well (in local computation mode) ?
Thanks.
Created 10-25-2017 02:52 AM
Even I have forgotten exactly how it works off the top of my head, but yes, you are correct that you should be able to use HDFS paths. Yes it runs on Java 7 -- or 8, I believe, though I don't recall if that was tested. It doesn't require Java 8.
Created 10-25-2017 10:16 AM
Sean,
Not clear what you mean "...you should be able to use HDFS paths....". Yeah, I know that for sure HDFS path. However, I am refering to S3 path, somthing like "s3://bucket/....".
Is S3 path possible ?