Member since 02-11-2019 | 3 Posts | 1 Kudos Received | 1 Solution
04-02-2018 05:25 PM
Try creating an external table first, then create an ORC table, and read from the external table into the ORC table. Docs here: https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.6.4/bk_data-access/content/moving_data_from_hdfs_to_hive_external_table_method.html
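As a rough illustration of that two-step approach, here is a small Python sketch that builds the three HiveQL statements involved, so they can be passed to whichever Hive client you use. The table names (`cpu_staging`, `cpu_orc`), the columns, the comma delimiter, and the HDFS path are hypothetical placeholders, not details from the original question.

```python
def orc_load_statements(staging, target, columns, hdfs_dir, delimiter=","):
    """Return HiveQL statements that load delimited files into an ORC table.

    Every name passed in is illustrative; adjust to your own schema.
    """
    cols = ", ".join(f"{name} {hive_type}" for name, hive_type in columns)
    return [
        # 1. External table that points at the raw files already in HDFS.
        f"CREATE EXTERNAL TABLE IF NOT EXISTS {staging} ({cols}) "
        f"ROW FORMAT DELIMITED FIELDS TERMINATED BY '{delimiter}' "
        f"STORED AS TEXTFILE LOCATION '{hdfs_dir}'",
        # 2. Managed table with the same columns, stored as ORC.
        f"CREATE TABLE IF NOT EXISTS {target} ({cols}) STORED AS ORC",
        # 3. Copy the rows across; Hive writes them out as ORC on insert.
        f"INSERT OVERWRITE TABLE {target} SELECT * FROM {staging}",
    ]


# Hypothetical example values, for illustration only.
statements = orc_load_statements(
    staging="cpu_staging",
    target="cpu_orc",
    columns=[("host", "STRING"), ("cpu_pct", "DOUBLE")],
    hdfs_dir="/tmp/cpu_raw",
)
for stmt in statements:
    print(stmt + ";")
```

Once the insert has finished, the external staging table can be dropped if you no longer need it; dropping an external table leaves the underlying files in HDFS untouched.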
04-02-2018 05:19 PM | 1 Kudo
Do you have a target variable that you can predict? Or do you have logic that would let you convert a "low" CPU value into a target variable? Spark offers a wide variety of classification models: https://spark.apache.org/docs/latest/mllib-classification-regression.html If you are interested in seeing which factor is contributing to a specific instance, I would recommend starting with a logistic regression model. Its coefficients are easy to interpret, so it gives more insight into which factor is contributing to a particular CPU failure.
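To show what "explanatory power" means here, below is a dependency-free Python sketch of logistic regression fitted by plain gradient descent. It is not Spark MLlib code, only an illustration of the idea. The feature names (`cpu_load`, `disk_io`), the toy data, and the learning rate and epoch count are all made-up assumptions.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def fit_logistic(rows, labels, lr=1.0, epochs=5000):
    """Fit weights and bias by batch gradient descent on the log loss."""
    n_features = len(rows[0])
    weights = [0.0] * n_features
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for x, y in zip(rows, labels):
            # Prediction error for this row: predicted probability minus label.
            err = sigmoid(sum(w * v for w, v in zip(weights, x)) + bias) - y
            for j, v in enumerate(x):
                grad_w[j] += err * v
            grad_b += err
        weights = [w - lr * g / len(rows) for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / len(rows)
    return weights, bias


def contributions(weights, x, names):
    """Per-feature contribution (weight * value) to one instance's log-odds."""
    return {name: w * v for name, w, v in zip(names, weights, x)}


# Hypothetical toy data: [cpu_load, disk_io], both scaled to 0..1.
names = ["cpu_load", "disk_io"]
rows = [
    [0.9, 0.2], [0.9, 0.8], [0.7, 0.5], [0.4, 0.5],  # failures (label 1)
    [0.1, 0.2], [0.1, 0.8], [0.6, 0.5], [0.3, 0.5],  # healthy (label 0)
]
labels = [1, 1, 1, 1, 0, 0, 0, 0]

weights, bias = fit_logistic(rows, labels)
print(dict(zip(names, weights)))
# Which factor drives the prediction for one specific instance?
print(contributions(weights, [0.9, 0.5], names))
```

A larger positive contribution means that factor pushes this instance harder toward the "failure" class. Comparing contributions this way only makes sense when the features are on comparable scales, which is why the toy inputs are scaled to the 0..1 range.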
04-02-2018 05:11 PM
Are you trying to create an ORC table in the create table interface? When creating an ORC-formatted table, a temporary external table has to be created first; the ORC table can then be created from that external table.