Community Articles
Find and share helpful community-sourced technical articles
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.
Rising Star

An article on the challenges and solutions to predicting machine failures in the field.

The full details can be found here: https://github.com/kirkhas/zeppelin-notebooks/tree/master/Preventive_maintenance

Step #1 Feature Selection

64940-corrimg.png

Step #2 Geolocation

64941-map.png

Step #3 - Scythe is a time-series library authored by Kirk Haslbeck for these purposes

- Needed to Resample the data into trips or route segments (Scythe Resample)

- Needed to Step Interpolate the miles since last service to be 4K, 5K and less continuous regression

64942-time-series.png

Step #4

- Indexing and OneHotEncoding to the Rescue. Found a relationship of a particular "Make" that was more problematic than most.

64943-categorical.png

Roc Curve - A near perfect model

64944-roc.png

631 Views
Comments
New Contributor

Nice work!

New Contributor

Awesome work Kirk!

Super Guru

This is awesome.

Don't have an account?
Coming from Hortonworks? Activate your account here
Version history
Revision #:
2 of 2
Last update:
‎08-17-2019 08:01 AM
Updated by:
 
Contributors
Top Kudoed Authors