Community Articles

Find and share helpful community-sourced technical articles.
Announcements
Celebrating as our community reaches 100,000 members! Thank you!
avatar
Expert Contributor

An article on the challenges and solutions to predicting machine failures in the field.

The full details can be found here: https://github.com/kirkhas/zeppelin-notebooks/tree/master/Preventive_maintenance

Step #1 Feature Selection

64940-corrimg.png

Step #2 Geolocation

64941-map.png

Step #3 - Scythe is a time-series library authored by Kirk Haslbeck for these purposes

- Needed to Resample the data into trips or route segments (Scythe Resample)

- Needed to Step Interpolate the miles since last service to be 4K, 5K and less continuous regression

64942-time-series.png

Step #4

- Indexing and OneHotEncoding to the Rescue. Found a relationship of a particular "Make" that was more problematic than most.

64943-categorical.png

Roc Curve - A near perfect model

64944-roc.png

2,564 Views
Comments
avatar
Contributor

Nice work!

avatar
Cloudera Employee

Awesome work Kirk!

avatar
Master Guru

This is awesome.