Predictive Maintenance ML: Low Performance Despite Preprocessing — Suspecting Structural Issues (Time Leakage, Feature Engineering, Imbalance Handling) #191278
Replies: 2 comments
-
|
Hi @Hari-2782, The Fix:
Fix the time-split first, and you’ll see the real performance of your model |
Beta Was this translation helpful? Give feedback.
-
|
Yes , the main issue likely comes from treating this as a standard tabular classification problem rather than a time-dependent predictive maintenance problem. A few key points to consider:
✅ Takeaway: Predictive maintenance is temporal by nature. Treat your dataset as a time series rather than tabular, use a time-based split, engineer temporal features, and be careful with imbalance techniques to avoid leakage. Mark this as answered please :D |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
🏷️ Discussion Type
Question
Body
Harisanth_vimalarjah_Technical Challenge 2.ipynb
predictive_maintenance_dirty_dataset_3000-45-15-03-2026-2202.csv
predictive_maintenance_improved_dataset_3500-1.xlsx
I am working on a predictive maintenance task using sensor data to predict binary failure (0/1). I have applied preprocessing (outlier handling, encoding, imputation), handled class imbalance using SMOTETomek and class weighting, and trained a tree-based model (XGBoost). I also engineered basic time features (hour, day, month), but used a random train-test split.
Despite this, my model struggles to reliably detect failures.
Could the main issue be that I am incorrectly treating this as a standard tabular classification problem instead of a time-dependent problem? Specifically:
I would appreciate guidance on whether my pipeline has structural flaws and how to properly approach failure prediction in this type of sensor dataset.
Guidelines
Beta Was this translation helpful? Give feedback.
All reactions