Run a prediction to see results here
ManuFast: Small Language Model for Predictive Maintenance AI
Paste a structured industrial sensor log and get fault detection, remaining useful life, quality anomaly flags, and a plain-English diagnostic - in a single forward pass.
~30M params
lightweight
3 machine types
turbofan · bearing · CNC
4 simultaneous outputs
multi-task learning
The Core Idea
Industrial machines: jet engines, factory bearings, CNC cutting tools - degrade gradually over time. Mechanics and operators already document this degradation by writing maintenance logs describing sensor readings, anomalies, and operating conditions. ManuFast treats those logs as natural language and runs them through a transformer to extract actionable insights.
Unlike traditional statistical models that handle one task at a time, ManuFast performs four predictions simultaneously: whether a fault is present, how many operational cycles remain before failure, whether quality is within tolerance, and a plain-English explanation of what it found. One model, one pass, four outputs.
The Three Machines
ManuFast was trained on three real industrial datasets, each representing a different machine type with its own sensor signature and failure modes.
How the Model Works
A hybrid encoder-decoder transformer trained end-to-end with a multi-task loss.
// Encoder path
// Classification heads (simultaneous)
├── Fault Head (binary classifier, 30% loss weight)
├── RUL Head (regression, 30% loss weight)
└── Quality Head (binary classifier, 20% loss weight)
// Decoder path
Multi-task learning
One forward pass produces four outputs. Shared encoder representations improve all tasks simultaneously.
Weight tying
Encoder and decoder share their embedding matrices — saves ~2.7M parameters with no accuracy cost.
Custom BPE tokenizer
7,000-token vocabulary trained specifically on industrial sensor logs. No pretrained tokenizer used.
Training Results
Evaluated on held-out test sets from each dataset.
Training setup
Precision
BF16
Epochs
20
Batch size
64
Warmup steps
1,000
GPU
L4 (24 GB)
RAM
53 GB
Tech Stack
PyTorch
Model training and inference
Custom BPE Tokenizer
7K vocab, trained from scratch
CMAPSS Dataset
NASA Jet Engine Simulation data
NASA IMS Bearing Dataset
Rolling element bearing data
AI4I 2020 Dataset
CNC machine failure data
Built from scratch - No Hugging Face models used
Custom tokenizer, custom architecture, custom training loop.
A huge shoutout to our dataset providers and all the contributors who made this project possible. Dataset links mentioned in the footer below.