FAQ

What data quality rules should I enforce before calculating KPIs?

Enforce data quality rules at two levels: first on the source data itself, and then on the KPI definition logic. If either layer is weak, the KPI can be numerically correct and still operationally misleading.

The minimum rule set usually includes:

  • Definition control: Every KPI needs an approved definition for numerator, denominator, exclusions, time basis, aggregation method, and source systems. If plants or functions use different interpretations, do not combine the results.
  • Timestamp integrity: Check that event times are present, in the correct timezone, sequenced logically, and tied to the right production day or shift boundary. A large share of KPI distortion comes from bad time handling rather than bad arithmetic.
  • Completeness thresholds: Set explicit minimum completeness rules for required fields. For example, reject or flag records missing work order, part, operation, equipment, quantity, disposition, or start/stop times when those fields are required for the KPI.
  • Duplicate and replay detection: Prevent double-counting from interface retries, manual re-entry, batch reloads, or overlapping integrations. Brownfield MES, ERP, historian, and spreadsheet workflows commonly create this problem.
  • Unit and measure consistency: Validate units of measure, quantity precision, currency basis, and conversion logic before aggregation. Mixed units can quietly corrupt yield, throughput, scrap, or cost KPIs.
  • Master data alignment: Enforce valid references for part, revision, routing step, asset, work center, supplier, and reason code. If master data is stale or inconsistent across systems, the KPI may be directionally wrong even when all transactions are present.
  • Status and lifecycle rules: Only include records in valid business states. For example, decide whether cancelled orders, quarantined material, rework loops, partially closed operations, and late backflushes are included or excluded, and apply that rule consistently.
  • Range and plausibility checks: Reject or quarantine impossible values such as negative runtime, scrap greater than input, cycle time outside physical limits, or completion before release.
  • Exception handling: Define how to treat missing values, late-arriving records, manual overrides, estimated values, and sensor dropouts. Silent defaulting to zero is often the wrong choice.
  • Lineage and traceability: Preserve the source, transformation path, version of the KPI logic, and any manual adjustments. In regulated environments, this matters for internal review, investigations, and change control, even when the KPI is not itself a formal quality record.

A practical rule is this: if you cannot explain why a record was included, excluded, corrected, or rolled up, you are not ready to use the KPI for management decisions.

What should be blocked versus flagged

Not every data issue should stop KPI calculation. Some should block publication, and some should publish with a visible warning. That threshold depends on process criticality, reporting purpose, and data maturity.

  • Block publication when definition conflicts exist, timestamps are unreliable, duplicates are unresolved, or required joins to master data fail at material volume.
  • Publish with warning when late data is below a defined threshold, a small number of records are quarantined, or a noncritical enrichment field is missing.
  • Allow provisional values only if they are clearly marked as preliminary and recalculation rules are documented.

If the KPI is used for cross-plant benchmarking, capacity commitments, supplier escalation, or quality management review, the tolerance for provisional or partially qualified data should be much lower.

Rules that are often missed

  • Shift calendar governance: OEE, downtime, and labor metrics break when shift calendars, holiday rules, or planned downtime definitions differ by site.
  • Reason code quality: Downtime, scrap, and rework KPIs are only as good as the classification discipline behind them. Free-text categories usually degrade comparability.
  • Revision effectivity: Part revision, routing version, and work instruction version can change denominator assumptions. Trend lines across revisions may not be comparable without segmentation.
  • Rework and nonconformance treatment: Decide whether rework counts as additional throughput, lost capacity, quality loss, or all three in different KPIs. This is a business rule, not a math detail.
  • Manual data provenance: If operators, supervisors, or planners can edit source records, capture who changed what, when, and why.

Brownfield reality

In mixed environments, you usually cannot enforce all rules inside one platform. Data may originate in ERP, MES, QMS, historian, CMMS, spreadsheets, or supplier portals, each with different timing and semantics. In that case, define a canonical KPI layer with explicit validation rules at system boundaries.

Do not assume a full system replacement is the best fix. In regulated, long-lifecycle operations, replacement programs often fail or stall because of qualification burden, validation cost, downtime risk, integration complexity, and the need to preserve traceability across legacy processes. In many plants, a governed coexistence model is more realistic: validate source data where it originates, reconcile key entities across systems, and version-control KPI logic centrally.

Minimum release criteria before a KPI is trusted

  1. The KPI definition is approved and version controlled.
  2. Required source fields meet documented completeness thresholds.
  3. Duplicate handling and late-arriving data rules are tested.
  4. Master data mappings are reconciled across contributing systems.
  5. Exception rates are measured and visible.
  6. Recalculation and restatement rules are documented.
  7. An owner is assigned for both the KPI definition and the data quality controls.

If those controls are not in place, you can still calculate the KPI, but you should treat it as indicative, not authoritative.

Get Started

Built for Speed, Trusted by Experts

Whether you're managing 1 site or 100, Connect 981 adapts to your environment and scales with your needs—without the complexity of traditional systems.

Get Started

Built for Speed, Trusted by Experts

Whether you're managing 1 site or 100, C-981 adapts to your environment and scales with your needs—without the complexity of traditional systems.