Glossary

data warehouse

A centralized, structured database optimized for querying and analytics, integrating historical data from multiple operational systems.

Core meaning

A **data warehouse** is a centralized database designed and structured specifically for querying, reporting, and analytics rather than day‑to‑day transaction processing. It typically stores large volumes of historical, integrated data from multiple source systems in a consistent, well‑defined schema.

In industrial and manufacturing environments, a data warehouse commonly aggregates information from MES, ERP, LIMS, QMS, maintenance, and other OT/IT systems to support cross‑site and long‑term analysis.

Key characteristics

Common characteristics of a data warehouse include:

– **Subject oriented**: Organized around key business domains (e.g., production, quality, maintenance, inventory), not around individual applications.
– **Integrated**: Data from different source systems is reconciled, standardized (units, codes, identifiers), and stored in a common model.
– **Time variant**: Maintains historical snapshots (e.g., daily, batch, or event‑based loads) to enable trend and performance analysis.
– **Non‑volatile**: Once loaded, data is rarely updated or deleted; changes are usually captured as new records to preserve history.
– **Optimized for analytics**: Uses structures such as star or snowflake schemas, facts and dimensions, and indexing strategies that support complex queries and aggregations.

Use in manufacturing and regulated operations

In regulated and multi‑site manufacturing, a data warehouse commonly:

– Consolidates production, quality, supply chain, and maintenance data from multiple plants or suppliers.
– Provides a stable source for **operations intelligence**, KPI dashboards, and management reporting.
– Supports traceability and investigation workflows by combining batch, lot, equipment, and material data from different systems.
– Enables cross‑system analysis (e.g., comparing OEE, yield, or deviation patterns across lines, plants, or contract manufacturers).

The warehouse often sits between operational systems (MES, ERP, historians) and reporting/analytics tools, acting as an intermediate data layer with controlled data models and governance.

Boundaries and what it is not

– **Not an operational database**: It does not typically run production transactions (e.g., MES work execution, ERP order posting). Latency is usually minutes to hours, not real time.
– **Not a data lake**: A data warehouse stores structured, modeled data. A data lake can store raw, semi‑structured, or unstructured data with less predefined structure.
– **Not a single tool or vendor product**: The term describes an architectural role. It may be implemented using different database platforms, cloud services, or appliances.

Common architectures and components

A data warehouse implementation typically includes:

– **Source systems**: MES, ERP, historians, LIMS, QMS, CMMS, supplier portals, and other OT/IT systems.
– **Data integration/ETL or ELT**: Processes that extract data from sources, transform and standardize it (e.g., units of measure, identifiers), and load it into the warehouse.
– **Core warehouse schema**: Fact tables (e.g., production, quality results, maintenance events) and dimension tables (e.g., product, equipment, site, supplier, time).
– **Semantic layer or data marts**: Curated subsets of the warehouse for specific domains such as production performance, quality, or supply chain.
– **Access layer**: BI tools, dashboards, reporting systems, and analytics platforms that query the warehouse.

Relation to MES dashboards and cross‑site views (site context)

When MES dashboards combine data from multiple plants or suppliers, an intermediate data warehouse (or similar analytical store) is often used to:

– Normalize data models across different MES/ERP instances and plants.
– Provide a single, governed source for enterprise‑wide KPIs and cross‑site comparisons.
– Manage versioning and history of production and quality data for long‑term analysis.

In brownfield environments with heterogeneous systems, the data warehouse commonly acts as the consolidation layer where data alignment, validation, and historical structuring are performed before dashboards access it.

Common confusions

– **Data warehouse vs. data mart**: A data mart is usually a smaller, subject‑specific subset or view of the warehouse (e.g., a quality data mart). The warehouse is the broader, enterprise‑level store.
– **Data warehouse vs. data lakehouse / analytics platform**: Some modern platforms combine warehousing and data lake capabilities. The term “data warehouse” still refers to the structured, analytical store component within such platforms.
– **Data warehouse vs. historian**: A process historian stores high‑frequency time‑series data from equipment and sensors. A data warehouse may ingest summarized or selected historian data but is not optimized as a raw time‑series store.

Related FAQ

Let's talk

Ready to See How C-981 Can Accelerate Your Factory’s Digital Transformation?