Skip to main content

🗓️ 27052025 1356
📎

data_warehouse_naming_conventions

Common Layer Prefixes

PrefixMeaningUsage
ods_Operational Data StoreRaw, unprocessed data ingested from sources (e.g., logs, Kafka, RDS dumps). Often partitioned by day/hour.
dwd_Data Warehouse DetailCleansed and detailed data, usually 1:1 with business events/entities. Normalized and enriched.
dws_Data Warehouse SummaryAggregated data for specific business domains. Used for reporting, metrics, dashboards.
ads_Application Data StoreFinal layer used directly by applications or APIs (reports, dashboards, downstream services).
dim_Dimension TableLookup tables with descriptive attributes (e.g., dim_user, dim_region).
tmp_Temporary TableIntermediate tables used in ETL processing. Not meant for long-term use.
rpt_Report TableTables generated specifically for business reports, often updated on schedule.

Suffixes / Contextual Tags

Suffix / TagMeaningNotes
_diDaily IncrementalUpdated once per day, may include only changes (upserts/inserts).
_fiFull IncrementalFull daily snapshot, usually for slowly changing dimensions or syncs.
_riReal-Time IncrementalUpdated in near-real-time, often with Flink/Kafka pipelines.
_hiHistorical IncrementalStores historical data with full SCD (slowly changing dimension) support. Usually partitions over long time spans.
_fullFull SnapshotTypically refers to a full data snapshot without incremental logic.
_bak / _backupBackup TableUsed for temporary backups, often before destructive operations.

References

  • ChatGPT