What is a Data Warehouse
Structured storage for analytics
A data warehouse is a centralized, structured store for business analytics that consolidates data from multiple source systems into a single unified model.
Data Warehouse Architecture
| Layer | Description |
|-------|-------------|
| Staging Area | Intermediate data loading zone |
| ODS | Operational Data Store |
| Data Warehouse | Main storage (facts + dimensions) |
| Data Marts | Department-specific views (sales, marketing) |
Modeling Schemas
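- Star Schema: a central fact table joined directly to denormalized dimension tables
- Snowflake Schema: a star schema whose dimensions are normalized into sub-tables
- Galaxy Schema: multiple fact tables that share dimension tables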
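
As a rough illustration of the star schema described above, the sketch below builds one fact table and two dimension tables in an in-memory SQLite database and runs a typical analytical query. All table names, column names, and sample rows (`fact_sales`, `dim_product`, `dim_date`) are hypothetical and chosen only for this example. SQLite stands in for a real warehouse engine.

```python
import sqlite3

# In-memory database; every table, column, and row here is hypothetical.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE dim_product (
        product_id INTEGER PRIMARY KEY,
        name       TEXT,
        category   TEXT
    );
    CREATE TABLE dim_date (
        date_id  INTEGER PRIMARY KEY,
        iso_date TEXT,
        year     INTEGER
    );
    -- The fact table holds measures plus a foreign key to each dimension.
    CREATE TABLE fact_sales (
        product_id INTEGER REFERENCES dim_product(product_id),
        date_id    INTEGER REFERENCES dim_date(date_id),
        quantity   INTEGER,
        amount     REAL
    );
""")

conn.executemany(
    "INSERT INTO dim_product VALUES (?, ?, ?)",
    [(1, "Laptop", "Electronics"), (2, "Desk", "Furniture")],
)
conn.executemany(
    "INSERT INTO dim_date VALUES (?, ?, ?)",
    [(20240101, "2024-01-01", 2024), (20240102, "2024-01-02", 2024)],
)
conn.executemany(
    "INSERT INTO fact_sales VALUES (?, ?, ?, ?)",
    [(1, 20240101, 2, 2000.0), (1, 20240102, 1, 1000.0), (2, 20240101, 3, 450.0)],
)

# Typical star-schema query: join the fact table to a dimension and aggregate.
revenue_by_category = conn.execute("""
    SELECT p.category, SUM(f.amount) AS revenue
    FROM fact_sales AS f
    JOIN dim_product AS p ON p.product_id = f.product_id
    GROUP BY p.category
    ORDER BY p.category
""").fetchall()

print(revenue_by_category)  # [('Electronics', 3000.0), ('Furniture', 450.0)]
```

In a snowflake variant of the same model, `category` would move out of `dim_product` into its own table, trading one extra join for less redundancy.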
ETL Process
| Stage | Description |
|-------|-------------|
| Extract | Extraction from sources |
| Transform | Cleaning, transformation, aggregation |
| Load | Loading into warehouse |
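
The three stages in the table can be sketched as three small functions. This is a minimal illustration under stated assumptions, not a production pipeline: the inline CSV stands in for a source system, SQLite stands in for the warehouse, and the names `RAW_CSV`, `extract`, `transform`, `load`, and `customer_totals` are all hypothetical.

```python
import csv
import io
import sqlite3

# Hypothetical raw export from a source system (note the messy values).
RAW_CSV = """order_id,customer,amount
1, alice ,10.50
2,BOB,
3,alice,4.50
"""

def extract(text):
    """Extract: read raw rows from the source as dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))

def transform(rows):
    """Transform: clean names, drop rows without an amount, aggregate per customer."""
    totals = {}
    for row in rows:
        amount = row["amount"].strip()
        if not amount:
            continue  # cleaning step: skip incomplete records
        customer = row["customer"].strip().lower()
        totals[customer] = totals.get(customer, 0.0) + float(amount)
    return sorted(totals.items())

def load(conn, records):
    """Load: write the transformed records into the target table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS customer_totals "
        "(customer TEXT PRIMARY KEY, total REAL)"
    )
    conn.executemany("INSERT OR REPLACE INTO customer_totals VALUES (?, ?)", records)
    conn.commit()

conn = sqlite3.connect(":memory:")
load(conn, transform(extract(RAW_CSV)))

loaded = conn.execute(
    "SELECT customer, total FROM customer_totals ORDER BY customer"
).fetchall()
print(loaded)  # [('alice', 15.0)]
```

Keeping each stage as a separate function makes it possible to test the cleaning rules without touching the source or the target.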
Popular Solutions
| Solution | Type |
|----------|------|
| Snowflake | Cloud-native |
| Amazon Redshift | AWS |
| Google BigQuery | GCP |
| Azure Synapse | Microsoft |
| Teradata | Enterprise on-premise |
| Vertica | Columnar analytics |
Benefits
- Single source of truth
- Historical data preserved through slowly changing dimensions (SCD)
- Storage and queries optimized for analytics
- Reporting consistency
- OLTP/OLAP workload separation
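
The historical-data benefit is usually delivered through slowly changing dimensions. The sketch below assumes the common "Type 2" variant, in which a changed attribute closes the current row and appends a new one instead of overwriting it. The source does not say which SCD type is meant, so treat this as one possible reading. The names `CustomerVersion`, `apply_scd2`, and `city_as_of` are hypothetical.

```python
from dataclasses import dataclass
from datetime import date
from typing import List, Optional

@dataclass
class CustomerVersion:
    """One row of a hypothetical customer dimension with validity dates."""
    customer_id: int
    city: str
    valid_from: date
    valid_to: Optional[date] = None  # None marks the current version

def apply_scd2(history: List[CustomerVersion], customer_id: int,
               new_city: str, change_date: date) -> None:
    """Close the current version (if the city changed) and append a new one."""
    current = next(
        (v for v in history if v.customer_id == customer_id and v.valid_to is None),
        None,
    )
    if current is not None:
        if current.city == new_city:
            return  # nothing changed, keep the existing row
        current.valid_to = change_date
    history.append(CustomerVersion(customer_id, new_city, change_date))

def city_as_of(history: List[CustomerVersion], customer_id: int,
               as_of: date) -> Optional[str]:
    """Return the city that was valid on the given date, or None if unknown."""
    for v in history:
        if (v.customer_id == customer_id
                and v.valid_from <= as_of
                and (v.valid_to is None or as_of < v.valid_to)):
            return v.city
    return None

history: List[CustomerVersion] = []
apply_scd2(history, 1, "Berlin", date(2023, 1, 1))
apply_scd2(history, 1, "Munich", date(2024, 6, 1))

print(city_as_of(history, 1, date(2023, 12, 31)))  # Berlin
print(city_as_of(history, 1, date(2024, 6, 1)))    # Munich
```

Because old rows are closed rather than overwritten, a report for any past date can be reproduced with the attribute values that were valid at that time.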