A data warehouse is a large-scale database designed for querying massive sets of data. Typically, businesses archive data from multiple databases into a data warehouse so that they can run analytics across all of their data sets. Cloud computing makes data warehousing more attainable: you launch a data warehouse cluster and pay for how long it runs and how much data it stores. Amazon offers Amazon Redshift as its data warehouse service, while Microsoft offers Azure SQL Data Warehouse (since rebranded as part of Azure Synapse Analytics). To get data into a data warehouse, data is typically extracted from other sources, transformed, and loaded into the warehouse so that analysts can query it. This process is known as ETL (extract, transform, load), and it requires tools and developers to build and maintain.
What do we mean by this?
Your pantry full of data