Introduction to Snowflake
Snowflake, a data warehouse, runs entirely on cloud infrastructure.
It's faster, more user-friendly, and much more flexible than traditional data warehouses.
Snowflake uses a new SQL database engine that has a unique architecture for cloud computing.
Only available in AWS and Azure clouds
Snowflake is a hybrid architecture that can be shared and shared without sharing. It has three layers.
Storage Layer
Compute Layer/Query Processing layer
Cloud Services Layer
Storage Layer
Snowflake organizes data into micro-partitions that are internal optimized and compressed.
Snowflake is provisioned in the Cloud. Storage is elastic and charged according to the usage per month.
Compute Layer
Snowflake uses the "Virtual Warehouse", to run queries.
Virtual Warehouses are clusters of MPP compute nodes that have CPU and memory provisioned in the cloud.
Multiple Virtual Warehouses have been created to manage the workload.
Cloud Services Layer:
Cloud service layer handles all the authentication, security, load management, and query optimization activities.
What are we learning from each other?
Snowflake Cloud Data Warehouse offers many interesting features such as Time Travel, Copy Clone, Secure Data Sharing, and more