Making Sense of Schema-on-Read: Modeling JSON
Making Sense of Schema-on-Read: Modeling JSON
Schema-on-Read
Modeling JSON
KentGraziano
About me
• Chief Technical Evangelist, Snowflake Computing
• Oracle ACE Director, Alumni (DW/BI)
• OakTable Network
• Blogger – The Data Warrior
• Certified Data Vault Master and DV 2.0 Practitioner
• Former Member: Boulder BI Brain Trust (#BBBT)
• Member: DAMA Houston & DAMA International
• Data Architecture and Data Warehouse Specialist
• 30+ years in IT
• 25+ years of Oracle-related work
• 20+ years of data warehousing experience
• Author & Co-Author of a bunch of books (Amazon)
• Past-President of ODTUG and Rocky Mountain Oracle
User Group
Founded 2012 by
industry veterans First customers 2014,
with over 120 general availability
database patents 2015
• Schema-on-Read vs Schema-on-Write
• Why we still need data modeling
• What is JSON?
• Example JSON #1
• Simple 3NF model
• Simple Data Vault model
• Example JSON #2
• 3NF model
• Data Vault model
• Schema-on-Read
• Popularized in document stores and NoSQL dbs
• No upfront modeling
• No predefined structure
• Called semi-structured or flexible-structure data
• Can change contents and structure over time
• Load & Go
• Agile!
• Schema-on-Write
• What we do in RDBMS today
• Requires knowing the structure in advance
• Upfront modeling & table design required
• Must map source data to the database tables
• ETL/ELT may break if the source data changes
• We all do!
• To take advantage of all this data, we have to use it
• Schema-on-Read
• There is a SCHEMA – which means a model!
• To query the data requires knowing the structure
• Which means the MODEL of the data or “document”
• Few reporting or BI tools can infer the schema
• So we have to transform it, somehow
• Load to tables and columns?
• Expose with a SQL view?
© 2018 Snowflake Computing Inc. All Rights Reserved
What is JSON?
"phoneNumber":
{
"areaCode": "415",
"subscriberNumber": "5551234”
},
"cityName": "London”,
Available on
Amazon.com
https://www.amazon.com/
Better-Data-Modeling-
Enhancing-Developer-
ebook/dp/B00UK75LYI/
SHAMELESS PLUG:
Available on
Amazon.com
http://www.amazon.com
/Better-Data-Modeling-
Introduction-
Engineering-
ebook/dp/B018BREV1C/
Discover the performance, concurrency, and simplicity of
Snowflake
As easy as 1-2-3!
Sign up and receive
01 Visit Snowflake.com $400 worth of free
usage for 30 days!
02 Click “Try for Free”
More info at
http://snowflake.com
Visit my blog at
http://kentgraziano.com
THANK YOU