Dorieh Data Platform
Contents
Introduction
What is Data Platform
Data Domains
Data Processing Pipelines
Python Packages
Data Modelling for Dorieh Data Platform
Extensions used for creating federated view of different years
The Data Dictionary Generation tool
Medicare: Building a Data Warehouse from ResDac Files
Data dictionary and lineage for Medicare processing
Table Lineage Diagram
Table Lineage SVG Diagram
Table cms.mbsf_ab_2011
Table cms.mbsf_ab_2012
Table cms.mbsf_ab_2013
Table cms.mbsf_ab_2014
Table cms.mbsf_abcd_2015
Table cms.mbsf_abcd_2016
Table cms.mbsf_abcd_2017
Table cms.mbsf_abcd_2018
Table cms.mbsf_d_2011
Table cms.mbsf_d_2012
Table cms.mbsf_d_2013
Table cms.mbsf_d_2014
Table cms.medpar_2011
Table cms.medpar_2012
Table cms.medpar_2013
Table cms.medpar_2014
Table cms.medpar_2015
Table cms.medpar_2016
Table cms.medpar_2017
Table cms.medpar_2018
View medicare._beneficiaries
View medicare._enrollments
Materialized view medicare._ps
Table medicare.admissions
Table medicare.beneficiaries
Table medicare.enrollments
View medicare.ip
Materialized view medicare.mbsf_d
View medicare.ps
View medicare.qc_adm_union
Materialized view medicare.qc_admissions
View medicare.qc_enrl_bene
Materialized view medicare.qc_enrollments
Alphabetic list of all tables
Alphabetic list of all columns in all tables
The Data Dictionary Generation tool
Medicaid: Building a Data Warehouse from ResDac Files
Introduction to data modelling for Dorieh Data Platform
Domain
Table
Column
Multi-column indices
Generation of the database schema (DDL)
Indexing policies
Linking with nomenclature
Ingesting data
Examples
Data Platform Internals
Database Testing Framework
Adding more data
Executing containerized apps
Terms and Acronyms
Indices
Dorieh Data Platform
Data Domains
Dorieh CMS Package (manipulating ResDac Health Data)
Medicare: Building a Data Warehouse from ResDac Files
Data dictionary and lineage for Medicare processing
Table Lineage Diagram
Materialized view medicare.qc_enrollments
Column medicare.qc_enrollments.zip
Column medicare.qc_enrollments.zip Lineage SVG
View page source
Column medicare.qc_enrollments.zip Lineage SVG
medicare.qc_enrollments.zip
medicare.qc_enrollments.zip
medicare.qc_enrl_bene.zip
medicare.qc_enrl_bene.zip
MAX(zip)
medicare.qc_enrl_bene.zip->medicare.qc_enrollments.zip
Transformed
medicare.enrollments.zip
medicare.enrollments.zip
MAX(zip)
medicare.enrollments.zip->medicare.qc_enrl_bene.zip
join natural
medicare._enrollments.zip
medicare._enrollments.zip
MAX(zip)
medicare._enrollments.zip->medicare.enrollments.zip
Copied
medicare._ps.zip
medicare._ps.zip
US Postal ZIP code, the first 5 digits
of it
numeric →
(CASE WHEN {column_name} < 100000 THEN {column_name}::int ELSE ({column_name}/10000)::INT END)
character varying →
SUBSTRING({column_name} FROM 1 FOR 5)::INT
integer →
{column_name}::INT
medicare._ps.zip->medicare._enrollments.zip
Aggregated On bene_id, year, state
medicare.ps.zip
medicare.ps.zip
US Postal ZIP code, the first 5 digits
of it
numeric →
(CASE WHEN {column_name} < 100000 THEN {column_name}::int ELSE ({column_name}/10000)::INT END)
character varying →
SUBSTRING({column_name} FROM 1 FOR 5)::INT
integer →
{column_name}::INT
medicare.ps.zip->medicare._ps.zip
Copied
medicare.ps.zip_parent
8 incoming links (columns)
medicare.ps.zip_parent->medicare.ps.zip