Dorieh Data Platform
Contents
Introduction
What is Data Platform
Data Domains
Health
Handling Medicaid data
Medicare Files Handling
Data dictionary and lineage for Medicare processing
Table Lineage Diagram
Alphabetic list of all tables
Alphabetic list of all columns in all tables
The Data Dictionary Generation tool
Querying Medicaid Data
Overview of health data (Medicare and Medicaid)
Project Structure
Documentation Indices
Climate
Exposure (from Atmospheric Composition Analysis Group of Washington University in St. Louis)
Environmental Protection Agency (EPA) data
Demographics
Data Processing Pipelines
Python Packages
Data Modelling for Dorieh Data Platform
Examples
Data Platform Internals
Database Testing Framework
Adding more data
Executing containerized apps
Terms and Acronyms
Indices
Dorieh Data Platform
Data Domains
Dorieh CMS Package (manipulating with Health Data)
Medicare Files Handling
Data dictionary and lineage for Medicare processing
Table Lineage Diagram
Materialized view medicare.qc_enrollments
Column medicare.qc_enrollments.zip
Column medicare.qc_enrollments.zip Lineage SVG
View page source
Column medicare.qc_enrollments.zip Lineage SVG
medicare.qc_enrollments.zip
medicare.qc_enrollments.zip
medicare.qc_enrl_bene.zip
medicare.qc_enrl_bene.zip
MAX(zip)
medicare.qc_enrl_bene.zip->medicare.qc_enrollments.zip
Transformed
medicare.enrollments.zip
medicare.enrollments.zip
MAX(zip)
medicare.enrollments.zip->medicare.qc_enrl_bene.zip
join natural
medicare._enrollments.zip
medicare._enrollments.zip
MAX(zip)
medicare._enrollments.zip->medicare.enrollments.zip
Copied
medicare._ps.zip
medicare._ps.zip
US Postal ZIP code, the first 5 digits
of it
numeric →
(CASE WHEN {column_name} < 100000 THEN {column_name}::int ELSE ({column_name}/10000)::INT END)
character varying →
SUBSTRING({column_name} FROM 1 FOR 5)::INT
integer →
{column_name}::INT
medicare._ps.zip->medicare._enrollments.zip
Aggregated On bene_id, year, state
medicare.ps.zip
medicare.ps.zip
US Postal ZIP code, the first 5 digits
of it
numeric →
(CASE WHEN {column_name} < 100000 THEN {column_name}::int ELSE ({column_name}/10000)::INT END)
character varying →
SUBSTRING({column_name} FROM 1 FOR 5)::INT
integer →
{column_name}::INT
medicare.ps.zip->medicare._ps.zip
Copied
medicare.ps.zip_parent
8 incoming links (columns)
medicare.ps.zip_parent->medicare.ps.zip