Psycho-Analytic Engineering (Coalesce 2021)

June 6, 2021 § Leave a comment

Using Data to Differentiate Our Selves

Keynote Talk Proposal for Coalesce 2021

Google Slides

Based on “DBT as Organizational Therapy

« Read the rest of this entry »

DBT as the “Couch” for Organizational Therapy

May 13, 2021 § 1 Comment

Or, “How ELTT is the Key to World Peace”

Draft Submission Script for Coalesce 2021 « Read the rest of this entry »

SyncHouse: MVC for Enterprise SaaS

May 2, 2021 § Leave a comment

A concrete proposal for Imagining a Data Resort as enforcing a Model-View-Controller architecture across multiple Software-as-a-Service applications. The key is replacing transient enterprise data integrations with a persistent “sync house,” and making that the one full-service Source of Truth for data, schemas, and business logic.

  1. Ingest data from Salesforce, NetSuite, etc. (e.g., Stitch/Talend, FiveTran)
  2. Store raw data in a LakeHouse (e.g., Databricks, Delta Lake; or just Redshift)
    1. Aka “ELT vs ETL
  3. Manage schemas via dbt (e.g., dbt Cloud)
  4. View and report on appropriate data (e.g., Mode, Data Studio)
  5. Push updates (reverse ETL) back to source applications (e.g., Celigo, Get Census)
« Read the rest of this entry »

My First Date with Quilt Data

July 21, 2020 § Leave a comment

I’ve known the good folks at Quilt Data for a long time. A company hackathon gave me a good excuse to actually use them “in anger” for an actual demo. These are my notes on how to configure quilt3 and create my first package (and panda data frame) from a CSV

« Read the rest of this entry »

Where Am I?

You are currently browsing entries tagged with data at iHack, therefore iBlog.