aka Quilt Data Hub or Lightdash 2.0? Challenge Can I evangelizea corporate data platformby just emailing out reportswith sufficiently smart URLs? Rationale I don't have the powerto pull others onto a new platform.But I can push useful data to othersin a way that inspires them to participate more directly with the platform Proposal Replace friendly... Continue Reading →
The Coherency Manifesto: Towards Communal Data Platforms
Version 1.0: Sep 11, 2021 (Interdependence Day) As a communitywho produces, consumes, and manages datawe hold these truths to be self-evident: Our most precious resource as a community is our ability to make better decisions together ("Coherency")Better decisions are enabled by higher Quality DataCoherency increases as our Communal Data Platform aligns the Syntax of producers... Continue Reading →
Psycho-Analytic Engineering (Coalesce 2021)
Using Data to Differentiate Our Selves Keynote Talk Proposal for Coalesce 2021 Google Slides Based on "DBT as Organizational Therapy" Pitch Video https://www.loom.com/share/58c52bb915da4d57a66995f618194ce8 More powerful tools and shorter cycle times mean that we analytics engineers “get to” spend less time on coding SQL and “have to” spend more time understanding the deeper needs, motivations, and... Continue Reading →
DBT as the “Couch” for Organizational Therapy
Or, "How ELTT is the Key to World Peace" Draft Submission Script for Coalesce 2021 Hey there Data Lovers, my name is Dr. Ernie. And this is my English Cocker Spaniel Qhuinn, who with me and my boss make up the IT department at a Palo Alto startup. I am Caltech physicist turned management consultant... Continue Reading →
Configuring DataBricks on AWS
Despite the excellent QuickStart tools, this was way harder than I thought. For some reason I had the worst difficulty creating a Workspace on AWS for Databricks. Here are some tips that might help others who get stuck. A. Be clear which "Account ID" to enter where My Account ID on DatabricksMy Account ID on... Continue Reading →
SyncHouse: MVC for Enterprise SaaS
A concrete proposal for Imagining a Data Resort as enforcing a Model-View-Controller architecture across multiple Software-as-a-Service applications. The key is replacing transient enterprise data integrations with a persistent "sync house," and making that the one full-service Source of Truth for data, schemas, and business logic. Ingest data from Salesforce, NetSuite, etc. (e.g., Stitch/Talend, FiveTran)Store raw... Continue Reading →
Imagining a Data Resort
A data resort is where data comes to get pampered, so that it is prepared to get back to work. Motivation The good news is that I finally understand how we really need to be managing all the business data in my organization. The bad news is that I don't know how to articulate that... Continue Reading →
Become Like a Billionaire
Obsess over a Wildly Important Problem that has not been properly characterizedIdentify a novel point of technological leverage for solving that problemDiscover a market hurting enough to pay for even a crappy solution to that problemIterate and improve on all the above until you die, fully solve the problem, or hand it over to someone... Continue Reading →
My First Date with Quilt Data
I've known the good folks at Quilt Data for a long time. A company hackathon gave me a good excuse to actually use them "in anger" for an actual demo. These are my notes on how to configure quilt3 and create my first package (and panda data frame) from a CSV Create a Quilt account.... Continue Reading →
SSO Login into Salesforce from Node via samlp SAML IdP
Documenting this in a blog post because it drove us crazy trying to figure out exactly what was involved, even though it was actually easy to implement once we understood all the terminology. In order for our previously-authenticated users to automatically log into Salesforce, we needed to: Create a "/sso-url" on our node server for our... Continue Reading →

You must be logged in to post a comment.