ahmedNarrator | 5 years ago | on: Launch HN: Narrator (YC S19) – a data modeling platform built on a single table
ahmedNarrator's comments
ahmedNarrator | 5 years ago | on: Launch HN: Narrator (YC S19) – a data modeling platform built on a single table
(We will open source that query abstraction later with a demo where you can translate Redshift Queries to Snowflake Queries).
Maybe in the future we can get that project into dbt so that dbt models can work on any warehouse as well.
ahmedNarrator | 5 years ago | on: Launch HN: Narrator (YC S19) – a data modeling platform built on a single table
Yeah we do something very similar to dbt for taking restructuring the data into a single time-series table. We add things like identity resolution, diffing, incremental update and computing some cache columns.
Your Crystal Ball is SPOT ON!!! We get 3 kinds of data people. The ones who are like: "THIS WILL NEVER WORK", "Too bad I already built all this" or the "THIS IS THE FUTURE, HOW IS EVERYONE NOT USING IT".
I would love to chat and show you what we have (schedule a demo on our site and it will go to me and we can chat!)
Also, Teaser... When you standardize all of data and you create a consistent way to relating that standardized structure then analysis become very consistent. Imagine a world where your email attribution deep dive can be run by loading a template and point it to your "opened email" activity and your "order activity".... coming soon ... a Narrative Library.
ahmedNarrator | 5 years ago | on: Launch HN: Narrator (YC S19) – a data modeling platform built on a single table
You can see some example transformations here: https://docs.narrator.ai/docs/by-data-source
ahmedNarrator | 5 years ago | on: Launch HN: Narrator (YC S19) – a data modeling platform built on a single table
This was quite a challenge and I think what makes the traceability and source of truth problem a lot simpler.
In Narrator, the data team writes small SQL to create single customer centric business concepts that we call activities. These are around 25 lines and decided to be understood by anyone in the company (i.e. "viewed page", "called us",...).
Now, every question you or a stakeholder has will simply be a rearrangement of these activities. If you can describe what you want, then Narrator can assemble a table that represent it.
Source of truth - What ever is in the activity stream? Tracebility - always Dataset (activities and how they relate), then activities (~25 SQL). Coherent Model - Customers doing actions in time.
Does that make sense? Some of these things are easier to show in a demo then describe in text.
The real magic is how do you assemble the data into tables that can answer any question when the output table is billions of rows. Unfortunately, that requires a lot of SQL magic that depends on taking advantage of a columnar warehouse.