Show HN: Open-source infra for building embedded data pipelines
41 points| seandoh | 3 years ago |github.com
We are building *open source infrastructure for deploying customer-facing data pipelines.*
Here’s our repo https://github.com/pipebird/pipebird and website https://pipebird.com/.
Pipebird (YC W22) is designed to enable companies that generate important data to offer secure data pushes to their customers’ warehouses, directly from their products.
Our team was previously building in fintech, where we heard from many of our peers that their customers wanted data pushed directly to their warehouses. Customers wanted to bring data into their source of truth without having to maintain custom built pipelines or introduce security risks by contracting a third-party ETL/ELT provider.
After seeing Stripe https://stripe.com/data-pipeline and customer.io https://customer.io/data-warehouse recently invest in building out their own native data sharing products, we realized that many SaaS companies could better support their customers and even generate additional revenue by offering native data pipelines.
Our goal with Pipebird is to make creating a reliable data pipeline as simple as pressing a button from a vendor's dashboard.
With the current iteration of the product, data can be selected from a number of sources (ex: Postgres, MySQL, CockroachDB, etc.), customers can configure pipelines and optionally apply transformations (like type casting), and data can be periodically synced directly to customers’ warehouses (ex: Snowflake). We’re actively adding sources/destinations and would appreciate any feature requests.
Here's a 2 min demo of the product https://www.loom.com/share/c7a7e4b4e57c4015b533fd754c510b2e
Pipebird is open source (MIT license) so that any developer can use it. Our aim is to not charge individual developers - we make money selling paid plans that include features like multiple projects, user permissions, additional security features, managed infra, support, etc.
Give us a whirl: https://github.com/pipebird/pipebird. We’d love your feedback and will be here to answer any questions!
ctc24|3 years ago
As someone who's been playing in the data sharing space, it's really exciting to see it get more attention!
All the best to y'all!
timothygoltser|3 years ago
The activation energy for setting up robust direct-to-customer pipelines is still too high on the provider’s end - it’s great to see this approach getting more investment recently.
All the best to Prequel as well!
erezsh|3 years ago
sudonim|3 years ago
Good luck with the next steps for Pipebird.
seandoh|3 years ago
xiaofei_|3 years ago
seandoh|3 years ago
To illustrate the difference, imagine ACME Inc. wants to pull its customer data from HubSpot. With Airbyte, they can use a pre-built connector that accesses data made available via the HubSpot API.
Now let's say HubSpot uses Pipebird to build native data sharing features for its customers. ACME Inc. can now deploy a secure data pipeline directly from HubSpot without involving any third-party. Since HubSpot is offering the pipelines, it can choose to expose more data than is made available via its API (Stripe has done this with their data pipeline product https://stripe.com/data-pipeline) and ACME Inc. doesn't need to worry about the pipeline breaking because it's coming directly from the source.
tarunmuvvala|3 years ago
The product looks great. I had faced such situations in past with different tools.
You should make the PM and Dev community aware of this tool to get better leads and usecases.
Wish you best for the future.
seandoh|3 years ago
alanjinomoto|3 years ago
seandoh|3 years ago
We see this as a way for companies to strengthen relationships with their customers and grow revenue.
azianmike|3 years ago
timothygoltser|3 years ago
seandoh|3 years ago
timothygoltser|3 years ago
hodgesrm|3 years ago