There's a growing conversation about how data should be treated as a product in the world of enterprise data. This idea gained significant traction with the rise of the “Data Mesh” architecture a few years ago, which introduced the concept of decentralized data ownership and product thinking.
However, the definition of a data product is still debated. While interpretations vary, most agree that a data product should:
There may be other considerations, but this post won't delve into every nuance.
To understand their roles, let’s first define data catalogs and data modeling tools:
Data catalogs excel at answering the basic question: “What data do we have?”
They create a dictionary of your data assets, making it easier to manage and understand them.
Data modeling tools help visualize how these assets connect with one another. They allow teams to establish the relationships between data entities and better understand how everything fits together in the larger system.
I often use a simple analogy to explain how these tools work together in building data products:
Imagine you’ve bought a couch from IKEA. It could be any product, but for simplicity, let's say it’s a kitchen table.
You unbox the set and find the manual. This manual lists all the necessary parts—screws, bolts, boards, etc.—required to assemble the table.
Although the written instructions are helpful, it might still be difficult to understand how to put everything together. IKEA solves this by including visual images that guide you through the assembly process.
In this analogy:
In Ellie.ai, you can create a visual diagram to show how different data assets (or entities) are related. For example, consider a business concept like a customer. A "customer" isn't just any company—it becomes a customer because of a relationship, such as an invoice or shipment that’s linked to it. The customer is defined by these relationships.
This is a simplified example, but it shows how crucial relationships are in data modeling. When you can visualize how business concepts (like customer or product) are interrelated, you're well on your way to creating a solid data product blueprint.
In short, the process starts with cataloging your existing data assets. Once that's done, you can build something new, such as a blueprint for creating a BI dashboard or any other data product.
Many of our customers are already using integrations between Collibra and Ellie.ai to achieve exactly this—and they’re seeing great results.