Data Mesh in Practice: Decentralizing Data Ownership

Data Mesh in Practice: Decentralizing Data Ownership

July 2, 2025·5 min read
Data

The Centralized Data Team Problem

Most organizations run their data platform through a central data engineering team. Business units request data products, the central team builds pipelines, and a growing backlog creates frustration on both sides.

Data mesh proposes a fundamentally different model: treat data as a product, owned by the domain teams that produce it. The central team shifts from building pipelines to providing a self-service platform that enables domain teams to build their own.

The Four Principles of Data Mesh

Domain Ownership

In a data mesh, the team that produces data is responsible for making it available as a product. The marketing team owns marketing data products. The finance team owns financial data products. Each domain team has the context to model their data correctly and the incentive to maintain quality.

This does not mean every team builds everything from scratch. It means they own the decision-making about their data — what to expose, how to model it, and what quality standards to enforce.

Data as a Product

Domain data is not just a byproduct of operational systems. It is a product with consumers who depend on it. Data products should have:

  • Discoverability: Registered in a central catalog with clear descriptions and ownership
  • Addressability: Accessible through standard interfaces without knowing internal implementation
  • Trustworthiness: Published quality metrics and SLAs that consumers can rely on
  • Self-describing: Schema documentation and semantic metadata that make the data understandable
  • Interoperable: Conforming to organizational standards for formats, identifiers, and naming conventions

Self-Serve Data Platform

The central platform team provides infrastructure and tooling that makes it easy for domain teams to build, deploy, and operate data products. This includes:

  • Standardized ingestion frameworks and templates
  • Compute and storage infrastructure with cost allocation
  • Data quality testing and monitoring tools
  • Access control and governance automation
  • Data catalog and discovery services

The platform should reduce the cognitive load on domain teams. A domain engineer should be able to publish a new data product in hours, not weeks.

Federated Computational Governance

Governance in a data mesh is not centralized command-and-control. It is a federated model where the central team defines standards and policies, but domain teams implement them.

The platform should automate governance wherever possible — enforcing naming conventions, applying access controls, tracking lineage, and monitoring quality. Manual governance processes do not scale.

Implementation Challenges

Data mesh sounds elegant in theory. In practice, several challenges must be addressed:

Organizational readiness: Data mesh requires domain teams to take on data engineering responsibilities. This requires new skills, new roles, and cultural change. Organizations with weak engineering cultures in business units will struggle.

Platform investment: The self-serve platform is a significant engineering undertaking. Most organizations underestimate the investment required to build a platform that genuinely enables domain teams.

Cross-domain queries: Business questions often span multiple domains. Ensuring that data products from different domains can be easily joined and compared requires careful attention to shared identifiers, consistent time dimensions, and compatible formats.

Cost management: Decentralized data production can lead to duplicated effort and uncontrolled spending. Federated governance must include cost visibility and accountability.

A Phased Approach

We recommend starting small and expanding based on demonstrated value:

Phase 1: Identify 2-3 domains with strong engineering capability and clear data products. Build the minimum viable platform to support them.

Phase 2: Expand to additional domains. Invest in platform capabilities based on the friction points identified in Phase 1.

Phase 3: Implement federated governance automation. Build cross-domain data products and marketplaces.

Phase 4: Optimize for scale. Address advanced topics like real-time data products, ML feature stores, and cross-organizational data sharing.

When Data Mesh Is Not the Answer

Data mesh is not appropriate for every organization. Consider traditional centralized approaches if:

  • Your organization has fewer than 50 engineers across all domains
  • Your data use cases are primarily centralized reporting and analytics
  • Domain teams do not have engineering capability and cannot realistically build it
  • Your data volume and complexity do not justify the platform investment

The right architecture depends on your organization, not on industry trends.