Enable javascript in your browser for better experience. Need to know to enable it?

黑料门

Data Mesh in practice: Getting off to the right start

Data Mesh in practice: Getting off to the right start (Part I)

Our learnings from implementing Data Mesh at Roche

Since Thoughtworker Zhamak Dehghani published her first article听听in 2019, a huge amount has been written on the topic. Now, with a significant number of successful Data Mesh projects under our belt, we鈥檙e able to enrich our understanding of the听practice听of Data Mesh.听

In this series of articles, we鈥檒l share practical learnings from our recent Data Mesh implementation engagement at . As the world鈥檚 largest global provider of healthcare, Roche generates, manages, and processes vast quantities of data across a wide range of deeply specialized domains. We lay out what it really takes to successfully apply the principles of Data Mesh in a large corporate environment.

As Zhamak points out in her book听, Data Mesh isn鈥檛 just another architectural approach 鈥 it鈥檚 a听sociotechnical听paradigm that demands process, operating model, and technological transformation. 鈥淪ociotechnical鈥 refers to both the layers and nuanced ways that human social systems interact with 鈥斕齛nd come to grips with 鈥 technology to produce improved outcomes.听[1]

Principles guiding all building blocks of Data Mesh

The four principles of Data Mesh 鈥 as defined by Zhamak 鈥 help us start to understand the range of sociotechnical听 requirements needed to build a strong Data Mesh.听

The first two, 鈥榙omain ownership and architecture鈥 and 鈥榙ata as a product,鈥 both demand significant changes to operating models. Whereas 鈥榮elf-service platforms鈥 and 鈥榝ederated computational governance鈥 demand deep engineering expertise and evolutionary architectures. These are two very different types of change, and if they aren鈥檛 understood and听 approached in the right way, they can undermine the huge potential value of the Data Mesh approach.

In this new series of articles, we will take a deep dive into the organizational, process and operating model changes required to support Data Mesh, the practical demands of building data products, and the architectural requirements of the approach. Throughout the series, we鈥檒l show you how we鈥檝e balanced the diverse changes needed to bring Roche鈥檚 vision for Data Mesh to life, and share some of our proven principles and practices to help you do the same.

The process and key practices at a glance

Before embarking on any data journey, especially a Data Mesh one, it鈥檚 essential that you have a clearly articulated data strategy at the organizational level. To create that strategy, teams must answer value-oriented questions, such as:

  • How are data, AI, and other leading capabilities going to deliver value for our business?

  • Do we want data to help us increase revenues, cut costs, improve customer experiences, or create new revenue streams 鈥 or some combination of the four?

  • What are our key strategic data use cases, and how can our data strategy support them?

Answering those questions enables organizations to make informed decisions about what they really need from their data architecture, and help shape how they ultimately measure the value delivered by their chosen architectural approach. In this article series, we鈥檙e looking at the process that follows that definition 鈥 exploring the journey that begins once an organization has clearly defined its data strategy and determined that Data Mesh is the right architectural choice to support and deliver it.

Across this article series, we focus on the practices, principles, and steps that 黑料门 takes to plan and execute successful Data Mesh听 journeys. Each article explores a different aspect of those journeys, but here鈥檚 a quick snapshot of how those actions come together across the journey as a whole:

Cross-domain planning and scoping

  • The process begins with听cross-domain and leadership teams defining the 鈥榳丑测鈥听for their organization 鈥 their motivations for adopting a data-centric approach to their business. This helps them shape their data strategy, and begin with a clear view of the business value and benefits that data strategy can deliver for their team
  • The team can then听determine if Data Mesh is a suitable architectural paradigm听to support that strategy, and set听measurable, value-based goals听that will be used to determine how effectively their decisions are moving them towards the benefits their strategy intends to deliver
  • Leaders also听define the 鈥榳丑补迟鈥听鈥 early hypotheses for how they鈥檙e going to do that, mapping out domains to include early in the process, and getting relevant teams to a point where they can begin undertaking detailed discovery exercises听 听 听 听

Operating model: from project to product

  • Operating model is defined听to ensure that teams across and within domains can work to create greatest value from the Data Mesh model with ease
  • New roles and responsibilities are defined, for clarity and empowerment of domain and data product actors
  • Governance structures are defined, building alignment on desired outcomes, balancing autonomy and interoperability, and ultimately ensuring consistency across the Data Mesh

Product thinking

  • Use principles of design thinking听to identify longlist potential data products
  • Map hypothesized products to a Lean Value Tree听to determine high-value options
  • Define the exact purpose and intended value of use cases and agree how to measure it
  • Work the longlist down into a shortlist of high-value use cases, and听work back towards the data products needed to bring them to life, in line with the lean value tree

Technology evolution

  • Approach and manage data products as atomic, functionally-cohesive units
  • Establish a streamlined developer experience听for creating and maintaining data products
  • Create a consistent definition of a data product听across the entire organization
  • Automate governance听and Access Control Policies听
  • Apply fitness functions听to guide the evolution of the mesh
  • Provide clear guidance (or patterns) for data sharing

Cross-domain maintenance and evolution

  • Decisions made across all three areas are听measured against defined success metrics, enabling continuous improvement, and constant iteration to improve and optimize the Data Mesh
  • Guardrails ensure听high interoperability between data products, and ultimately ensure the success and value of the Data Mesh

If you鈥檙e looking for insight into a specific aspect of the Data Mesh journey, you can jump ahead to the relevant article using the links below, where you鈥檒l find expanded explanations of the practices and principles introduced above:

To open the series, we鈥檒l look at a few of the consistent, high-level challenges faced when adopting Data Mesh, before introducing our design-thinking-based double diamond process that we use to kickstart successful, high-value Data Mesh journeys.

Challenge #1: Overcoming the dichotomy between rapid scaling and continuous learning

Following the principle of network effects [2], the value of Data Mesh increases with the number of interoperable use cases it serves, so scaling is a top priority for enterprises adopting the model. However, their eagerness to scale at speed has consistently created a challenge across all of the implementations we鈥檝e seen.听

From the social perspective of our sociotechnical paradigm, organizations adopting Data Mesh are on a learning journey. They have a lot of experimenting to do to determine what鈥檚 going to work best for them. In the process, they鈥檒l discover powerful business and customer use cases that deliver unique value for their organization. Naturally, they want to apply those lessons across as many domains as possible.听

However, if you scale too fast, you won鈥檛 have the opportunity to learn effectively or incorporate what you鈥檝e learned. Going too quickly creates a dynamic where distributed domains are all doing their own learning, but not learning from one another or collaborating on collective data efforts. Teams falling prey to this challenge get stuck in the experimentation stage, searching for ways to solve their own issues without identifying any of the enterprise-wide challenges that the organization as a whole could overcome using Data Mesh.

To address this potential trade-off and overcome the challenges it creates across our Data Mesh projects, we鈥檝e designed a set of practices to help organizations successfully balance both the learning and scaling demands of Data Mesh adoption. In keeping with the federated nature of the Data Mesh, our approach allows for parallel work by decentralized teams.

Challenge #2: Defining and empowering domains

Our long experience practicing听has helped us understand the benefits of drawing clear boundaries around contexts to separate concerns and allow teams to focus on solving problems within the context they fully understand and can control. It is natural that many Data Mesh implementations try to define domains to which they can assign data product ownership. However, in our experience, it is not necessary to reorganize business units or departments for the sole purpose of designing data product teams and data products. In many cases, you can start assigning data product ownership within your existing structure, and evolve it as required, as your journey progresses. In determining who owns the data, we follow the business outcomes to business processes and existing decision making frameworks around it.

Data Mesh empowers domains to make their own data-related decisions and build their own data products. That鈥檚 a lot of autonomy 鈥 especially compared to traditional centralized data architecture approaches, where everything must go through a single IT or data team.听

That level of freedom naturally raises a lot of important questions: Who needs to be part of goal definition conversations? How should outcomes be measured? Who keeps an eye on progress towards targets and offers management support to teams when they need it?

To help answer those questions, we drew upon 黑料门鈥櫶EDGE听operating model for inspiration on how to connect high-level business goals right the way down to a data product team鈥檚 backlog items. We found that EDGE lends itself well to the context of domains adopting Data Mesh. Rather than giving teams prescriptive outputs that they need to work toward, they鈥檙e aligned around specific goals to deliver customer value. Each domain has the autonomy to decide the best way to reach their domain-specific goals.

The result is a set of domains that are all aligned with organizational strategy and data strategy, but empowered to work toward strategic goals in ways that make sense based on their context, and apply their domain-specific knowledge to leverage data in creative ways, enabling them to deliver more customer value.听

Challenge #3: Running before you can walk

As Data Mesh best practice is constantly evolving, there is no single 鈥榖est path鈥 for teams to follow. They鈥檝e determined that Data Mesh is right for them, defined some initial use cases, and got to work on making it happen. Unfortunately, in some cases, that鈥檚 led to ineffective projects that haven鈥檛 made it past the POC phase.

Through our experience, we鈥檝e applied the Design Council鈥檚 鈥樷 approach to onboarding a new domain to Data Mesh, as shown below.

Onboarding a business unit / domain to the Data Mesh process

The double-diamond ensures that domains focus on the business 鈥榳丑测鈥 and 鈥榳丑补迟鈥 before they move on to think about the engineering 鈥榟辞飞鈥. The first phase of the double diamond incorporates the vital discovery step providing the 鈥榳丑测鈥 and 鈥榳丑补迟鈥, while the second phase of the double diamond addresses the 鈥榟辞飞鈥.

Key practice: start your journey with your existing organizational boundaries

Defining domains is a key part of the Data Mesh journey, but redefining domain boundaries can introduce a lot of additional challenges, and isn鈥檛 always a necessary prerequisite. We recommend starting with the organizational boundaries you already have, and only moving those boundaries to create new domains if you hit significant barriers along the way.

By going through a detailed discovery process, domain teams know exactly what they鈥檙e getting into and can strategically align with business goals and customer value. They鈥檙e able to clearly lay out what they want to achieve by joining the Data Mesh, how it can help them contribute to strategic goals, and what they鈥檒l need to get there. It starts the onboarding process, while also providing a framework for everything that needs to come after it.

One approach we have found very successful here is to define, as part of the discovery, the 鈥淒ata Mesh delta鈥 鈥 the gap between where the domain currently is vs what they want to achieve with Data Mesh. To identify this delta,听 we break our discovery process down into three streams:

  • The operating model stream听

  • The product stream听

  • The tech stream

In line with the principles of Data Mesh, those streams can run concurrently. The graphic below shows the three-stream discovery process that we鈥檙e currently running across multiple domains at Roche:

The three streams identified and explored during the discovery process don鈥檛 just help to scope and plan a domain鈥檚 onboarding onto the Data Mesh. They provide a framework for how onboarding should be executed, and the simultaneous organizational, process, and technological changes that are needed to ensure that Data Mesh ultimately delivers its full value for each domain, and the organization as a whole.听

Adopting Data Mesh requires more than just technology change

In the following articles that comprise this series, we鈥檒l take a deep dive into the three different types and areas of change required to achieve Data Mesh success. Each article will offer insights to those that work across the relevant domains, concluding with a high-level overview of the diverse changes and decisions required to successfully adopt, embed, and drive value from Data Mesh.

Throughout the series, we鈥檒l share artifacts and insights from our recent Data Mesh implementation engagement with Roche. Through the lens of their journey, we鈥檒l show you exactly how these different types of change come together to create a robust Data Mesh implementation and onboarding strategy..

In our next article, we鈥檒l explore the organizational and operating model decisions that teams need to make throughout their Data Mesh journeys, introducing principles and practices used across our projects to ensure success and maximize value creation.

Footnotes

[1]听The term 鈥渟ociotechnical鈥 was introduced by researchers at the Tavistock Institute in London in the middle of the 20th century. The canonical paper explored how a change in sociotechnical arrangements of work resulted in improvements in team cohesiveness, work satisfaction, and reduced sickness and absenteeism. See Trist EL and Bamforth K (1951) Some Social and Psychological Consequences of the Longwall Method of Coal-Getting: An Examination of the Psychological Situation and Defences of a Work Group in relation to the Social Structure and Technological Content of the Work System. Human Relations 4(1): 3鈥38. DOI: 10.1177/001872675100400101.

[2]听Barab谩si A-L (2002)听Linked: How Everything Is Connected to Everything Else and What It Means for Business, Science, and Everyday Life. London: Plume.