+

Data Feast Weekly

Get weekly insights on modern data delivered to your inbox, straight from our hand-picked curations!

The Data Product Strategy - Becoming Metrics-First
The Data Product Strategy - Becoming Metrics-First

The Data Product Strategy - Becoming Metrics-First

15 min
|
Proven Models, Metric Model as a reflection, and Metric Model Enablement
Aug 24, 2023

Originally published on

Modern Data 101 Newsletter

,

the following is a revised edition.

Understanding the Essence of Data


"AI as the new Electricity” has been evangelised by Andrew NG and the data community from a transformational point of view.

“Just as electricity transformed almost everything 100. years ago, today I actually have a hard time thinking of an industry that I don’t think AI will transform in the next several years.” - Andrew NG


But in fact, in a more literal sense, Data is the new Electricity. It flows and enables “appliances”- literally, as well as in the analogy of digital applications and decisions.

The Data Product Strategy is based on the successful models developed around harnessing the power of electricity. The ideation process has been bottom-up: which means this context was looked up after the strategy naturally came together and aligned to these models as they intrinsically fit the true nature or essence of data. The context of "Data as the new electricity" puts a good wrapper around the strategy and brings the story together.

If these proven stable models, which have been unleashed across the globe after several rounds of evolution, are able to regulate something so volatile and chaotic as electricity to scale it as a serpentine commodity running all around us without really suggesting a constant presence, these approaches would prove to be concrete for data as a commodity.

Strategy TLDR

1. Business Opportunity / Supply Gap
2. Use Case Identification
3. Metric Model: Identification of metrics and relations
4. Metric Enablement Model
5. Data Product Top-to-Bottom Requirements
6. Bottom-to-Top Development + Iteration
7. Metric Validation
8. Deployment
9. Continuous Metric Iteration

Business Metric Model, aka SLO model


Optimal Key to Data Product Success

Why a Metric Model and not just a Data Model

TLDR

1. The ideal reference/wireframe for product creation
2. Direct wireframe of business goals - direct correlation
3. More stable & consistent compared to underlying data
4. Comprehensible and manageable by business teams
5. Low friction to culture transition


A metric model is more stable and consistent with business needs compared to the underlying data. It makes the ideal reference or wireframe for product ideation and creation. While any product requires a set of metrics that it's developed around, data requires a metric model to transition into the product state, and a list of metrics wouldn’t suffice given the spread or omnipresence of data.

A metric model provides a direct interface and has a direct correlation to business processes and goals. This also makes it more comprehensible and manageable for business teams who otherwise host significant friction against learning low-level specifics. Data Product, even though it produces clean and consumable data, has several layers of low-level specifics that are more comprehensible for product management and engineering teams. Thereafter, a model at the business level instead of at the product or data level ensures low friction to culture transition.

A great thread on what makes a good data strategy

What does a Metric Model Entail

TLDR

1. All metrics tie back to primary business goals, e.g., TTROI
2. Every node represents either a metric or a sub-metric
3. Every edge represents how each sub-metric enables or impacts a higher-order metric
4. This model is solely influenced by "business change drivers"


All metrics tie back to primary business goals, e.g. ROI, Time to Revenue, Conversion Rate, etc. Interestingly, all these metrics depend on sub-metrics that enable them. If each metric value is treated as an agreement, all the sub-metrics enabling it become low-level objectives. The way SLAs and SLOs are interrelated.


Every node in a metric represents a metric or a sub-metric. The node at the pinnacle is the prima metric or the key metric the use case or the domain wants to push, for example, ROI. Every edge conveys how a sub-metric enables or impacts a higher-order metric.

Positive Inertia

The metric model is not exposed to much change as it sits very close to business goals, which are stable unless achieved or significantly pivoted. If you recall any business use case you worked on, the primary metrics that tie back to customer-facing endpoints are extremely consistent.

There could be a bit of ruffle in downstream metrics, but these too are known to be quite consistent because at the end of each metric is, after all, one type of customer - internal or external. Every change triggers an iteration loop of communication between the metric driver and the metric beneficiary. Which is why business metrics are well thought out, chosen with care, and finalised sternly.

The only points of influence that inspire change in the metric model are business change drivers, which Jon Cooke summarises really well:



Approaching the Metric Model

TLDR

1. Importance of enabler metrics alongside customer-facing metrics.
2. Metric prioritisation framework to navigate dips and spikes through strategic effort and resource allocation.


Every use case has a very specific and practically very narrow set of metrics. However, if we consider all the metrics that power the topsoil, the number of metrics could easily overflow a manageable limit. In prevalent stacks, most of the sub-metrics are not identified, especially with respect to data, and the data-driven efforts toward achieving the prima metrics are largely ill-defined.

You can't improve what you don't measure. - Peter Drucker


To truly become data-driven, data-driven efforts need to become measurable. And to do it in a way that completely aligns with business objectives calls for a metric model. This implies identifying and implementing the right set of sub-metrics assigned to higher-order business metrics.

While aligning with business may seem like a natural course of action, this is not what runs in practice. The goals and metrics of data teams and related efforts are largely separated or ill-defined with respect to business goals. A metric model bridges that gap and ensures high transparency across all tracks.

Applying Product Framework at the Metric Level

To prevent any overwhelm, the metric model needs to pass through a prioritisation filter, aka a concrete product strategy. One that enables you to substantiate the ideated model with its true value proposition to prioritise the efforts behind metrics resourcefully. One is free to use any framework they deem fit for this purpose, such as, say, a Lean Value Tree, right before freezing the Metric Model. In this case, we have used the BCG framework from the Boston Consulting Group.

The BCG Framework is a growth-share matrix that sheds light on high-performing tracks, discardable tracks or ones that need consistent maintenance. It has four interesting categories, and usually, they apply to products and initiatives, but we are using them for metric prioritisation.


Below, we have explained them with respect to metrics.

  • Star: High growth, high share
    These are high-value metrics tied to products and initiatives that have high growth rates and have the highest impact coverage. Keeping up these metrics brings in a lot of revenue, but maintaining them also drains significant efforts since they operate in the growth stage. For example, say, metrics defined for Apple’s celebrity campaigns by the marketing domain. These metrics are in extremely high growth areas and impact a wide share of their prospects.

  • Cash Cow: Low growth, high share
    Cash cows are the most lucrative metrics for the business as they cover significant ground in terms of customer impact but in low-growth areas. This enables enough cash to come in, which can be used for powering other initiatives, especially stars. For example, metrics related to conversion rates across marketing campaigns. Usually, enterprises have stable campaigns that consistently focus on converting prospects to bring in value for sales. This is a consistent engine, rarely disrupted by star campaigns, but has a high share in the revenue impact. The value generated from such metrics could be channelled into other metrics to power them, especially star metrics.

  • Question Mark: High growth, low share
    Question marks indicate potential. Metrics in this area would ideally be associated with initiatives with high growth prospects that impact consumers marginally. For example, AI add-ons. The organisation can make a decision based on these metrics’ progress whether to keep investing in these streams’ potential or pause/sell them off.

  • Pet: Low growth, low share
    Pets make the most draining segment with initiatives that have low growth prospects as well as low impact coverage. For example, integration and maintenance overheads of data pipelines. In other words, initiatives that could be optimised or entirely subtracted.


In summary, with a BCG framework-based prioritisation, a metric model would enable, there is more clarity on:

  • Profitable and sustainable lines
  • Where to invest further
  • Where to cap efforts
  • What to decommission
  • What to be consistent with
  • How and to what extent to react to dips and spikes

💡 True Product Experience
As a virtue of the metric model, inconsistency triggers, and prioritisation chain, you would know where and what to fix without navigating complex layers and illegible edges/connections in the data stack. This is the foundation of your data product - each metric is like an appliance that the data strives to light up.

💡 The Metric Enablement Model: Light the Bulb


You have the logical metric model. Now, how do you power it up? You need an enablement model that answers:1. How are the metrics powered?
2. How do Data Product o/p ports map to the metric model?


To answer these, we need to go back to the proven models that have been used in the context of regulating the electric flow. Models such as parallel circuits, capacitors, and resistors came after several iterations and failures. The consolidated model at the end of this evolution chain then went on to last for years due to its stability and ability to scale and regulate something as volatile and chaotic as electricity. Today, it’s all around us, quite literally, powering all our regular operations and businesses passively.

That’s exactly what we want with data, and that’s what becoming data-driven - a passive enabler deeply embedded within all our operations. We want to reach a state where we don’t have to call out data-driven decisions explicitly; all decisions would be data-driven.

Representing one-way flow


Designs employed in a parallel AC circuit:


If these proven models, which have been unleashed across the globe after several rounds of evolution, are able to regulate something so volatile and chaotic as electricity to scale it as a serpentine commodity running all around us without really suggesting a constant presence, these approaches would prove to be concrete for data as a commodity.

Let’s find out how this model translates into the realm of data.

⚙️ Metric-Enablement Model


Designs employed in a parallel metric model:


Parallel Edges

Corruption or disruptive changes in one track don't impact other tracks while not limiting the visibility of data from other tracks. Each metric enables certain decisions. Thus, the parallel model enables both decision isolation as well as collaborative decision-making. Most decisions are insulated as much as possible from sour decisions but benefit from positive ones.

Yes, data products are also driven by the principle of infra isolation and, therefore, self-dependency. This design also needs to be reflected on the metric layer that sits on top of the data products layer due to the same virtues.

💡 It’s important to remind ourselves that bringing in the product ideology means reflection across all layers or verticals, even beyond data products. Adopting a new design approach means implementing it top-down instead of limited and incomplete implementation in selected areas.

Metric Life Optimisation

As a virtue of parallel circuits/pipelines, discrepancies in one shouldn't trigger fatalities in another unless they are directly dependent, e.g., metric #4 & #5 (series). Optimised metric life also implies the ability to consistently capture metric values and recover failed captures with minimal time to recovery. The data product layer behind the metric model becomes the enabler.

The metric’s health is also optimised through targeted RCA. When one metric goes off, you know exactly which loop to target. Think of a fuse distribution board, which is basically a great experience for true product experience.

  • When a metric goes out, you know which circuit to target
  • One faulty loop doesn't impact other loops
  • Complex and illegible connections are abstracted through fuse triggers. The engineer doesn't need to navigate the entire complex stack of circuits to understand what went wrong and where.

The same circuit in a different arrangement, with fuse-like indicators plugged into the metric store/trigger.

Localised Regulation

Regulation prevents overwhelm, friction, and debt in processes and pipelines. Data could easily become chaotic and unmanageable without regulated resistance. While in prevalent systems and processes there are uncontrollable resistances, data products bring in the ability to regulate the right form of resistance. For example, say, SLOs in the form of quality checks or access and masking policies.

This is quite an interesting segment, as this makes one of the touch points between data products and the metrics. Specific output ports from a mesh of data products stream into the pipelines behind every metric in the metric model. Each output port is preceded by a contract that ensures the quality and governance as desired by the metric drivers.


There are two ways to establish these connections:

  • Bottom-Up Enablement
    Plug in existing data products wherever applicable. This would gradually become the primary way after the org has spent some time building data products. For this, your discoverability across data products and sub-elements in the data products needs to be state-of-the-art. Data Developer Platforms make this feasible since they have visibility across all components in the data stack.

  • Top-Down Enablement
    If new data products are required for certain metrics, bridging metrics and output ports would be top-down. This would be the primary way when orgs freshly adopt a data product ecosystem. Based on the requirements, a new development cycle is kickstarted, and the data product is built with the business metrics in mind. The data product funnel from Jon Cooke is a great way to trigger such top-to-bottom development.

Data Product Funnel

Metric Store with Triggers

Maintains consistency in metrics by triggering recovery or boosting pipelines on flickering metric values (analogy of A.C.- bi-directional flow). This acts like a direct feedback loop between data and business.


The metric store is the second touch point between data products and the metrics model, making this one of the primary components. Without a feedback loop that couples data initiatives with business goals, we would be pushed down to the same unresolved challenges where data efforts are disorganised and not even vaguely tied to the actual business impact. There needs to be a continuous feedback loop between outcome and activity. The metric store is that bridge, consistently passing metrics and triggers from business to data counterparts, allowing the data product management team to prioritise their efforts (refer to BCG matrix).

We’ve spoken several times about Samir Sharma’s data strategy with feedback loops as one of its major backbones. Will go out on a limb and say, this approach has disrupted the way we think about data in very implementable terms. Tying back metrics to data initiatives isn’t rocket science, and we could have concrete ways to establish the links.

Samir Sharma’s Data Strategy Framework

Non-Disruptive Extensibility

Any new data product circuit or third-party tool can be added as loops or sub-loops after due diligence and strategy and would immediately inherit all the properties of the parallel model without any complex integration overhead.

Final Note


During the course of last year, we came across some amazing data strategies and approaches, some of which we have highlighted here. We are glad to open up our approach and encourage discussions and enhancements around it. Here’s a quick revisit of the strategy.

  1. Business Opportunity / Supply Gap
  2. Use Case Identification
  3. Metric Model: Identification of metrics and relations
  4. Metric Enablement Model
  5. Data Product Top-to-Bottom Requirements
  6. Bottom-to-Top Development + Iteration
  7. Metric Validation
  8. Deployment
  9. Continuous Metric Iteration
// Text truncation functionality const elements = document.querySelectorAll('[ms-code-truncate]'); elements.forEach((element) => { const charLimit = parseInt(element.getAttribute('ms-code-truncate')); // Helper function to recursively traverse the DOM and truncate text nodes const traverseNodes = (node, count) => { for (let child of node.childNodes) { if (child.nodeType === Node.TEXT_NODE) { if (count + child.textContent.length > charLimit) { child.textContent = child.textContent.slice(0, charLimit - count) + '...'; return count + child.textContent.length; } count += child.textContent.length; } else if (child.nodeType === Node.ELEMENT_NODE) { count = traverseNodes(child, count); } } return count; } // Create a clone to work on without modifying the original element const clone = element.cloneNode(true); traverseNodes(clone, 0); // Replace the original element with the truncated version element.parentNode.replaceChild(clone, element); }); });