Tanay Arora
Self-Service Analytics · Part 2

Data at the point
of decision.

The signal is in the data. The person who needs to act on it is somewhere else. Closing that gap — without a ticket, without waiting two days, without an engineer as the intermediary.

What governed data exposure unlocks
Governed data layer
Warehouse · semantic views · access controls
BUILD SKILLS
Teams create their own reusable workflows — no data engineer in the loop
BUILD DASHBOARDS
Teams ship their own views — GTM pipelines, sales performance, delivery tracking
RUN ANALYSIS
Teams ask questions and get grounded answers — no ticket, no wait
The data team stops being the bottleneck and starts being the platform
Fig 1 — One governed layer. Three kinds of self-service it enables.

The real cost of a data bottleneck

Every data team I've worked in or alongside has the same problem at some point. The pipelines are solid, the models are clean, the data is accurate — and the team is still buried. Not building. Answering questions. Pulling the same numbers in slightly different forms for slightly different people, week after week.

The cost isn't visible on a roadmap. It shows up as decisions made without data because the request queue was too long. It shows up as a CS lead who knows a customer is at risk but can't see the usage data without filing a ticket. It shows up as a leadership team that gets their metrics two days after the week ends, by which point the window to act has closed.

The question that kept coming up: what would change if the data team's job was to build the infrastructure that lets other people answer their own questions — rather than answering them one by one?

The bottleneck model vs the self-service model
Before
Business user has a question
↓ files a ticket
Data engineer picks it up
↓ writes a query
Sends back a spreadsheet
↓ 2–3 days later
Decision already made without it
After
Business user has a question
↓ asks directly
Governed interface handles it
↓ seconds
Answer arrives, consistent and trusted
↓ immediately
Decision made with data
Fig 2 — The difference isn't the quality of the data. It's who can reach it, and when.

What self-service actually means

Self-service data gets misunderstood. It doesn't mean giving everyone access to a BI tool and hoping for the best. That just moves the bottleneck — now a PM is writing SQL at 11pm, getting a subtly wrong answer, and making a decision on it with confidence.

Real self-service means the interface is built for the consumer, not the data engineer. The question a CS lead needs to answer on a Monday morning is different from what a product manager needs for a roadmap review. The data is the same. The interface has to fit the job.

Four interfaces, each designed around a specific decision and a specific person. The underlying data — the warehouse, the models, the semantic layer — is shared. What changes is the surface each team sees, shaped to how they actually work.

One data layer, four self-service interfaces
Governed data layer
Warehouse · dbt models · semantic views · access controls
WEEKLY METRICS
Business review, automated
Leadership gets metrics before they sit down Monday morning — no compilation, no waiting.
CHURN SIGNALS
At-risk users, prioritised
CS lead sees who to contact today, in what order, with the context needed to act.
DELIVERY INTELLIGENCE
Engineering velocity, on demand
Product and engineering leads query cycle times, slippage, and rework in plain English.
CONVERSATION INTELLIGENCE
Call patterns, queryable
Sales and CS ask questions about call themes and objection patterns without touching the warehouse.
Fig 3 — Same governed foundation. Four interfaces, each shaped to a specific team's decision cadence.

The value each interface unlocks

Decisions made on Monday, not Wednesday

The weekly business review used to be a two-to-three hour manual process. Someone had to pull metrics from half a dozen places, reconcile the numbers, format them, and distribute the document before a meeting that needed those numbers to be useful. By the time the data was ready, the meeting was over or the week's decisions had already been made elsewhere.

Automating this didn't just save time. It changed the rhythm of the business. Leadership teams that get accurate metrics before their Monday morning discussion make different decisions than ones that get them on Wednesday afternoon. The window between a signal appearing in the data and an action being taken closed from days to hours.

Time from signal to decision
Manual process
2–3 days
Automated delivery
~3 min
Ad hoc question
sec
The window between signal and action. The shorter it is, the more useful the data.
Fig 4 — Reducing time to insight isn't a technical achievement. It's a business one.

The CS team that doesn't need to ask

A customer success team is only as effective as the signals it can see. If the only way to know a customer is at risk is to wait for them to cancel, you've already lost. If the signal exists in the data but requires a data engineer to surface it, you've added friction at exactly the moment speed matters most.

A weekly risk report classifies every paying user by behavioural signals — engagement trends, usage patterns, time since last active — and delivers a prioritised list to the CS lead before the week starts. No query, no ticket. The data was always there. What changed was when it arrived and who it reached.

Product managers and engineering leads need to answer questions about how work is moving — where projects are getting stuck, how long stages are taking, whether slippage is getting better or worse. These are analytical questions, but the people asking them aren't analysts. They shouldn't have to be.

Delivery data in a queryable semantic layer means a product manager can ask why a project stalled without waiting on a data engineer. The question gets answered in the same tool they're already using.

Conversations as a data source

Some of the most valuable signals in a company aren't in a database. They're in conversations — what customers say on calls, what objections come up repeatedly, what features get asked for before they're built. Most organisations can't answer questions about this data at all, because it lives in audio recordings and transcripts that nobody has time to analyse manually.

Connecting conversation data to the governed data layer — and exposing it through the same kind of interface as everything else — means a sales leader can ask "what objections are we hearing most?" and get a grounded, consistent answer rather than a gut feel. That's a genuinely new capability, not just a faster version of something that existed before.

Why governance is what makes it work

Self-service data without governance is dangerous. If different people are using different definitions of the same metric, self-service just means they can reach the wrong answer faster. The reason these interfaces work — the reason the CS lead trusts the risk report and the leadership team trusts the weekly metrics — is that the underlying definitions are enforced at the data layer, not in the interface.

What makes self-service trustworthy
Consistent definitions
Every metric means the same thing regardless of who asks or how they ask. Defined once, enforced at the data layer.
Controlled access
Each interface exposes only what its consumer needs to see. Self-service doesn't mean open access.
Fit-for-purpose surface
The interface is built for the consumer's decision, not for data engineers. A CS lead shouldn't need SQL to know who to call.
Fig 5 — Governance isn't the opposite of self-service. It's what makes self-service trustworthy enough to act on.

The shift wasn't visible in any single week. It happened gradually — fewer requests in the queue, more teams reaching for the data themselves, more questions answered before they were asked.

When you know it's working

The thing that told me this was actually working: the sales team built their own GTM dashboards to track pipeline and sales performance lifecycles. People outside the data team started creating their own skills. Nobody asked permission. They just did it. That's the outcome — not a metric, not a report, just other people building things because the infrastructure made it possible.

What this requires to work

01
A governed foundation first
Self-service built on inconsistent data erodes trust quickly. The semantic layer, the access controls, the consistent metric definitions — all of that has to exist before the interfaces on top of it are trustworthy.
02
Design for the decision, not the data
The right question when building a self-service interface isn't "what data do we have?" It's "what decision does this person need to make, and what do they need to see to make it?" The interface follows from the answer.
03
The data arrives before it's needed
A report that arrives after the meeting it was supposed to inform is a report that didn't change anything. Timing is part of the design. The CS lead's risk report needs to be there Monday morning, not when someone remembers to run it.
04
Trust is built incrementally
No one trusts a new data interface immediately. Start with one team, one decision, one cadence. Get the numbers right consistently. Trust follows from reliability, not from capability.
More from this series
01
AI & Data Strategy
From dashboards to decisions
03
AI & Data Governance
Governed infrastructure for agentic AI
04
Scaling dbt
The dbt project structure that scales
Tanay Arora
Senior Data Engineer · Melbourne, AU
LinkedIn GitHub Get in touch →
← AI & Data Strategy AI & Data Governance →