[Note: Article originally published on linkedin]
In the hyper-competitive world of video streaming, keeping subscribers engaged isn’t just a nice-to-have, it’s crucial. Engagement is inversely correlated with churn, meaning the more viewers interact with your content and platform, the less likely they are to cancel their subscriptions. But what exactly does that mean in practical terms? And how can streaming services measure and enhance engagement to reduce churn?

Let’s engage
Engagement refers to the degree to which users interact meaningfully with a streaming platform. It includes both quantity (how much time they spend) and quality (how immersive or satisfying that time is) and higher engagement means higher user’s satisfaction.
Common metrics used to measure the engagement are, for example, the total watch time per month, the frequency of use, the average session length, the completion rate, the early abandon rate and in general the viewing behavior. The indicators of a satisfying and “engaging” experience are many and not so straightforward to interpret, so are the indicators of an unsatisfying experience that are useful to recognize because they can anticipate the abandon of the service.
To deepen the understanding, engagement can also be broken down into active and passive dimensions. Active engagement includes user-initiated actions such as searching for content, creating watchlists, rating titles, and interacting with UI features like skip buttons or content previews. Passive engagement refers to behaviors like binge-watching a series or letting autoplay continue or simply consume a lot of content, every day. Both are valuable: active engagement suggests intentionality and platform loyalty, while passive engagement may indicate frictionless consumption.
Moreover, it’s important to track engagement across devices and other user’s behaviors. For instance, mobile sessions might be shorter but more frequent, while smart TV sessions are longer and more immersive. Cross-device continuity and resumed playback are also strong signals of user engagement and satisfaction with the service.
What is Churn ?
Churn refers to the rate at which subscribers cancel their streaming service subscriptions over a period, typically last month.
The reasons why a user abandon a streaming service are many, but we can subdivide them in 3 categories: QoE related (subpar quality, technical issues, bugs, instability, not flowless experience), Content related (not interested to content, content not updated, difficulty in content discovery etc..) and Other reasons (free trial expires, too expensive, temporary cancellations between content releases).
Data Analysis may help in the task of interpreting the user’s Engagement and likelihood of churn. Advanced platforms deploy machine learning to predict churn weeks in advance using behavioral signals such as reduced session frequency and duration over time, longer browsing times without playbacks, or low variability in content consumed.
These behaviors serve as early warnings and help platforms trigger retention strategies—such as targeted offers, personalized push notifications, or more simply should trigger internal alarms to analyze the phenomenon, find the root cause and fix it, especially when it interests a significative part of the audience.
The Inverse Correlation: Why Engagement Reduces Churn
Multiple studies and real-world cases show that higher engagement directly lowers churn. When users find value and entertainment in a platform, they are less likely to leave.
For example, this study from Owl&co (2024) confirms the huge impact that engagement can have on churn (and streaming companies balance). Streamonomics: Engagement vs Churn, Quantified
It’s clear that, if engagement reduces churn, it’s of crucial importance to measure engagement and work to increase it as much as possible, eliminating elements that contribute to a reduction of engagement and introducing new elements or improve existing ones that contribute to an increase.
What Drives Engagement in a Streaming Service?
Improving engagement isn’t just about having more content, it’s also about designing the experience to keep users coming back. This means not only well designed streaming app but also high quality, flowless streaming and -definitely- high QoE in video streaming.
Obviously content is king and so everything that revolves around the content are essential to increase the engagement. New and exclusive high quality content presented in a catchy way and contextualized with the viewer preferences are the key factor.
However, a bad UX with navigation problems, bugs, convoluted logics or invasive advertising can hamper the experience reducing the potentiality of high quality content, especially in the long run, when compared to better UX and flowless experience.
At the same time, and even more importantly, low video quality, inconsistent streaming performance, rebuffering can have a huge impact on engagement. The best streaming platforms have accustomed us to a very refined QoE and any worsening is difficult to tolerate for the end user.
How to Measure and Analyze Engagement Effectively
Analytics platforms like Mux, Conviva, NPAW’s Youbora and in-house data teams often use dashboards that combine raw logs with advanced metrics.
Event-driven telemetry is also key. Platforms instrument players and apps to emit real-time events (e.g., play, pause, seek, stall, buffer ratio, dropped frames) that feed into data lakes for batch or streaming analysis. These events are often correlated with user retention metrics using cohort analysis and regression modeling.
Engagement scoring models also integrate heatmaps of UI interactions, time-to-first-play (TTFP), and UI responsiveness to detect friction points. Machine learning models can segment users based on engagement profiles, enabling targeted actions like in-app tips, onboarding flows, or churn-prevention incentives.
When an effective practice to measure engagement is defined, is of crucial importance to have a well implemented A/B testing practice to measure the real impact that changes or new features can have on the final users.
Finally, many companies are exploring synthetic monitoring (simulated sessions on real devices) and stress testing to measure engagement-affecting bugs or performance drops before actual users are impacted. Even more insightfull is a benchmarking service like NTT Data’s OTT Observatory where multiple competing streaming services are analyzed, compared and benchmarked with objective KPIs to estimate the user’s point-of-view.
Strategic Excellence: The Power of Technology in Shaping the QoE
Today, in a crowded and commoditized market, technological excellence is no more a differentiator—it’s a strategic necessity. A well-engineered streaming infrastructure can subtly but powerfully shape user perception, reduce friction, maximize Quality of Experience (QoE), and ultimately solidify long-term customer loyalty & engagement.
Great content may draw attention, but it’s the execution that slowly earns or destroys trust.
True excellence lies in mastering the complexity of the user-platform relationship. This means not only ensuring reasonably fast, buffer-free video playback and quick UI response, but also cuddle users through a refined, gorgeous and rock solid video quality.
But how to dominate such complexity ?
To dominate complexity, a platform must continuously collect, correlate, and act on data from multiple fronts: QoE measurements, Behavioral metrics and Engagement signals.
But metrics alone aren’t enough. The real advantage comes when this data feeds into a culture of testing, learning, and adapting. A/B testing should become a permanent practice, not just for content layout or recommendations, but for technical changes: new encoders and players, UI transitions, bitrate ladders, AI-driven ABR heuristics, even audio codec swaps.
When technology, data, and culture align, platforms can proactively refine, not just react. This mindset of continuous optimization allows services not only to detect pain points but to resolve them before users churn. It is a holistic approach where engagement, retention, QoE, and UX are treated as parts of the same system, not separate silos.
And yet, all this leads to one inevitable frontier: understanding and measuring Quality of Experience in real time.
As discussed, video quality It’s a front-line component of engagement, and when measured precisely moment by moment in each streaming session, together with other KPIs becomes a compass for every strategic decision, from encoding strategy to UI prioritization.
Measuring Video QoE at the Core
At the heart of any effort to optimize engagement lies a deceptively complex question: how good is the user’s experience, in this moment, for that user ?
To answer this, modern streaming platforms are moving beyond traditional quality KPIs like resolution and bitrate embracing per-scene video quality assessment. The process begins with extracting frame-level/scene-level perceptual quality scores and content complexity indicators, producing a rich stream of metadata that can be interpreted in the context of each specific viewing session.
These data points are then correlated with external variables such as:
- Display size and device class (e.g., smartphone vs. 4K TV),
- Viewing context (on the move, in the living room),
- Content type (e.g., animation, dark scenes, high-motion action),
Once collected, this quality metadata is cross-referenced with session-level analytics and CDN delivery logs, including classical KPIs like:
- Video Startup Time
- Stall events and buffering,
- Network conditions and adaptation behavior.
The outcome is a composite view of the session, a granular, data-driven map of how the stream was delivered and perceived. This lays the foundation for highly targeted A/B testing practices, where QoE predictions and session outcomes can be validated experimentally.
Did lowering the peak bitrate on low-complexity scenes actually preserve perceived quality on tablets? Does a more aggressive encoder preset cause subtle artifacts in high-contrast HDR scenes? Can adjusting buffer targets during scene transitions reduce rebuffering without hurting responsiveness?
These are no longer “rhetorical” questions. They become testable hypotheses in A/B tests. In the end, also the effect of those technical improvement can be cross-tested to evaluate impact on engagement KPIs.
A Multi-Metric approach for Quality
A robust video quality framework should rely on multiple complementary metrics, each covering different aspects of visual fidelity and viewer perception. Among the most effective today we can mention:
• VMAF (Video Multi-Method Assessment Fusion): the industry workhorse, effective for general perceptual quality measurement but with some well-known limits.
• CAMBI: excels at detecting banding artifacts, particularly in gradients and low-light scenes.
• IMAX XVS® Suite: a professional-grade toolkit focused on frame-by-frame perceptual video quality under strict visual norms.
While these tools are powerful, relying on a single score (especially VMAF alone) can be misleading. Over-optimization is always a risk and can create a false sense of control. In my measurement and optimization practices I’m profitably mixing VMAF with the IMAX XVS Suite and other complexity-indicators (i.e. NTT Data HHPower and bIndex) to have a more comprehensive and accurate multi-metric point of view. I found the IMAX XVS® Suite and in particular the NR-XVS® metric, to be significantly complementary to VMAF, VMAF neg and Banding Metrics (like Cambi and bIndex) and compensate for the limit of VMAF and the risk of overfitting that can emerge from optimizing video streaming to maximize a single metric.
VMAF provides an estimation of the fidelity of the encoded video while NR-XVS® provides a more absolute score on the pleasantness of the video sequence. This difference can be exploited to gain a deeper insight into the actual quality perceived by users in different viewing conditions and achieve different and more extreme level of optimization mitigating at the same time the risk of overfit.
A multi-metric approach is not just more accurate and less error prone—it’s also more actionable. When metrics remain individually exposed, they become input channels for predictive models capable of:
- Better estimating real QoE in various streaming conditions and devices
- Recognizing perception thresholds/corner cases based on content and context,
- And enabling CAE (Context-Aware Encoding), which dynamically adapts renditions, encoding parameterizations and delivery strategies to match the complexity and relevance of what is actually being watched.
In short, the final frontier is no longer just measuring quality. It’s about understanding it at scale, in real time with the objective to “master” it.
Because when you can see the quality of experience as clearly as the bitrate or frame rate, you unlock the ultimate lever of competitive advantage: the ability to shape, not just serve, viewer expectations.
Conclusion: Engagement as a Strategic Monopolizer of User Time
While high engagement is a clear sign of user satisfaction (and a powerful antidote to churn) it also triggers a second, often underestimated dynamic: the monopolization of user attention. As engagement crosses certain thresholds, the user’s available time becomes saturated by the platform itself, leaving little or no opportunity for competitors to intervene.
The more time a user spends within a service, the less cognitive and emotional bandwidth remains for rival platforms, whether they’re other streaming services, social media, or even gaming ecosystems.
Netflix has clearly embraced this logic: Its expansion into cloud gaming, mobile content, and future social-like features reflects an ambition to anchor itself deeper into the daily attention economy of the household. After all, it’s not just Prime Video or Disney+ that compete for screen time, it’s also TikTok, YouTube Shorts, and Instagram.
By capturing more moments in the user’s day, Netflix isn’t just increasing satisfaction. It’s preemptively reducing the window of opportunity for any other platform to insert itself. In the modern digital landscape, where time is the ultimate finite resource, owning the user’s attention equals owning the market.



































