K
Watchlist
← Dealbook
Stats Perform logoSP

Stats Perform

Information Technology & Enterprise Software / AI/ML Platforms
D
3 risks

Stats Perform is applying vertical data moats to media content, representing a unknown vertical AI play with unclear generative AI integration.

www.statsperform.com
unknownGenAI: unclearChicago, United States
$275.0Mraised
6KB analyzed9 quotesUpdated May 1, 2026
Event Timeline
Why This Matters Now

With foundation models commoditizing, Stats Perform's focus on domain-specific data creates potential for durable competitive advantage. First-mover advantage in data accumulation becomes increasingly valuable as the AI stack matures.

Stats Perform is a sports data and technology platforms.

Core Advantage

The combination of an extremely large, proprietary historical and event-level sports dataset (Opta) plus in-house foundation sports AI models that have been productized across a wide set of commercial modules (predictions, metrics, content widgets, research).

Build SignalsFull pattern analysis

Vertical Data Moats

4 quotes
high

Clear use of a large, proprietary, domain-specific dataset (Opta) as a competitive advantage. They position long-tail, high-quality historical and live sports data as the core asset that underpins models, products and commercial offerings.

What This Enables

Unlocks AI applications in regulated industries where generic models fail. Creates acquisition targets for incumbents.

Time Horizon0-12 months
Primary RiskData licensing costs may erode margins. Privacy regulations could limit data accumulation.

RAG (Retrieval-Augmented Generation)

2 quotes
medium

Strong indication they combine retrieval from large structured/unstructured proprietary sports datasets (Opta feeds) with generative/prediction models to produce narratives, metrics and live commentary. The wording implies embedding/search plus generation for fan-facing content and predictions.

What This Enables

Accelerates enterprise AI adoption by providing audit trails and source attribution.

Time Horizon0-12 months
Primary RiskPattern becoming table stakes. Differentiation shifting to retrieval quality.

Continuous-learning Flywheels

2 quotes
emerging

Implied but not explicit: product feedback and new coverage likely feed model improvements (e.g., new metrics, sharper predictions). Language suggests iterative improvement of models using ongoing coverage and research, but there is no direct statement about live feedback loops, A/B testing or user-correction pipelines.

What This Enables

Winner-take-most dynamics in categories where well-executed. Defensibility against well-funded competitors.

Time Horizon24+ months
Primary RiskRequires critical mass of users to generate meaningful signal.
Technical Foundation

Stats Perform builds on OptaAI. The technical approach emphasizes unknown.

Model Architecture
Primary Models
8 proprietary 'foundation sports AI models' (brand context: OptaAI / Stats Perform foundation models) -- specific model names not provided
Team
Founder-Market Fit

insufficient data: no founder names or backgrounds identified in provided content; cannot assess fit

Engineering-heavyML expertiseDomain expertiseHiring: data engineerHiring: machine learning engineerHiring: research scientistHiring: AI/ML product/engineering roles
Considerations
  • • No identifiable founders or leadership bios in provided content; limited transparency on founding team
  • • Reliance on marketing copy with limited verifiable details in this excerpt
Business Model
Go-to-Market

content marketing

Target: enterprise

Pricing

custom

Enterprise focus
Sales Motion

field sales

Distribution Advantages
  • • Proprietary Opta data feeds and AI models
  • • Global brand trust and recognition
  • • Extensive coverage across leagues, broadcasters, media, apps, bookmakers and teams
Customer Evidence

• logos: FC Barcelona, Mastercard, Asahi, TikTok

• Opta Analyst research cited by global media and broadcasters

Product
Stage:mature
Differentiating Features
Massive data scale (7.2PB) with 8 AI foundation models8 foundation AI models powering 200+ software modulesIntegrated ecosystem spanning data, analysis, editorial content and fan experiencesOpta Analyst and widely cited analytics research
Integrations
Broadcasters, media, apps, leagues, federations, bookmakersBrand activations with entities like FC Barcelona, Mastercard, Asahi, TikTok
Primary Use Case

Deliver data-driven sports content, insights and experiences to engage fans and drive revenue for rights holders, broadcasters and brands

Novel Approaches
Vertical data moat: massive proprietary sports dataset (Opta) powering models and productsNovelty: 7/10Data Strategy

The combination of scale (7.2 PB), vertical focus (sports/Opta) and multi-decade coverage is a significant domain-specific moat, enabling rich feature engineering and domain-specific metrics that general-purpose datasets cannot replicate.

Competitive Context

Stats Perform operates in a competitive landscape that includes Sportradar, Genius Sports, Second Spectrum.

Sportradar

Differentiation: Stats Perform emphasizes proprietary Opta event-level data combined with in-house foundation sports AI models and a broad content/product suite (predictions, editorial widgets, research) at large scale; Sportradar is stronger on betting integrity services and odds distribution.

Genius Sports

Differentiation: Genius focuses on official league partnerships and betting integrations; Stats Perform positions itself as an AI-first company with deep historical Opta data, predictive models and a large set of software modules for media, teams and brands beyond betting.

Second Spectrum

Differentiation: Second Spectrum is often centered on computer vision and visual analytics; Stats Perform combines long-established human-collected event data (Opta), vast historical archives and generalizable foundation AI models plus packaged content modules and prediction services.

Notable Findings

Massive proprietary data asset (7.2 PB) focused on sports/Opta: This is not just big data for the sake of it — the size implies long-running, high-granularity time series (40+ years of global coverage), multi-modal sources (event logs, tracking, video) and extensive labels. That volume enables training specialized models that general public datasets simply can't reproduce.

Eight "foundation sports AI models": Instead of one general LLM/CV model, they appear to have invested in multiple domain-specialized foundation models (likely covering event prediction, trajectory forecasting, CV-based player detection/pose, natural-language generation for commentary, betting/prediction models, recommender/engagement models, and potentially anomaly/differential analysis). This multi-foundation approach is a technically distinct choice: vertically specialized backbones per subtask, then assembled into product modules.

Productized, composable inference surface (200+ software modules): Claiming 200+ modules implies a microservice/feature-store + model-as-a-service architecture that exposes small, reusable capabilities (e.g., xG calculation, pass chain detection, highlight identification, personalized feed generator). That leads to rapid recombination of capabilities for bespoke publisher/broadcaster integrations.

Tight integration of research + production: 'In-house researchers' plus large historical coverage suggests a closed loop where research outputs (new metrics, models) are quickly operationalized across modules — a nontrivial MLOps pipeline connecting experimentation, feature engineering on massive historical corpora, and low-latency production usage.

Real-time, latency-sensitive delivery to broadcast and betting partners: The commercial focus (broadcasters, bookmakers, live widgets) implies engineered solutions for streaming ingestion, near-real-time feature computation, and high-throughput low-latency inference — an implementation challenge beyond batch model hosting.

Risk Factors
Overclaimingmedium severity
Feature, Not Productmedium severity
No Clear Moatlow severity
What This Changes

Stats Perform's execution will test whether vertical data moats can deliver sustainable competitive advantage in media content. A successful outcome would validate the vertical AI thesis and likely trigger increased investment in similar plays. Incumbents in media content should monitor closely for early signs of customer adoption.

Source Evidence(9 quotes)
“Stats Perform is the world leader in sports AI.”
“8 foundation sports AI models used in 200+ software modules”
“AI-powered predictions”
“generate new metrics”
“OptaAI powered magical fan experiences”
“Tell new stories with analytics platforms and our incredible in-house researchers”