← Dealbook
X Square logo

X Square

X Square is positioning as a series a horizontal AI infrastructure play, building foundational capabilities around agentic architectures.

series aHorizontal AIGenAI: corewww.x2robot.com
$143.4Mraised
Why This Matters Now

As agentic architectures emerge as the dominant build pattern, X Square is positioned to benefit from enterprise demand for autonomous workflow solutions. The timing aligns with broader market readiness for AI systems that can execute multi-step tasks without human intervention.

X Square is a company aiming to create a universal brain-cerebellum system for robots, encompassing capabilities from perception to action.

Core Advantage

Unified, proprietary WALL-A large model enabling perception, reasoning, and action in a single end-to-end system, tightly integrated with custom hardware for high-fidelity, generalist manipulation.

Agentic Architectures

high

X Square's WALL-A model and robots are described as capable of autonomous perception, decision making, and action, with multi-step reasoning and tool use in complex environments. The system orchestrates perception, understanding, and action in a unified pipeline, aligning with agentic architectures.

What This Enables

Full workflow automation across legal, finance, and operations. Creates new category of "AI employees" that handle complex multi-step tasks.

Time Horizon12-24 months
Primary RiskReliability concerns in high-stakes environments may slow enterprise adoption.

Vertical Data Moats

high

X Square leverages proprietary, industry-specific data from real-world robot deployments in domains like home service, education, logistics, and industrial assembly. This creates a vertical data moat, providing a competitive advantage for training and improving embodied intelligence models.

What This Enables

Unlocks AI applications in regulated industries where generic models fail. Creates acquisition targets for incumbents.

Time Horizon0-12 months
Primary RiskData licensing costs may erode margins. Privacy regulations could limit data accumulation.

Continuous-learning Flywheels

medium

There are indications of ongoing model and hardware optimization based on deployment feedback, suggesting a continuous learning loop. Robots are described as quickly mastering new skills, implying rapid adaptation and improvement from real-world usage.

What This Enables

Winner-take-most dynamics in categories where well-executed. Defensibility against well-funded competitors.

Time Horizon24+ months
Primary RiskRequires critical mass of users to generate meaningful signal.

Micro-model Meshes

medium

The mention of multi-modal models and specialized control for different robot components (e.g., high degree-of-freedom hands, tool use, auditory/visual interaction) suggests the possible use of multiple specialized models working together, though not explicitly stated.

What This Enables

Cost-effective AI deployment for mid-market. Creates opportunity for specialized model providers.

Time Horizon12-24 months
Primary RiskOrchestration complexity may outweigh benefits. Larger models may absorb capabilities.
Technical Foundation

X Square builds on GreatWall, WALL-A. The technical approach emphasizes fine tuning.

Competitive Context

X Square operates in a competitive landscape that includes Unitree Robotics, Tesla Optimus (Tesla, Inc.), Agibot (Agibot Robotics).

Unitree Robotics

Differentiation: X Square emphasizes a unified, end-to-end large model (WALL-A) for perception-to-action and claims to be among the first in China to use a fully end-to-end approach for embodied intelligence. Unitree is more hardware-centric and less focused on large model-driven autonomy.

Tesla Optimus (Tesla, Inc.)

Differentiation: Tesla Optimus is positioned as a general-purpose humanoid but is less open about its AI stack and model architecture. X Square claims earlier adoption of end-to-end embodied large models and highlights multi-modal, multi-task generalization and fine manipulation.

Agibot (Agibot Robotics)

Differentiation: X Square claims to be among the earliest in China with end-to-end embodied large models and focuses on few-shot learning, long-sequence manipulation, and high-fidelity dexterous hands. Agibot is less explicit about end-to-end model scale and integration.

Notable Findings

X Square is among the earliest in China to pursue a fully end-to-end architecture for 'general embodied intelligence'—meaning their models go directly from raw multimodal sensory input to low-level robotic actuation, bypassing traditional modular pipelines (perception, planning, control). This is still rare globally, especially at production scale.

Their 'GreatWall' (WALL-A) model is described as a '操作大模型' (operation foundation model) with '百亿级参数' (tens of billions of parameters), suggesting a scale similar to leading LLMs but focused on embodied tasks. The model claims to support cross-task, cross-domain generalization, few-shot learning, and long-horizon reasoning for both rigid and deformable object manipulation.

X Square tightly couples hardware and software co-design, iterating both in parallel. The robot body is 'continuously optimized' for multi-modal model control, which is a non-trivial engineering challenge and increases defensibility: most competitors either buy off-the-shelf robots or treat hardware as a secondary concern.

The platform demonstrates high dexterity (five-fingered 'ArtiXon Hand'), dual-arm coordination, and real-world deployment in multi-step, long-horizon tasks (e.g., industrial wire harness sorting, hotel tissue replacement, cold drink preparation, manual assembly with manuals). This breadth of manipulation tasks, especially with flexible and deformable objects, is technically challenging and not widely demonstrated.

They emphasize 'embodied chain-of-thought' (具身思维链) and 'physical process causal reasoning'—implying their models attempt to reason about causal relationships in the physical world, not just pattern-match. This is a frontier area in robotics AI research.

Risk Factors
overclaimingmedium severity

The site uses heavy buzzwords (e.g., '端到端架构', '百亿级参数', '全流程智能统一', '全球领先') and makes broad claims about technical leadership and capabilities (e.g., '已在多个维度达到全球领先', '速度媲美人类操作') without providing concrete technical benchmarks, demos, or published results to substantiate these claims.

no moatmedium severity

While the company claims proprietary models (GreatWall, WALL-A), there is little evidence of a strong data or technical moat. The approach (fine-tuning, end-to-end architectures) is increasingly common in the embodied AI/robotics space, and no unique data advantage or ecosystem lock-in is articulated.

feature not productlow severity

The core value proposition centers on a generalist embodied AI model controlling robots across tasks. This could be subsumed by larger platform players (e.g., Tesla, Boston Dynamics, OpenAI) as a feature of their broader robotics stack, unless X Square can show unique integration or verticalization.

What This Changes

If X Square achieves its technical roadmap, it could become foundational infrastructure for the next generation of AI applications. Success here would accelerate the timeline for downstream companies to build reliable, production-grade AI products. Failure or pivot would signal continued fragmentation in the AI tooling landscape.

Source Evidence(12 quotes)
"自变量机器人聚焦自研“通用具身智能大模型”,构建具备精细操作能力的通用机器人,是国内最早采用完全端到端路径实现通用具身智能大模型的公司之一。"
"公司自研的「GreatWall」操作大模型系列的WALL-A,具备自主感知、决策与高精度操作能力,已在多个维度达到全球领先。"
"围绕自研的 WALL-A 操作大模型,实现从感知、理解到动作控制的全流程智能统一。"
"彻底突破传统机器人依赖规则与编程的局限,实现在复杂环境中的高精度、自主化作业。"
"端到端架构"
"百亿级参数"