K
Watchlist
← Dealbook
X Square logoXS

X Square

Horizontal AI
B
4 risks

X Square is positioning as a series b horizontal AI infrastructure play, building foundational capabilities around agentic architectures.

x2robot.com
series bGenAI: coreShenzhen, China
$293.3Mraised
11KB analyzed14 quotesUpdated May 1, 2026
Event Timeline
Why This Matters Now

As agentic architectures emerge as the dominant build pattern, X Square is positioned to benefit from enterprise demand for autonomous workflow solutions. The timing aligns with broader market readiness for AI systems that can execute multi-step tasks without human intervention.

X Square develops robotic systems and automation solutions designed for industrial and service applications.

Core Advantage

Integrated end‑to‑end embodied foundation model + proprietary real‑world data + soft‑hardware co‑design (dexterous five‑finger hands, tactile/force control) that enables zero/low‑shot generalization on long, complex physical manipulation sequences.

Build SignalsFull pattern analysis

Agentic Architectures

8 quotes
high

The system is described as an autonomous perception→decision→action loop running on a manipulator platform. Descriptions of long-sequence tasks, physical "thinking-chain" reasoning, proactive sensing and decision-making, and zero-shot generalization indicate agent-like components that plan and execute multi-step manipulations and use internal tool/skill repertoires rather than single-step mapping.

What This Enables

Full workflow automation across legal, finance, and operations. Creates new category of "AI employees" that handle complex multi-step tasks.

Time Horizon12-24 months
Primary RiskReliability concerns in high-stakes environments may slow enterprise adoption.

Continuous-learning Flywheels

5 quotes
high

They emphasize real-world data collection and ongoing soft‑hardware co‑iteration, implying a feedback loop where deployment data (and task outcomes) are used to update models and controllers. The combination of field deployments, an ecosystem plan and claims of continuous optimization point to a production flywheel improving models from operational data.

What This Enables

Winner-take-most dynamics in categories where well-executed. Defensibility against well-funded competitors.

Time Horizon24+ months
Primary RiskRequires critical mass of users to generate meaningful signal.

Vertical Data Moats

5 quotes
high

They appear to build a domain-specific moat from proprietary physical‑world sensor data and hardware co‑design. The emphasis on real-world datasets, custom robot bodies/sensors and task-specific performance across many contact-rich manipulation tasks signals an industry-specific data advantage that’s hard to replicate.

What This Enables

Unlocks AI applications in regulated industries where generic models fail. Creates acquisition targets for incumbents.

Time Horizon0-12 months
Primary RiskData licensing costs may erode margins. Privacy regulations could limit data accumulation.

Micro-model Meshes (multi-model orchestration)

4 quotes
emerging

There are hints of multiple models or model versions (series) and multi‑modal capabilities, but the text repeatedly stresses a single end‑to‑end large 'WALL-A' model. No explicit router, expert-selection, MoE, or per-task model routing is described. This suggests possible modular/specialized components but not a clearly articulated micro‑model mesh.

What This Enables

Emerging pattern with potential to unlock new application categories.

Time Horizon12-24 months
Primary RiskLimited data on long-term viability in this context.
Technical Foundation

X Square builds on GreatWall, WALL-A, WALL-OSS, leveraging In-house / self-developed infrastructure. The technical approach emphasizes hybrid.

Model Architecture
Primary Models
WALL-A (GreatWall 操作大模型系列)WALL-OSS (开源具身模型,WALL-OSS)
Fine-tuning

Not specified in content; implied adaptation on real-world sensor/action datasets and use of few-shot, cross-task transfer and reinforcement learning (no explicit LoRA/FT method mentioned). — “以真实世界数据为主要数据来源” — real-world multimodal sensor + interaction data from robot bodies

Inference Optimization
On-robot / tightly coupled deployment implied by '搭载模型' and '软硬一体同步迭代' (edge / robot-hosted inference)No explicit mentions of quantization, distillation, batching, or hardware accelerators in the provided content.
Team
• Founder / CEO (anonymous)high technical

Public materials describe the founding team as composed of experts from world-renowned AI and robotics laboratories and top universities; no individual names disclosed in the provided content.

Founder-Market Fit

The stated background—来自世界知名AI与机器人实验室的专家、全球最早提出注意力机制的研究者、千亿级大模型算法负责人等—aligns with building an end-to-end embodied AI robot platform; strong market fit for leading both hardware and AI software; however, lack of disclosed founder identities makes it harder to assess track records and execution credibility.

Engineering-heavyML expertiseDomain expertiseHiring: R&D/AI researchersHiring: robotics hardware engineersHiring: software engineers for platform integration
Considerations
  • • No disclosed founder names or bios; difficulty assessing individual prior track records
  • • High-level claims without independent verification (e.g., 'global leading' or 'first to adopt end-to-end path')
  • • Limited public detail on team size, organizational structure, or governance
Business Model
Go-to-Market

sales led

Target: enterprise

Pricing

custom

Enterprise focus
Sales Motion

field sales

Distribution Advantages
  • • 软硬件一体化自研平台与控制大模型 WALL-A 提供统一入口
  • • 自主研发模型与机器人本体,降低对外部依赖
  • • 在国内率先实现端到端具身大模型,形成技术与产品壁垒
  • • 多场景落地与生态合作计划提升网络效应与市场渗透
Customer Evidence

• 投资者背书:字节跳动、红杉中国、深创投等

• 政府/高校参与生态建设计划与行业认知提升

Product
Stage:general availability
Differentiating Features
国内最早采用端到端路径实现通用具身智能大模型自研 WALL-A 操作大模型,强调端到端统一的控制链自研并优化机器人本体以适配多模态大模型控制WALL-OSS 开源具身模型,全球开源模型性能竞争极短时间掌握新技能与零样本泛化能力
Integrations
没有明确列出第三方集成平台
Primary Use Case

在真实物理世界中的高精度自主作业,跨场景执行通用任务

Novel Approaches
End-to-end embodied foundation model (WALL-A / WALL-OSS)Novelty: 8/10Model Architecture & Selection

Moving the entire perception->reasoning->control pipeline into a single end-to-end embodied foundation model at '百亿级参数' scale (and open-sourcing a variant) departs from traditional modular robotics stacks and aligns more with recent large-model trends but executed for real physical control.

Unified single-model pipeline instead of multi-model orchestrationNovelty: 7/10Compound AI Systems

This single-model emphasis for real-world robotic control (not just simulation) is relatively rare in deployed robotics and signals a deliberate architectural choice to favor end-to-end learned control over hand-engineered modular pipelines.

Internal spatio-temporal memory and physical 'thinking chain' for long-horizon tasksNovelty: 8/10Retrieval & Knowledge

Claiming an internal chain-of-thought specialized for physical process reasoning (rather than pure language CoT) focused on manipulation, deformation and fluid dynamics is unusual and, if true, important for long-horizon robotic autonomy.

Competitive Context

X Square operates in a competitive landscape that includes Boston Dynamics, Tesla / Optimus, Figure (humanoid robotics).

Boston Dynamics

Differentiation: X Square emphasizes end-to-end embodied foundation models (WALL‑A) for fine manipulation and multi‑step task generalization, plus soft‑hardware co‑design and dexterous five‑finger hands. Boston Dynamics is strongest on dynamic mobility and robust field platforms, less focused on large-scale end‑to‑end multi‑modal manipulation models and open foundation models.

Tesla / Optimus

Differentiation: X Square presents deployed multi-step manipulation capabilities today (dexterous tasks, deformable object handling) and an explicitly model-driven product line with open/supported WALL‑OSS models; X Square stresses tactile/force control, mm-level manipulation and soft/hardware joint iteration. Tesla’s approach is high-volume humanoid hardware integration with different scale and ecosystem (and a mainly in‑house, vertically scaled approach).

Figure (humanoid robotics)

Differentiation: X Square differentiates by coupling a claimed end‑to‑end embodied large model (WALL‑A) with proprietary dexterous end‑effectors (ArtiXon Hand), specialized tactile/force control, and claims of better zero/low‑shot generalization across tasks — positioning as a 'brain + body' platform rather than primarily a hardware integrator.

Notable Findings

They present a single end-to-end 'embodied' large model (WALL-A / GreatWall series) that claims to go directly from multi-modal perception to high-precision motor commands. Unlike the common perception→planner→controller split, they emphasize a unified transformer-like model driving real actuators.

They explicitly advertise a '具身思维链' (embodied chain-of-thought) — applying chain-of-thought / causal physical reasoning to long-horizon physical processes (e.g., multi-step cooking, assembly). This is an unusual attempt to transplant LLM-style internal reasoning strategies into closed-loop robot control.

Tight soft‑hardware co-design: the product descriptions imply the robot morphology (high-DOF arms, ArtiXon five‑finger hand, chassis) and sensor suite were iteratively optimized to the model, not vice‑versa. That suggests joint design / parameterization of hardware kinematics and learned policy, not just software on off-the-shelf arms.

Targeting extremely hard contact-rich domains (fluids, deformable objects, cloth, zippers, wire harnesses) and claiming millimeter / sub‑millimeter force/position control with tactile feedback — these are domains most robotics teams avoid or treat modularly; doing them end‑to‑end is technically unusual and demanding.

Large heterogeneous sensor stack (2D LiDAR, ultrasound, RGBD, 3D‑TOF, single‑point TOF, IR, tactile) feeding one unified model. Integrating multi-rate, multi-modal streams (vision + touch + range + proprioception) into a single policy is nontrivial and atypical for deployed products.

Risk Factors
Overclaiminghigh severity
No Clear Moatmedium severity
Feature, Not Productmedium severity
Undifferentiatedmedium severity
What This Changes

If X Square achieves its technical roadmap, it could become foundational infrastructure for the next generation of AI applications. Success here would accelerate the timeline for downstream companies to build reliable, production-grade AI products. Failure or pivot would signal continued fragmentation in the AI tooling landscape.

Source Evidence(14 quotes)
“自变量机器人聚焦自研“通用具身智能大模型”, 构建具备精细操作能力的通用机器人, 是国内最早采用完全端到端路径实现通用具身智能大模型的公司之一。”
“自研的「GreatWall」操作大模型系列的WALL-A,具备自主感知、决策与高精度操作能力”
“围绕自研的 WALL-A 操作大模型, 实现从感知、理解到动作控制的全流程智能统一。”
“端到端架构 百亿级参数 跨任务跨场景泛化”
“端到端架构的具身基础模型 自主感知 - 决策 - 行动 支持真实物理世界跨模态推理”
“端到端具身模型驱动”