Model family
DMind has released seven open-source models across three generations, plus the DMind Benchmark dataset. All weights are public on Hugging Face under permissive licenses.
DMind-3 (2026, current generation)
Three models that work together as the Edge-Local-Cloud stack. See Sovereign architecture for how they fit together.
DMind-3-Nano (270M)
A 270-million-parameter model that runs in browser extensions, wallet apps, and on mobile. It does deterministic safety checks at signing time, parsing calldata, identifying unlimited approvals, and flagging suspicious patterns. Fully on-device, no network required. Built around a standardized function-calling protocol.
DMind-3-Mini (4B)
A 4-billion-parameter model that runs on a user's local machine, including consumer GPUs and recent Apple silicon. It handles private strategy reasoning and deep research using the user's portfolio as context. Trained with C³-SFT (see Training methods) so it produces an answer, critiques itself, and revises before finalizing.
DMind-3 (21B)
The 21-billion-parameter cloud model. Built on OpenAI's gpt-oss-20b base, with a custom Transformer and Multi-Scale RoPE position encoding for a 256k-token context window. Native BF16/FP16 precision. Trained on 500,000+ curated documents and multi-terabyte on-chain data. Runs in the cloud or a private VPC, not on user devices.
DMind-2 series (2025)
DMind-2 was the transitional generation, focused on tool-calling and crypto investment analysis. Both models are still usable but are no longer the primary recommendation, since DMind-3 supersedes them.
DMind-2-107B
A 107-billion-parameter flagship for crypto investment analysis with tool-calling support. The model can directly call on-chain data APIs, exchange APIs, and market data services.
DMind-2-4B
A 4-billion-parameter lightweight version of the same family. Designed for local deployment with the same investment-analysis specialization in a smaller footprint.
DMind-1 series (2025)
The first generation, and the first publicly released Web3-native LLM with open weights. Built on Alibaba's Qwen3 base models, fine-tuned on 13,276 expert-curated Web3 knowledge items distilled from 32.7GB of source documents.
DMind-1 (32B)
The original Web3-native LLM. Built on Qwen3-32B. Trained in two stages: supervised fine-tuning with LoRA on the curated dataset, then RLHF with PPO using a Web3-specific reward model. On Web3 tasks it matches the performance of much larger general-purpose models at 10–30% of the token cost.
DMind-1-mini (14B)
A distilled version built on Qwen3-14B, using both DMind-1 and a general SOTA model as dual teachers. The distillation works at three levels. The student learns the teacher's final outputs, the teacher's full probability distribution over each token, and the teacher's intermediate-layer representations. This is the most-downloaded model in the family, because most agent applications need a small, fast model rather than a flagship.
At a glance
DMind-3-Nano
270M
Browser / wallet / mobile
Transaction safety checks
DMind-3-Mini
4B
User device
Private strategy and research
DMind-3
21B
Cloud / private VPC
Market-wide research
DMind-2-107B
107B
Cloud
Investment analysis with tool calls
DMind-2-4B
4B
Local
Lightweight investment analysis
DMind-1
32B
Cloud
First Web3-native LLM
DMind-1-mini
14B
Cloud / local
Distilled Web3 agent model
All seven models, plus the benchmark dataset, are available at huggingface.co/DMindAI.
Last updated
Was this helpful?

