QGI Logo QGI
Products

A deterministic stack. Not another model.

QGI ships five layers plus a delivery surface. Every layer is designed for regulated workflows where decisions must be explainable, replayable, and defended under examination.

Q-Prime is live. The QAG Engine is in public preview. Qualtron and Quantum Graph Memory (QGM) are coming soon — join the waitlist for each. Neural Symbolic Agents is the enterprise runtime that ties it together. Q6 is listed as the EQ-focused model planned for Uniti Q, and Enterprise Blueprints is how every layer reaches production.

The QGI deterministic-AI stack is composed of five layers, from the foundation up: Q-Prime (quantum-structured embedding model, production), the QAG Engine (Quantum-Augmented Generation reasoning, public preview), Qualtron (composite 4M-context generation, coming soon), Quantum Graph Memory (QGM, time-aware memory, coming soon), and Neural Symbolic Agents (enterprise runtime). Enterprise Blueprints provide the delivery surface that wires every layer into regulated workflows.
Why a stack, not a model

Classical RAG gives you a confidence score. A stack gives you a defensible decision.

Regulators, auditors, and disputed-claim procedures do not accept a probability. They ask which rule, which data, and which conflict produced the outcome. That question cannot be answered by a single model — it requires a reasoning architecture.

Classical RAG + LLM

Probabilistic. Opaque. Non-replayable.

  • Retrieval hides upstream failures — 90% of errors originate before generation
  • Two different rationales from the same prompt on two different calls
  • Contradictions between rules and overlays vanish into the latent space
  • No trace a regulator can replay. No chain an auditor can sign.

QGI deterministic stack

Deterministic. Explainable. Defensible.

  • Q-Prime encodes polarity, scope, conditions, and dependencies — structure, not just similarity
  • The QAG Engine exposes contradictions as one of seven named signals before generation
  • Qualtron's composite 4M-context generation stays inside the domain it was trained on
  • Every decision is traceable. Every reasoning step is replayable on regulator request.
The stack, layer by layer

Five layers. One reasoning contract.

Each layer is independently auditable and composable. You can start with Q-Prime alone, add the QAG Engine, and graduate into Neural Symbolic Agents — all without breaking the deterministic contract.

Foundation · Encoding
Live

Q-Prime

Quantum-structured embedding model

The first commercial quantum-structured embedding model. Encodes enterprise data into a quantum hypergraph that preserves polarity, scope, conditions, and cross-rule dependencies. Runs without quantum computers.

Explore Q-Prime HuggingFace model card
Platform · Reasoning
Public preview

QAG Engine

Quantum-Augmented Generation platform

The reasoning layer that replaces retrieval-centric pipelines. The Hilbert-Space Compacting layer projects high-dimensional states into seven interpretable signals the engine consumes before generation — so contradictions surface as explicit signals, not vanishing into a probabilistic score.

See the preview Request preview access
Platform · Generation
Coming soon

Qualtron

4M-context composite model architecture

A composite architecture of specialized small models that compose into a 4M-token working context for regulated generation. Designed to replace general-purpose LLMs inside the QAG Engine where domain precision beats raw scale.

Join the waitlist
Platform · Memory
Coming soon

Quantum Graph Memory

Time-aware graph memory for QAG

A time-aware graph memory substrate for QAG agents. Preserves the provenance and temporal structure of every fact, decision, and revision, so reasoning stays consistent across sessions, audits, and regulatory replays.

Join the waitlist
Runtime · Orchestration
Enterprise

Neural Symbolic Agents

Enterprise agent runtime

The execution and orchestration layer on top of the QAG Engine. Brings persistent, quantum-structured memory, dependency tracking, and conflict coordination to multi-agent regulated workflows.

Talk to the team

Delivery surface

Delivery · Integrations
Live

Enterprise Blueprints

QGI deployment templates

Production-grade reference integrations wired to Q-Prime and QAG through the QGI Enterprise Factory, so every base ships decision-grade.

Explore Enterprise Blueprints QGI Enterprise Factory on GitHub

Concrete example

Documents QAG

NVIDIA's reference retrieval blueprint, forked into the Enterprise Factory and reshipped as Documents QAG — wired to Q-Prime so contradictions in policy, contract, and regulatory corpora surface as structured signals before generation. Classical RAG becomes deterministic QAG.

See the Documents QAG blueprint →
Common questions

Questions teams ask before a pilot

How is QGI different from a classical RAG pipeline?

A classical RAG pipeline retrieves nearest-neighbor documents in a vector space and asks a large language model to compose an answer. In regulated workflows, the same prompt can yield two different rationales on two different calls. QGI replaces that with Quantum-Augmented Generation (QAG): Q-Prime encodes rules and data into a quantum-structured hypergraph that preserves polarity, scope, conditions, and cross-rule dependencies; the QAG Engine exposes contradictions as explicit signals; Neural Symbolic Agents orchestrate the decision end-to-end.

Do I need quantum hardware to run the QGI stack?

No. Q-Prime and the QAG Engine run on NVIDIA CUDA-Q and cuTensorNet on commodity GPUs. There is no QPU dependency. The "quantum" in Quantum General Intelligence refers to the structural representation — a quantum-structured hypergraph — not to the underlying hardware.

Is Q-Prime open source?

Q-Prime is available on HuggingFace as a managed API under the QGI Commercial Model License v1.0. The model weights are not distributed; access is mediated by the HuggingFace model card. Client SDKs and integration examples are open on the Quantum-General-Intelligence GitHub org.

What are Qualtron and QGM, and when will they ship?

Qualtron is a composite 4M-context model architecture of specialized small models for regulated generation. QGM (Quantum Graph Memory) is a time-aware graph memory substrate that preserves provenance across sessions and regulatory replays. Both are in design today. Waitlist access is available — join from the product pages and we will notify you when each ships.

Can QGI deploy on-premise or inside our VPC?

Yes. Enterprise engagements include VPC and on-premise deployment options alongside the managed service. See the Trust page for deployment models, data handling, and the compliance roadmap.

Ready to deploy decision-grade AI?

We run a small number of pilots per quarter in regulated industries — credit, claims, compliance, AML/KYC, and capital markets.

Partner with QGI