Live Foundation · Encoding

Q-Prime

Quantum-structured embedding model

The first commercial quantum-structured embedding model. Q-Prime is the foundation layer of the QGI deterministic stack — where polarity, scope, conditions, and cross-rule dependencies are preserved as structure, not collapsed into a similarity score.


Distributed under the QGI Commercial Model License v1.0. No QPU required — runs on NVIDIA CUDA-Q.

Capabilities

What Q-Prime does differently

Every capability below is a property you can test today. Request access on HuggingFace, run your own regulated-domain evaluation, and compare against whichever classical embedding model you use now.

Architecture

Quantum-structured hypergraph

Enterprise data — guidelines, overlays, contracts, policies, loan files — encoded as a hypergraph that preserves polarity (what holds), scope (where it holds), conditions (when it holds), and cross-rule dependencies. Traditional vector embeddings lose that structure; Q-Prime preserves it.
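To make the four preserved properties concrete, here is a minimal sketch of a rule hypergraph in plain Python. It is an illustration only, not Q-Prime's internal representation or API: the names `RuleEdge`, `RuleHypergraph`, and `Polarity`, the two polarity values, and the sample lending rules are all assumptions.

```python
from dataclasses import dataclass, field
from enum import Enum


class Polarity(Enum):
    # Hypothetical two-valued polarity; the real encoding may differ.
    POSITIVE = "positive"   # the rule asserts something holds
    NEGATIVE = "negative"   # the rule asserts something does not hold


@dataclass(frozen=True)
class RuleEdge:
    """One hyperedge: a single rule connecting any number of entities."""
    rule_id: str
    entities: frozenset            # nodes joined by this hyperedge
    polarity: Polarity             # what holds
    scope: str                     # where it holds
    conditions: tuple = ()         # when it holds
    depends_on: tuple = ()         # cross-rule dependencies (rule ids)


@dataclass
class RuleHypergraph:
    edges: dict = field(default_factory=dict)

    def add(self, edge: RuleEdge) -> None:
        self.edges[edge.rule_id] = edge

    def dependencies(self, rule_id: str) -> list:
        """Return every rule reachable through depends_on, sorted by id."""
        seen = set()
        stack = list(self.edges[rule_id].depends_on)
        while stack:
            current = stack.pop()
            if current in seen:
                continue
            seen.add(current)
            if current in self.edges:
                stack.extend(self.edges[current].depends_on)
        return sorted(seen)


# Illustrative rules only; ids, scopes, and thresholds are made up.
graph = RuleHypergraph()
graph.add(RuleEdge("R1", frozenset({"borrower", "dti"}), Polarity.POSITIVE,
                   "conventional", ("dti <= 0.43",)))
graph.add(RuleEdge("R2", frozenset({"borrower", "dti", "reserves"}), Polarity.NEGATIVE,
                   "conventional", ("reserves < 6 months",), ("R1",)))
graph.add(RuleEdge("R3", frozenset({"loan", "overlay"}), Polarity.POSITIVE,
                   "investor overlay", (), ("R2",)))
```

Calling `graph.dependencies("R3")` walks the dependency chain through R2 to R1. A flat vector has no field where that chain could live.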

Distribution

Commercial model, managed API

Distributed on HuggingFace as a managed API under the QGI Commercial Model License v1.0. Weights are not shipped. Access is requested through the HuggingFace model card, and the managed API is what gives you predictable latency, provenance, and audit trails from day one.

Runtime

No QPU required

Runs on NVIDIA CUDA-Q and cuTensorNet on commodity GPUs. The "quantum" in Q-Prime is the representation, not the hardware — so you can adopt the foundation layer of the deterministic stack without waiting for a fault-tolerant quantum computer.

Property

Deterministic inference

The same input produces the same structural encoding every time. Two different calls from two different regions give you the same scope and dependencies — exactly what a regulator expects when they ask the system to replay its reasoning.
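One way to test this property in your own evaluation is to fingerprint each structural encoding and confirm that repeated calls agree. The sketch below assumes the encoding can be serialized as JSON. `stub_encode` is a hypothetical stand-in for whatever client call you use and is not the Q-Prime API.

```python
import hashlib
import json


def fingerprint(encoding: dict) -> str:
    """Hash an encoding so that key order does not affect the result."""
    canonical = json.dumps(encoding, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def replays_identically(encode, text: str, runs: int = 3) -> bool:
    """Call encode(text) several times and check every fingerprint matches."""
    prints = {fingerprint(encode(text)) for _ in range(runs)}
    return len(prints) == 1


def stub_encode(text: str) -> dict:
    """Hypothetical stand-in for an encoder call; not the Q-Prime API."""
    words = text.lower().split()
    return {
        "scope": words[0] if words else "",
        "dependencies": sorted(set(words[1:])),
    }


print(replays_identically(stub_encode, "conventional loans require reserves"))
```

Storing the fingerprint next to each decision gives you a cheap replay check later: re-encode the same input and compare hashes.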

Audit

Regulatory-grade provenance

Every embedding carries its rule ids, source documents, and condition graph. When an adverse-action letter is challenged, the chain from input sentence to output structure survives examination without an expert-witness reconstruction.
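As a rough picture of what such a record could look like, the sketch below bundles the three provenance elements named above and flattens them into a reviewable trail. The `Provenance` class, the `audit_trail` helper, and the sample values are assumptions for illustration, not the format Q-Prime emits.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Provenance:
    """Hypothetical provenance record attached to one embedding."""
    rule_ids: tuple
    source_documents: tuple
    condition_graph: dict      # condition -> conditions it depends on


def audit_trail(sentence: str, prov: Provenance) -> list:
    """Flatten a provenance record into ordered, human-readable lines."""
    lines = ["input: " + sentence]
    lines += ["rule: " + rule_id for rule_id in prov.rule_ids]
    lines += ["source: " + doc for doc in prov.source_documents]
    for condition in sorted(prov.condition_graph):
        parents = prov.condition_graph[condition]
        suffix = " <- " + ", ".join(parents) if parents else ""
        lines.append("condition: " + condition + suffix)
    return lines


# Sample values are invented for the example.
prov = Provenance(
    rule_ids=("R1", "R2"),
    source_documents=("underwriting-guide.pdf",),
    condition_graph={
        "dti <= 0.43": (),
        "reserves >= 6 months": ("dti <= 0.43",),
    },
)
for line in audit_trail("Applicant denied for insufficient reserves.", prov):
    print(line)
```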

Integration

Drop-in for retrieval pipelines

Q-Prime can slot into existing retrieval-centric pipelines as the encoding layer. Your vector store API stays in place; the encoder returns structured output instead of a flat similarity score.
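A minimal sketch of what that could mean for a retriever: the familiar `search(query, k)` call keeps working, and each hit also carries structure that can be filtered on. `StructuredStore`, the `structure` dictionary, and the `scope` filter are hypothetical names; the vectors are toy values and no real vector store or Q-Prime call is involved.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Hit:
    doc_id: str
    score: float
    structure: dict    # hypothetical structured payload kept beside the vector


def cosine(a, b) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class StructuredStore:
    """Toy in-memory store: similarity search plus an optional scope filter."""

    def __init__(self):
        self._rows = []

    def add(self, doc_id, vector, structure):
        self._rows.append((doc_id, list(vector), dict(structure)))

    def search(self, query, k=3, scope=None):
        hits = [
            Hit(doc_id, cosine(query, vector), structure)
            for doc_id, vector, structure in self._rows
            if scope is None or structure.get("scope") == scope
        ]
        return sorted(hits, key=lambda h: h.score, reverse=True)[:k]


store = StructuredStore()
store.add("fha-1", [1.0, 0.0], {"scope": "FHA", "polarity": "positive"})
store.add("conv-1", [0.9, 0.1], {"scope": "conventional", "polarity": "negative"})
store.add("conv-2", [0.0, 1.0], {"scope": "conventional", "polarity": "positive"})

top = store.search([1.0, 0.0], k=2)
scoped = store.search([1.0, 0.0], k=2, scope="conventional")
```

Without the filter, the closest hit is the FHA rule even if the question concerns a conventional loan. With `scope="conventional"`, that hit is excluded before ranking.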

Pairing

Paired with the QAG Engine

Q-Prime is the foundation that the QAG Engine reasons over. Together they form the bottom two layers of the deterministic stack. Using Q-Prime alone is valuable; pairing it with QAG unlocks the seven Hilbert-Space Compacting (HSC) signals that make contradictions visible before generation.

Developer

Open client SDKs

Client libraries, integration examples, and reference retrievers are published on the Quantum-General-Intelligence GitHub org. Internal tooling, benchmarks, and red-team evaluations are also available under a pilot engagement.

Training

Trained on regulated-domain corpora

Pre-trained on compliance, contract, regulatory, and lending corpora, not just the open web. The encoding is calibrated for the vocabulary and structure of regulated decisions, which is exactly where probabilistic embeddings struggle.

Pair with the QAG Engine

Q-Prime is the foundation. The QAG Engine is the reasoning.

Q-Prime encodes structure. The QAG Engine consumes that structure through the Hilbert-Space Compacting layer, which projects high-dimensional states into seven interpretable signals: Relevance, Conflict, Overlap, Redundancy, Coverage, Coherence, and Topology.
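To show how the seven signals might be consumed downstream, the sketch below holds them in one record and gates generation on two of them. Only the signal names come from the text above. The `HSCSignals` class, the assumption that each signal is a float between 0 and 1, the thresholds, and the `gate` function are illustrative and do not describe the QAG Engine's actual interface.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class HSCSignals:
    """The seven named signals; the 0..1 float range is an assumption."""
    relevance: float
    conflict: float
    overlap: float
    redundancy: float
    coverage: float
    coherence: float
    topology: float


def gate(signals: HSCSignals, max_conflict: float = 0.2,
         min_coverage: float = 0.5) -> str:
    """Decide whether to generate; thresholds are arbitrary examples."""
    if signals.conflict > max_conflict:
        return "block: contradiction"
    if signals.coverage < min_coverage:
        return "block: insufficient coverage"
    return "generate"


clean = HSCSignals(relevance=0.9, conflict=0.05, overlap=0.3, redundancy=0.1,
                   coverage=0.8, coherence=0.9, topology=0.6)
clashing = HSCSignals(relevance=0.9, conflict=0.7, overlap=0.3, redundancy=0.1,
                      coverage=0.8, coherence=0.4, topology=0.6)

print(gate(clean))
print(gate(clashing))
```

The design point is that the contradiction check runs on explicit signals before any text is generated, so a blocked request can report which signal failed.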

Next layer

QAG Engine →

The reasoning platform that runs on top of Q-Prime. Contradictions surface as explicit signals before generation. Public preview.

Concrete example

Documents QAG →

A concrete blueprint that takes NVIDIA's RAG reference implementation and remakes it as deterministic QAG, powered by Q-Prime.

Q-Prime FAQ

Questions developers ask

How do I get access to Q-Prime?

Q-Prime is available today on HuggingFace at QGI-dev/q-prime as a managed API under the QGI Commercial Model License v1.0. Access is mediated by the HuggingFace model card — request access and we will follow up with terms.

Can I self-host the weights?

Not by default. Q-Prime is distributed as a managed API so that provenance, audit, and license compliance remain intact. Self-hosting is available only under an enterprise engagement; contact us for terms.

Does Q-Prime work without the QAG Engine?

Yes. Q-Prime can be used as the encoding layer of your existing retrieval pipeline; you get structured output instead of flat similarity. You unlock the full deterministic contract by pairing it with the QAG Engine, which exposes the seven HSC signals.

What hardware do I need to run Q-Prime?

NVIDIA GPUs with CUDA-Q / cuTensorNet support. No quantum hardware required. For most evaluation workloads, a single H100 is sufficient. Production deployments scale horizontally.

Launch record

How Q-Prime came to market

Q-Prime was announced publicly on April 21, 2026, and shipped as a managed API on HuggingFace. Independent coverage from AiThority (Quantum Algorithm Engine for production AI) and Quantum Zeitgeist (the quantum-structured hypergraph) is linked from the launch note.

Read the launch announcement →

Ready to encode your regulated corpus as structure?

Request Q-Prime access on HuggingFace, or talk to us about a pilot that pairs Q-Prime with the QAG Engine inside your workflow.
