Q-Prime
Quantum-structured embedding model
The first commercial quantum-structured embedding model. Q-Prime is the foundation layer of the QGI deterministic stack — where polarity, scope, conditions, and cross-rule dependencies are preserved as structure, not collapsed into a similarity score.
Distributed under the QGI Commercial Model License v1.0. No QPU required — runs on NVIDIA CUDA-Q.
What Q-Prime does differently
Every capability below is a property you can test today. Request access on HuggingFace, run your own regulated-domain evaluation, and compare against whichever classical embedding model you use now.
Quantum-structured hypergraph
Enterprise data — guidelines, overlays, contracts, policies, loan files — encoded as a hypergraph that preserves polarity (what holds), scope (where it holds), conditions (when it holds), and cross-rule dependencies. Traditional vector embeddings lose that structure; Q-Prime preserves it.
Commercial model, managed API
Distributed on HuggingFace as a managed API under the QGI Commercial Model License v1.0. Weights are not shipped; access is mediated by the HuggingFace model card so you get predictable latency, provenance, and audit trails from day one.
No QPU required
Runs on NVIDIA CUDA-Q and cuTensorNet on commodity GPUs. The "quantum" in Q-Prime is the representation, not the hardware — so you can adopt the foundation layer of the deterministic stack without waiting for a fault-tolerant quantum computer.
Deterministic inference
The same input produces the same structural encoding every time. Two different calls from two different regions give you the same scope and dependencies — exactly what a regulator expects when they ask the system to replay its reasoning.
Regulatory-grade provenance
Every embedding carries its rule ids, source documents, and condition graph. When an adverse-action letter is challenged, the chain from input sentence to output structure survives examination without an expert-witness reconstruction.
Drop-in for retrieval pipelines
Q-Prime can slot into existing retrieval-centric pipelines as the encoding layer. You still get your vector store API — you just get structured output instead of a flat similarity score.
Paired with the QAG Engine
Q-Prime is the foundation that the QAG Engine reasons over. Together they form the bottom two layers of the deterministic stack. Using Q-Prime alone is valuable; pairing it with QAG unlocks the seven HSC signals that make contradictions visible before generation.
Open client SDKs
Client libraries, integration examples, and reference retrievers are published on the Quantum-General-Intelligence GitHub org. Internal tooling, benchmarks, and red-team evaluations are also available under pilot engagement.
Trained on regulated-domain corpora
Pre-trained on compliance, contract, regulatory, and lending corpora — not just open web. The encoding is calibrated for the vocabulary and structure of regulated decisions, which is exactly the gap probabilistic embeddings struggle with.
Q-Prime is the foundation. The QAG Engine is the reasoning.
Q-Prime encodes structure. The QAG Engine consumes that structure through the Hilbert-Space Compacting layer, which projects high-dimensional states into seven interpretable signals: Relevance, Conflict, Overlap, Redundancy, Coverage, Coherence, and Topology.
Next layer
QAG Engine →
The reasoning platform that runs on top of Q-Prime. Contradictions surface as explicit signals before generation. Public preview.
Concrete example
Documents QAG →
A concrete blueprint that takes NVIDIA's RAG reference implementation and remakes it as deterministic QAG, powered by Q-Prime.
Questions developers ask
How do I get access to Q-Prime?
Q-Prime is available today on HuggingFace at QGI-dev/q-prime as a managed API under the QGI Commercial Model License v1.0. Access is mediated by the HuggingFace model card — request access and we will follow up with terms.
Can I self-host the weights?
Not today. Q-Prime is distributed as a managed API so that provenance, audit, and license compliance remain intact. Self-hosting is available under enterprise engagement — contact us for terms.
Does Q-Prime work without the QAG Engine?
Yes. Q-Prime can be used as the encoding layer of your existing retrieval pipeline; you get structured output instead of flat similarity. You unlock the full deterministic contract by pairing it with the QAG Engine, which exposes the seven HSC signals.
What hardware do I need to run Q-Prime?
NVIDIA GPUs with CUDA-Q / cuTensorNet support. No quantum hardware required. For most evaluation workloads, a single H100 is sufficient. Production deployments scale horizontally.
How Q-Prime came to market
Q-Prime was announced publicly on April 21, 2026 and shipped as a managed API on HuggingFace. Independent coverage from AiThority (Quantum Algorithm Engine for production AI) and Quantum Zeitgeist (the quantum-structured hypergraph) is linked from the launch note.
Read the launch announcement →Ready to encode your regulated corpus as structure?
Request Q-Prime access on HuggingFace, or talk to us about a pilot that pairs Q-Prime with the QAG Engine inside your workflow.