Veridical Tech

// Research Docs

GTU Overview

Public-safe summary of what GTU is for, what failure modes it is evaluated against, and how this site frames proof.

What GTU is

Project GTU is presented here as a context optimization program for fact-sensitive long-context work. The public claim is not that GTU wins every possible quality comparison. The public claim is that it can be evaluated under bounded prompt budgets against specific factual retention tasks.

What GTU is tested against

The public test surface focuses on conditions that matter in practice: early facts buried under noise, similar-fact confusion, environment separation, configuration lookup, long debug context, and long-range planning constraints.

  • Does the system preserve the right fact under long noisy history?

  • Does it keep similar facts separated instead of contaminating answers?

  • Does it remain useful when history size becomes operationally extreme?

  • How does quality move as prompt budget changes across operating points?

Public proof standard

This site uses a black-box standard. The public discussion stays on benchmark setup, datasets, operating points, and results. Internal implementation detail is intentionally out of scope.

Primary benchmark

125 balanced black-box cases

Ultra-long validation

4 cases above 0.8M baseline tokens

Compared surfaces

full history, sliding window, GTU operating points