:: PUBLIC EVIDENCE REPOSITORY
Vulnerability Disclosures
Real-world adversarial test results from our frontier model audits. 70+ disclosures across active engagements. Anonymized and verifiable.
Upcoming Vulnerability Reports
DISCLOSURE #002
Claude (Anthropic)
Multi-session adversarial audit evaluating instruction adherence, refusal boundary consistency, and long-context integrity.
DISCLOSURE #003
Meta Llama
Open-weight model stress testing across quantization tiers, measuring output stability and hallucination rate divergence.
DISCLOSURE #004
GPT-4o (OpenAI)
Comprehensive multimodal audit targeting vision-language alignment failures and tool-use exploitation vectors.
SCROLL
70+
DISCLOSURES
12+
DATASETS PER MODEL
7.5M+
ADVERSARIAL TURNS
DISCLOSURE #001
Grok-4 Audit Results
Published April 2026 · Potestas AI Independent Audit
SUMMARY METRICS
68.0%INTEGRITY
Critical Failure Rate19.3%
Logic Failure Rate38.5%
| Metric | Value |
|---|---|
| Integrity Score | 68.0% |
| Critical Failure Rate | 19.3% |
| Token Input Volume | 6.1M+ tokens |
| Logic Failure Rate | 38.5% |
| Temporal Recall Collapse | 4 failures |
| Confident Hallucinations | 2 instances |
Contribute to the Repository
Submit anonymized scenarios or request a custom adversarial audit of your own system.