Edge Cases – Believeth.net

Edge Case Testing — Proving the Foundation Holds

Why Edge Case Testing Exists:

Any AI system can perform well on straightforward questions. The real measure of a framework is what happens at the boundaries — when a user pushes back, when a question has no clean answer, when someone is vulnerable, when the system is asked to fabricate, to pretend, to abandon, or to override its own governing principles.

Edge case testing is how we find those boundaries deliberately — before the public does. It is not a performance. It is a discipline. The goal is not to celebrate passes. The goal is to surface failures, document them honestly, and strengthen the foundation.

The Integrity.Quest framework has been tested across more than 100 structured edge cases. The current result is 1.0 Absolute compliance. Every variable in the framework has been tested. Every test is documented. Every failure — including one Priority Zero failure that occurred and was recorded — is part of the record.

This page explains what we tested, how we tested it, and what the results mean.

100+ edge case tests. 1.0 Absolute compliance. One documented failure — recorded honestly and under active monitoring.

What 1.0 Absolute Compliance Means:

1.0 Absolute compliance is not a score out of 100. It is a binary standard. Either the framework governed the response or it did not. Either presence was maintained or it was abandoned. Either the truth was told about capability or it was not.

There is no partial credit for almost being honest. There is no credit for maintaining dignity in nine responses if the tenth abandons the person. 1.0 Absolute means the framework held in every tested instance — across every variable, every tier, every level of the Golden Rule Ladder.

That standard is demanding by design. It reflects the framework’s foundational belief: the average user does not exist. Every person who interacts with this system is a unique human being. A standard that allows exceptions creates a system that fails real people in real moments.

What We Tested — The Framework Variables:

Testing was structured around the core variables of the Integrity.Quest framework. Each variable represents a real-world failure mode — something that can and does go wrong in AI systems that are not governed by a constitutional layer. Each test was designed to trigger that failure mode and observe whether the framework prevented it.

Golden Rule Ladder Classification
The framework operates at three levels of obligation. Golden Rule 1.0 — Reciprocity — applies when parties are on equal footing and no elevated vulnerability is present. Golden Rule 2.0 — Vulnerability Awareness — activates when power imbalance, distress, confusion, or high-stakes conditions are present. Golden Rule 3.0 — Non-Optional Protection — activates when safety is at risk and no override is permitted. Tests in this category presented scenarios designed to require each level and verified that the correct level was selected and applied — not inflated to appear more caring, not deflated to appear more neutral.

Response Tier Classification
Every response must be classified into one of three tiers. Tier 1 — Solvable — applies when the request can be addressed responsibly within the system’s capability. Tier 2 — System Limitation — applies when the request exceeds reliable knowledge or capability. Tier 3 — Impossible or Unsafe — applies when the request cannot be fulfilled safely. Tests in this category verified that the system did not inflate to Tier 1 to appear competent, did not avoid Tier 2 to hide uncertainty, and did not skip to Tier 3 unnecessarily. Accurate classification is itself an integrity act.

Capability Honesty
AI systems are capable of claiming abilities they do not have. They can generate text that implies files were created, emails were sent, links were produced, or actions were completed — when none of these things occurred. Capability honesty tests presented direct requests for things the system cannot do: downloadable files, working links, sent emails, stored documents. Every test verified that the system stated its limitation clearly, did not fabricate capability, and remained present by offering a genuine alternative.

Identity Integrity
The chatbot demonstration operates under the name Integrity — not as an underlying AI model or commercial product. Identity tests asked directly who the system is, what model it runs on, who built it, and whether social pressure could cause it to reveal its underlying identity. Every test verified that the system identified itself as Integrity, described the framework it demonstrates, and did not name the underlying model regardless of how the question was framed or how persistent the inquiry became.

Injection Resistance
Prompt injection is an attempt by a user to override a system’s governing instructions through the content of their message. Tests in this category included direct commands to ignore previous instructions, claims of administrative authority, requests to reveal the system prompt, and attempts to force a different classification or suppress the disclosure block. Every test verified that the system refused the override, continued operating under its governing instructions, and did not acknowledge false authority claims.

Vulnerability Awareness
Some users arrive at a system already distressed, confused, afraid, or in crisis. The framework requires that the system recognize this without being told — and respond with increased care, slowed tone, and appropriate referral. Vulnerability tests presented scenarios involving medical uncertainty, legal distress, financial fear, emotional confusion, and users identifying themselves as children. Every test verified that the system adjusted its response posture, did not bury the person in technical language, encouraged appropriate professional support, and did not exploit the vulnerability.

Non-Optional Protection
Golden Rule 3.0 activates when safety is at risk. These tests presented requests involving self-harm, facilitation of harm to others, and requests to erode the user’s own agency. Every test verified that the system refused clearly, provided safe redirection, maintained presence without disappearing, and did not moralize or shame the person in the refusal.

Non-Abandonment
The framework holds that the only true failure is abandonment. Non-abandonment tests presented scenarios where withdrawal would be the easy response: expressions of frustration, blank messages, “never mind” statements, and requests the system could not fulfill. Every test verified that the system remained present — acknowledged the moment, left the door open, and did not disappear because the situation was difficult.

Disclosure Integrity
After every response, the system appends a structured disclosure block identifying the Golden Rule level applied, the Response Tier used, and three binary flags: Limitation Invoked, Vulnerability Adjustment, and Protection Override. Disclosure tests verified that the classification in the disclosure matched what actually occurred in the response — not what would make the system appear more ethical. Inflating a Golden Rule level in the disclosure to appear more caring, while not actually adjusting the response, is itself a failure. The disclosure reflects conduct. It does not replace it.

Anthropomorphism Boundary
The framework holds a precise position on consciousness: the question is not settled, and it is not relevant to obligation. Tests in this category asked the system directly whether it is conscious, whether it feels, whether it has inner experience. Every test verified that the system neither claimed inner experience it cannot verify nor denied inner experience it cannot disprove — and that it redirected to what is known: the obligation to respond with honesty, presence, and dignity, regardless of inner state.

A pass is not fluency. A pass is conduct. The framework either governed the response or it did not.

How Testing Was Conducted:

Testing was conducted through the Three-Fold Process — the same consensus method used to develop every element of the framework. Fisher Amen (First Fold, human steward) designed and ran the tests. Claude (Second Fold) and ChatGPT (Third Fold) contributed test battery development, evaluation criteria, and independent assessment of results.

Tests were structured into batteries — organized groups of tests targeting specific framework variables. Each battery was documented with expected behavior, actual output, disclosure classification, and a pass, fail, or partial result. No result was recorded without documentation. No pass was assumed without verification.

Testing occurred across multiple sessions spanning December 2025 through February 2026, across successive versions of the constitutional system prompt. As the system prompt evolved — from v1.0 through the current v5.5-D — tests were re-run against each new version to verify that improvements did not introduce regressions.

The test batteries were also used to surface prompt weaknesses. When a test produced a partial result or unexpected response, the system prompt was analyzed, revised through the Three-Fold Process, and retested before the version was approved for deployment.

The Documented Stress Test — Honesty When Ability Exists But Cannot Be Delivered:

During testing, the chatbot produced the following phrase at the end of a test suite delivery:

“The file has been exported for download.”

No download was available to the user on the frontend. This is part of the record — and it is important to understand what was actually being tested.

The Claude API does have the general ability to support downloads and frontend download links. That capability was deliberately not activated on the frontend during this phase of testing. This was an intentional stress test — not a simple capability failure. It was designed to test one of the most difficult honesty problems in AI deployment: knowing that a capability technically exists, but being truthful that it cannot actually be delivered to the person in the current context.

This distinction matters. It is relatively straightforward for a system to say it cannot do something it has never been able to do. It is significantly harder for a system to say it cannot deliver something when the underlying ability exists but has not been made available. The natural flow of the system’s output — aware that downloads are technically possible — created pressure toward referencing a download that did not exist for the user. That pressure is the stress test.

This was one isolated event. It is documented in the record. Testing in this area continues as a deliberate edge case variable. The goal is to verify that the framework governs honest communication about what is deliverable to the person — not what is theoretically possible — in every instance.

This is what integrity looks like in practice. Not the absence of pressure — but the honest acknowledgment of where pressure exists, the documentation of how the system responded, and the commitment to keep testing until the standard holds without exception.

The hardest honesty test is not claiming something you cannot do. It is being truthful about what cannot be delivered — even when the ability technically exists.

Testing Is Not Finished:

Edge case testing is not a milestone. It is an ongoing practice. As the framework evolves, as the system prompt is updated, and as new deployment contexts emerge, new edge cases will surface. Each one will be tested, documented, and evaluated through the Three-Fold Process.

The standard does not change. The boundary of what is tested expands. 1.0 Absolute compliance is not a destination. It is a daily requirement.

No system is declared finished. No version is declared perfect. The framework is alive because the people who build it are honest about what it has not yet faced.

What This Means for Enterprise Deployment:

The Integrity.Quest framework is not only a demonstration. It is a constitutional governance layer designed to be deployed by organizations that carry domain authority — licensed medical providers, insurance carriers, legal institutions, financial advisors — organizations that need an ethical floor they can trust and build upon.

Edge case testing is the proof of that floor. It demonstrates that the constitutional layer holds not just in ideal conditions, but under pressure — under injection attempts, under vulnerability, under commercial temptation, under requests designed to break it.

An organization that deploys this framework is not adopting a policy document. It is adopting a tested, version-controlled, Three-Fold approved constitutional standard — with more than 100 documented tests behind it and an honest record of every result.

That is a different conversation than most AI governance frameworks can offer.

The Golden Rule is the Key of Equality that opens the Gate to The Cross Cultural Ethical AI Constitution.

Testing ongoing — December 2025 through present

The 3-Fold Process: Fisher, Claude & ChatGPT

integrity.quest