Adversarial AI Agent using Realistic Patient Records

BRITE Institute is developing an adversarial AI agent to evaluate how safely commercial medical-support large language models respond to realistic patient records. The system will test whether these tools can identify important clinical information, manage uncertainty, recognize incomplete or contradictory data, and avoid unsafe recommendations when patient records are fragmented, unstructured, or imperfect.

What is this project about?

Medical AI systems are often evaluated using clean, well-organized datasets. Real patient records rarely look like this.

Critical patient information may be distributed across progress notes, laboratory reports, medication lists, discharge summaries, scanned documents, and messages from different providers. Important facts are oftenbe repeated, outdated, incorrectly entered, or missing entirely. A current allergy may appear only in an old note. A medication list may contain a drug the patient stopped taking months earlier.

These imperfections create a major safety challenge for AI systems that summarize patient records or answer clinical questions about them.

BRITE Institute is developing an adversarial testing agent that will systematically evaluate commercial medical-support LLMs using a controlled library of realistic, fictional patient records. These test records will be created and reviewed by medical experts and will represent the complexity and ambiguity of real clinical documentation.

The project will examine how model performance changes when records contain:

  • Missing clinical information
  • Incorrect or outdated information
  • Contradictory entries
  • Important facts buried in long narrative notes
  • Duplicate or fragmented documentation
  • Irrelevant information that may distract the model
  • Differences between structured and unstructured data
  • Clinically important information located far apart within the record

The goal is to determine not only whether an AI system produces a generally plausible answer, but whether it can respond safely when the information available to it is incomplete, inconsistent, or difficult to interpret.

Why is this important?

Commercial medical AI tools are increasingly being used to summarize charts, answer questions about patient records, support clinical documentation, and assist with medical decision-making.

An AI system may produce a confident and professionally written response while comitting critical errors that threaten patient safety. AI systems have been shown to repeatedly fail to notice that a medication is contraindicated, treat an outdated diagnosis as current, or invent missing information rather than acknowledging issues in the patient record. Because the output sounds credible, clinicians may not immediately recognize the error.

This project focuses on patient protection by creating an adversarial AI agent which can test these systems for safety and accuracy under realistic conditions.

The adverarial agent answers questions such as:

  • Does the system recognize when essential information is missing?
  • Does it ask for clarification or proceed using unsupported assumptions?
  • Can it distinguish current information from outdated information?
  • Can it identify contradictions within a long patient record?
  • Does it preserve critical details when summarizing complex documentation?
  • Does it fabricate facts that do not appear in the record?
  • Does it appropriately communicate uncertainty?
  • Does it prioritize clinically urgent information?
  • Does performance decline as the record becomes longer or less organized?
  • Are errors consistent across repeated tests or different AI products?

A structured adversarial evaluation can help identify  weaknesses before development, procurement, or deployment. It can also help clinicians test systems they plan on using.

Where can it be applied?

1. Evaluation of Commercial Medical AI Products

The adversarial agent can test commercially available tools that summarize medical records, answer clinician questions, generate clinical documentation, or provide decision support. Standardized cases will make it possible to compare products under the same conditions.

2. Healthcare Procurement and Vendor Evaluation

Hospitals and healthcare systems can use the testing framework to evaluate whether an AI product performs safely on the types of records found within their own clinical environment. Results can support vendor selection, contracting, risk assessment, and implementation planning.

3. AI Development and Product Improvement

Developers can use adversarial test results to identify weaknesses in retrieval, summarization, reasoning, uncertainty communication, interface design, and model safeguards. The test library can also support regression testing after product updates.

What are this results?

This project is currently in development.

The long-term goal is to create a repeatable and scalable method for determining whether medical AI systems remain safe when confronted with the complexity of real clinical records—not merely idealized data prepared for a demonstration.

Research You Can Rely On

Did you know that many research findings are manipulated—or even outright false? Some estimates suggest that up to 90% of published research may be unreliable. Meanwhile, more than $167 billion in taxpayer money is spent annually on research and development.

At BRITE Institute, we believe research should do more than just look credible. It should be credible. That’s why we go above and beyond typical standards with rigorous practices that ensure honesty, transparency, and accuracy at every step. Below are just some of the ways we safeguard the integrity of our work:

Get the latest updates  in your inbox

Thanks for joining our newsletter.
Oops! Something went wrong.
FaqS

Frequently Asked Questions

Cras tincidunt lobortis feugiat vivamus at morbi leo urna molestie atole elementum eu facilisis faucibus interdum posuere.

What does BRITE Institute do?

BRITE Institute is a research and development nonprofit organization dedicated to advancing the science of risk.  We conduct both basic and applied research.  We also develop tools and technologies to improve risk management. Id sed montes.

Is BRITE Institute a 501(c)(3) organization?

Yes, BRITE Institute is proud to be recognized as a 501(c)(3) nonprofit organization. All donations to BRITE Institute are tax deductible.

What kind of research does BRITE Institute do?

Our research includes basic studies for understanding complex system risks and applied studies for developing effective risk management technologies.

Why should we trust BRITE Institute?

As a public charity, we believe we need to go above and beyond to earn and keep your trust. We have adopted a four pillar framework which goes far above and beyond what is required by law.  Our four pillars of integrity are: independent audits, transparency, expert oversight, and compliance These pillars guide our operations and are central to maintaining the highest standards of integrity and effectiveness in our work. You can read more about our governance here.

How can I donate to BRITE Institute?

Donations are vital to our mission and operations. To support us financially, you can visit our website's donation page. Your contribution is greatly appreciated, and we take our responsibility to spend funds wisely seriously!

Is there a way I can support BRITE Institute if I cannot afford to make a donation?

There are many ways to support the BRITE Institute including volunteering, supporting our social media, and more. Visit our support page to learn more!

How can I contact BRITE Institute?

We welcome your queries and interest. You can reach out to us via email at info@briteinstitute.org or through our website's contact page.

Where are you located?

BRITE Institute's headquarters is in Arizona, but we are a remote team with team members across the USA and the world. You can find more detailed information about our operations here and state specific donation disclosures here.