Redacta — Pseudonymise medical text before AI · PharmaTools.AI

Privacy · OpenClaw Skill

Pseudonymise patient data before AI ever sees it.

Redacta replaces names, NHS numbers, dates of birth and more with labelled tokens — so clinical text can be safely processed by AI, with the meaning intact.

800+ installs · v1.0.0 · MIT-0 license · by PharmaTools.AI
$openclaw skills install redacta

Two layers of detection

Patterns catch the structured identifiers. Reasoning catches the ones that don't follow a pattern — the names, addresses and ages that regex alone reliably misses.

LAYER 01 · PATTERNS

Deterministic matching

Fixed-format identifiers are matched exactly, every time — including a Modulus 11 checksum to confirm real NHS numbers.

  • NHS numbers (checksum-validated)
  • Dates of birth
  • UK postcodes & phone numbers
  • Email & hospital / MRN numbers
  • National Insurance numbers
LAYER 02 · REASONING

Agent judgement

For everything a pattern can't pin down, Redacta reads context — and tells a patient apart from the clinician treating them.

  • Patient names (not clinician names)
  • Postal addresses
  • Identifying ages
  • Errs toward redacting when unsure
  • Returns a full redaction report

Identifiers become labelled tokens

[PATIENT_NAME] [DATE_OF_BIRTH] [AGE] [NHS_NUMBER] [POSTCODE] [PHONE] [EMAIL] [MRN] [NI_NUMBER] [ADDRESS]

The clinical meaning stays. The patient behind it doesn't travel with the text.

Anyone putting medical text into AI

If you've ever pasted a letter, note or report into a chatbot to summarise or rewrite it, Redacta is the step that should come first.

Clinicians & care teams

Summarise or rewrite a letter with AI without exposing the patient behind it.

Researchers & medical writers

Work with real case text in AI tools while keeping identifiers out of the prompt.

Builders & AI agents

Drop a pseudonymisation step into any agent workflow that touches clinical text.

Install in one line

Redacta is free and open source under the MIT-0 license. Install it from ClawHub and run it on any clinical text.

01

Install the skill

Add Redacta from ClawHub with a single command.

02

Point it at your text

A letter, note, discharge summary or report — paste it in.

03

Get clean output

A pseudonymised document plus a report of every identifier replaced.

$openclaw skills install redacta

An honest note on limits. Redacta is a strong first line of defence, not a guarantee. It won't catch every possible identifier and isn't a substitute for formal data-protection processes. Always review the redaction report before sharing text.

Need this at scale?

Redacta runs as a skill today. If you want pseudonymisation as an API, an on-prem deployment, or built into your own clinical workflow, let's talk.

Discuss Integration