November 17, 2025

Is ChatGPT HIPAA-compliant for law firms’ client intake?

Let’s cut to it: lots of firms want AI in intake, but there’s a big hang‑up—can you use ChatGPT if the conversation touches health info? If your intake captures protected health information (PHI), a wrong move with a public tool can turn into a HIPAA headache fast.

We’re talking BAAs, what counts as PHI, and how these tools store and use data. That’s the stuff that decides whether you’re safe—or not.

Below, I’ll break down when HIPAA hits law firms, where ChatGPT falls short, how to reduce risk with de‑identification, and what a HIPAA‑aligned intake setup looks like. I’ll also show where a purpose‑built option like LegalSoul fits.

TL;DR — Is ChatGPT HIPAA-compliant for client intake?

Short answer: no, not for intake that includes PHI. If a tool might see PHI, you need a Business Associate Agreement (BAA) and firm control over retention, access, and model training. Consumer ChatGPT doesn’t offer a BAA, so putting PHI there is a HIPAA problem, full stop.

Enforcement isn’t theoretical. In 2016, Raleigh Orthopaedic Clinic paid $750,000 because a vendor handled PHI without a BAA. Different context, same rule. For law firms acting as business associates, the bar is identical. The safer path: either keep PHI out through real de‑identification or use an AI intake platform that will sign a BAA and give you encryption, RBAC, audit logs, and “no training on your data.” Also check your malpractice policy—many carriers now ask about AI vendors and BAAs. No BAA, no PHI.

When HIPAA applies to law firms

HIPAA kicks in when a covered entity or a business associate creates, receives, maintains, or transmits PHI. Plenty of firms qualify as business associates when they serve healthcare clients and see PHI to deliver legal work. HHS/OCR has said plainly: attorneys can be business associates, and a BAA is required if you’ll handle PHI.

Common triggers: healthcare regulatory matters, internal investigations, employment cases with medical accommodations, provider defense. Plaintiff PI and med‑mal matters often involve PHI too; even if you’re not a BA, confidentiality and state privacy rules still apply. Web forms, site chat, e‑sign flows, recorded calls—all of these feed intake. If any of that pipes into a general LLM without a BAA, you’ve created risk. HHS has also warned about tracking tech on health pages. Same idea here: if a vendor sees PHI on your behalf, you need a BAA and Security Rule safeguards.

What counts as PHI in intake

PHI = health info tied to one of 18 identifiers. Names, phone numbers, full‑face photos, addresses, account numbers, IP addresses, device IDs—the list goes on. Intake examples: “Jane Doe hurt her wrist, ER on 10/12 at Mercy,” a voicemail transcript with caller ID and treatment dates, or a medical bill upload with account numbers. Images and screenshots can leak identifiers too (metadata is sneaky).

To de‑identify under HIPAA Safe Harbor, you remove all 18 identifiers, or you get an expert determination. That means more than scrubbing names. Dates more specific than the year, geographic units smaller than a state, and unique IDs all have to go. HHS has also noted that an IP address can be PHI when tied to a person’s interaction with a covered entity’s site—relevant if you handle provider intake or host BA‑bound portals. One more gotcha: call analytics and “AI note‑taking” tools can capture biometric identifiers (voice prints), which are on the list. Design intake questions to separate issue spotting from identifiers so AI can help without crossing the PHI line.

How ChatGPT handles data and why it matters for HIPAA

HIPAA is about contracts and controls. No BAA means you don’t send PHI, period. Consumer ChatGPT may retain chats for quality and safety, and you don’t get the logging and control HIPAA expects. Even where an API or enterprise flavor says “no training on your data,” you still need encryption, role‑based access control (RBAC), retention/deletion you control, and incident response.

Ask the basics: Will prompts be used to improve models? Who can see logs? Where does data live? Can we set retention to 0–30 days? Can we prove deletion? You need those answers, in writing. Audit logs and RBAC aren’t busywork—they’re how you investigate when someone pastes the wrong thing into the wrong chat. Also set data loss prevention (DLP) and egress rules so PHI can’t leave your network for non‑BAA endpoints, while letting your approved, BAA‑covered AI keep working.
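To make the DLP/egress idea concrete, here’s a minimal sketch of the kind of check a proxy or gateway could run before intake text leaves your network. The patterns, host allow‑list, and function names are illustrative assumptions, not a real DLP product or vendor API.

```python
import re

# Illustrative patterns only; real DLP coverage should span all 18 HIPAA identifiers.
PHI_PATTERNS = {
    "ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "mrn": re.compile(r"\bMRN[:\s]*\d{6,10}\b", re.IGNORECASE),
    "dob": re.compile(r"\b(?:DOB|date of birth)[:\s]*\d{1,2}/\d{1,2}/\d{2,4}\b", re.IGNORECASE),
}

# Placeholder allow-list: only endpoints covered by a signed BAA.
BAA_COVERED_HOSTS = {"approved-intake.example.com"}

def egress_allowed(prompt_text: str, destination_host: str) -> bool:
    """Allow outbound AI traffic if the destination is BAA-covered,
    or if the payload contains no obvious PHI patterns."""
    if destination_host in BAA_COVERED_HOSTS:
        return True
    return not any(p.search(prompt_text) for p in PHI_PATTERNS.values())

# This request would be blocked: PHI pattern plus a non-BAA endpoint.
print(egress_allowed("Client DOB: 04/12/1980, wrist injury", "api.public-llm.example"))  # False
```

Regex alone won’t catch names, provider details, or free‑text health facts, which is why the de‑identification section below pairs patterns with entity detection.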

Risk scenarios in law firm intake

  • Someone drops a PDF of medical records into a public LLM to “summarize.” That’s a PHI disclosure without a BAA.
  • Your website chatbot forwards messages—names, dates of service, provider details—to an LLM that won’t sign a BAA.
  • Email‑to‑LLM automations push intake replies through a connector that stores messages outside your controls.
  • Tracking pixels on intake pages send IP addresses and page paths that imply treatment; OCR has warned about similar disclosures.

OCR cases show how harsh missing BAAs can be. Raleigh Orthopaedic Clinic and Center for Children’s Digestive Health both got hit mainly for the BAA gap. Firms can make the same mistake with plug‑ins and “free trials.” Another quiet risk: drafting demand letters with medical summaries in a public chat. Treat PHI like payment data—either tokenize (de‑identify) before AI sees it, or make sure the entire chain (chatbot to logs) is covered by a BAA and solid Security Rule controls.

De-identification and PHI minimization strategies

Run a two‑stage intake. Stage 1: fast triage, no identifiers collected. Stage 2: once the matter looks viable, switch to a PHI‑capable flow under a BAA. For existing files, run a PHI redaction pass before AI touches anything: strip the 18 identifiers, swap in stable placeholders like [Patient_A] or [DOS_01], and store the mapping table outside the AI context.

Verify the results. De‑identification isn’t “remove the names and call it a day.” Dates more specific than the year, geographic units smaller than a state, device IDs—all need attention. Mix pattern‑based redaction (regex for SSNs/MRNs) with ML entity detection for the weird stuff (rare conditions, unique events). Spot‑check outputs. For tougher sets, get internal expert review. Add a few “canary” tokens (e.g., FAKE‑NAME‑ALPHA) to make sure nothing re‑inserts identifiers downstream. Keep AI‑ready data in a de‑identified store and lock PHI behind least‑privilege access. If a prompt or integration goes sideways, the blast radius stays small.
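Here’s a minimal sketch of the pattern‑based half of that redaction pass, with stable placeholders and a mapping table kept out of the AI context. The patterns, labels, and sample text are illustrative; a real pipeline layers ML entity detection on top, since regex alone misses names and providers (notice that “Jane Doe” survives this pass).

```python
import re

# Pattern-based pass only; pair with ML entity detection for names,
# providers, and rare conditions that regex will miss.
REDACTION_RULES = [
    ("SSN", re.compile(r"\b\d{3}-\d{2}-\d{4}\b")),
    ("DOS", re.compile(r"\b\d{1,2}/\d{1,2}/\d{2,4}\b")),          # dates of service
    ("PHONE", re.compile(r"\b\d{3}[-.\s]\d{3}[-.\s]\d{4}\b")),
]

def redact(text: str):
    """Swap matched identifiers for stable placeholders; return the text
    plus the mapping table, which is stored outside the AI context."""
    mapping, counters = {}, {}

    def _make_repl(label):
        def repl(match):
            counters[label] = counters.get(label, 0) + 1
            placeholder = f"[{label}_{counters[label]:02d}]"
            mapping[placeholder] = match.group(0)
            return placeholder
        return repl

    for label, pattern in REDACTION_RULES:
        text = pattern.sub(_make_repl(label), text)
    return text, mapping

deidentified, key_table = redact(
    "Jane Doe, SSN 123-45-6789, seen 10/12/2025, call 919-555-0101"
)
print(deidentified)  # "Jane Doe, SSN [SSN_01], seen [DOS_01], call [PHONE_01]"
# key_table stays in the locked-down PHI store, never in a prompt.
# Plant canary tokens (e.g., FAKE-NAME-ALPHA) in test records and alert
# if they ever resurface re-identified downstream.
```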

Requirements checklist for a HIPAA-aligned AI intake tool

  • A signed BAA with clear scope and a transparent subprocessor list
  • Encryption in transit and at rest, tenant isolation, and optional data residency
  • “No training on your data,” short, configurable retention (0–30 days), and verifiable deletion
  • RBAC with SSO/MFA and granular permissions
  • Detailed audit logs (access, prompts, outputs, admin actions) you can export
  • PHI detection/redaction guardrails and prompt rules to prevent over‑collection
  • Defined incident response SLAs and breach notification steps

Tie each control to the HIPAA Security Rule (164.308 administrative, 164.312 technical). Logs and RBAC let you answer “who saw what, when,” which OCR loves to ask. One detail many firms miss: retention needs to match statutes of limitations and legal holds. If the tool can’t pause deletion for a hold, you risk spoliation. If it can’t delete on schedule, you risk over‑retention. Also check that your vendor can segregate PHI by matter and practice group to enforce least‑privilege access.
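As a quick illustration of that retention/legal‑hold interaction, here’s a minimal sketch of a deletion scheduler that pauses for holds and deletes on schedule otherwise. The record fields and 30‑day window are assumptions, not any vendor’s actual API.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta

# Hypothetical intake record; field names are illustrative.
@dataclass
class IntakeRecord:
    matter_id: str
    created_at: datetime
    legal_hold: bool

RETENTION = timedelta(days=30)  # example retention window agreed with the vendor

def records_to_delete(records, now=None):
    """Delete on schedule unless a legal hold applies: holds pause deletion
    (avoiding spoliation); everything else expires (avoiding over-retention)."""
    now = now or datetime.now()
    return [
        r for r in records
        if not r.legal_hold and now - r.created_at > RETENTION
    ]
```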

When limited use of ChatGPT may be acceptable

ChatGPT is fine for non‑PHI work: marketing copy, high‑level research, internal SOPs. Write it down in your policy: no client identifiers, no health facts. For early triage, use “PHI‑free” prompts that ask only broad questions like “What happened?” “Which state?” “Type of matter?” and tell users not to include names, DOBs, provider names, or images. Back that up with DLP that blocks patterns like MRNs or SSNs.

Using ChatGPT with PHI requires a BAA and tighter controls than the consumer product offers. A useful trick: prepend an always‑on “PHI firewall” instruction so the model refuses inputs with health identifiers. Log the refusals, then coach the folks who keep trying. When doing research, keep facts and citations, but swap client details for placeholders. Audit usage regularly—pull logs, talk to power users, and verify your AI policy is working in real life.
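Here’s a minimal sketch of that firewall‑plus‑logging pattern: an always‑on instruction prepended to every request, backed by a local pre‑screen that logs refusals for coaching. The instruction text, patterns, and function names are assumptions to adapt, not a specific vendor’s API.

```python
import logging
import re

logging.basicConfig(filename="ai_policy.log", level=logging.INFO)

# Always-on instruction prepended to every request sent to the approved model.
PHI_FIREWALL = (
    "Refuse any input that appears to contain protected health information "
    "(names paired with health facts, dates of service, MRNs, SSNs, provider "
    "names). Ask the user to resubmit with placeholders instead."
)

# Local pre-screen mirrors the firewall so obvious PHI never leaves the firm.
PHI_HINTS = re.compile(r"\b(MRN|SSN|DOB|date of birth)\b", re.IGNORECASE)

def guarded_prompt(user: str, text: str):
    """Return (system_instruction, text) if the input passes the local screen;
    otherwise log a refusal for coaching and return None."""
    if PHI_HINTS.search(text):
        logging.info("PHI firewall refusal: user=%s", user)  # log who, not what
        return None
    return PHI_FIREWALL, text
```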

Governance for AI in your firm

Create an AI acceptable‑use policy that connects directly to HIPAA and confidentiality duties (ABA Model Rules 1.1, 1.6, 5.3). Specify approved tools, prohibited data types, and the rule: any PHI goes only into a BAA‑covered platform. Train everyone on what PHI looks like with real intake examples, plus quick demos on de‑identifying before using AI. Add DLP and outbound filtering so PHI can’t slip to non‑BAA tools.

Do vendor risk reviews yearly. Confirm BAAs, read security reports, check subprocessor lists. Many malpractice carriers now ask about AI governance, and having it documented helps. Supervision matters too: partners should review how AI outputs get used in letters, discovery, and client messages. Give people a fast approval path for new tools so they don’t go rogue, and spin up a small AI review group (IT, risk, one practice lead) to test prompts, hammer on PHI guardrails, and keep a living playbook aligned with HHS/OCR guidance.

Implementation roadmap for compliant AI intake

  • Map your intake. Track where identifiers enter (web, phone, email, portals) and where they land.
  • Decide: de‑identify before AI, or process PHI in a BAA‑covered platform. Many firms do both: triage de‑identified, heavy records under a BAA.
  • Pick a HIPAA‑aligned AI intake tool, sign the BAA, and document retention, training, and access settings.
  • Configure SSO/MFA, RBAC by practice group, “no training on your data,” short retention, and PHI guardrails.
  • Pilot with one PI or med‑mal team. Track conversion, time to qualify, and PHI escapes (target: zero), plus user feedback.
  • Review logs weekly; sample conversations for policy drift and coach quickly.
  • Roll out firmwide with training, DLP enforcement, and quarterly audits.

For PI intake especially, align AI retention with matter lifecycle and legal holds. Do a pre‑mortem: assume a prompt with PHI leaked. Could you trace it, respond fast, notify properly, and show minimal exposure based on your logs, DLP, and contracts?

How LegalSoul supports HIPAA-aligned intake automation

LegalSoul is built for firms that want AI in intake without HIPAA drama. We sign a BAA and run HIPAA‑aligned processing: encryption in transit and at rest, tenant isolation, and optional regional data residency. Default stance: “no training on your data,” short, configurable retention—even zero—and verifiable deletion. You get RBAC, SSO/MFA, and detailed audit logs that actually answer questions.

Workflows include PHI detection/redaction, prompt guardrails that prevent over‑collection, and a two‑stage intake: quick triage with no identifiers, then a secure PHI capture flow. We integrate with your DMS/CRM under signed DPAs and preserve chain‑of‑custody for file moves. Admin dashboards flag policy issues in real time (like attempted PHI in a non‑PHI prompt) so you can coach fast. Multi‑jurisdiction practice? We support data residency and segment PHI by practice group to enforce least privilege. Think HIPAA‑friendly AI intake for attorneys, with the documentation you’ll want if OCR calls.

FAQs

  • Do we always need a BAA for AI? Only when PHI is processed. If intake could pair identifiers with health info, treat it as PHI and require a BAA.
  • Can client consent replace a BAA? No. HIPAA still requires a BAA when a vendor handles PHI for you.
  • Is de‑identification enough? Yes, if it meets Safe Harbor or expert determination. Remove all 18 identifiers, not just names.
  • Are call transcripts PHI? If they combine identifiers (caller ID, names) with health info, yes. Voice prints count as biometric identifiers.
  • If we disable training, are we good? Not by itself. You still need a BAA, logs, RBAC, encryption, retention controls, and incident response.
  • What about tracking pixels on intake pages? HHS has warned these can cause impermissible disclosures for covered entities; firms acting as business associates should avoid them on PHI pages.
  • Can we use ChatGPT for marketing? Yes—no client identifiers, no health facts, with a clear policy and DLP. For intake or records, use a HIPAA‑aligned platform.

Quick Takeaways

  • Consumer ChatGPT isn’t OK for PHI intake—no BAA, thin auditability, and not enough control over retention or training.
  • If intake might capture PHI, either de‑identify carefully before using AI or choose a platform with a BAA, encryption, RBAC, audit logs, and “no training on your data.”
  • Build guardrails: a clear AI policy, training on PHI, DLP/egress controls, short retention, and regular audits across forms, chat, calls, and email flows.
  • A two‑stage intake (PHI‑free triage, then secure PHI capture), or a purpose‑built platform like LegalSoul, gets you on the safer path with signed BAAs, data isolation, retention controls, PHI guardrails, and solid logs.

Conclusion and next steps

Consumer ChatGPT and PHI intake don’t mix. If your firm handles health info, either de‑identify thoroughly or use an AI platform that will sign a BAA and give you encryption, RBAC, audit logs, short retention, and no training on your data. Put an AI policy in place, add DLP, and move to a two‑stage intake flow.

Want the benefits without the risk? Book a 20‑minute LegalSoul demo. We’ll walk your intake, flag the gaps, and spin up a pilot that improves qualification speed, keeps clients happy, and stays defensible if anyone comes asking.
