🛡️ Shield 82M: A PII stripping/filtering model 🛡️

Reddit r/LocalLLaMA / 4/25/2026

📰 NewsSignals & Early TrendsTools & Practical UsageModels & Research

Key Points

  • Shield 82M is a newly released open-source PII filtering model fine-tuned from distilroberta-base for removing personally identifiable information from text in any language.
  • The model replaces sensitive fields such as names, emails, phone numbers, and addresses with placeholders (e.g., [PERSON], [EMAIL], [PHONE], [ADDRESS]).
  • It demonstrates strong performance, reporting total accuracy of about 96%, including multilingual examples (e.g., French) alongside basic English tests.
  • The model is available for users to try via Hugging Face at the provided Shield 82M repository link.
  • The announcement invites community feedback from users who test the model on their own data and use cases.

Hey, r/LocalLLaMA !

I am finally back with a new model: 🛡️ Shield 82M

It's a finetuned version of distilroberta-base and it's able to filter out all types of PII (Personally identifiable information) of texts in any language.

Here are some examples:

1) Test with name ,email and phone:

Original: My name is John Doe. Email: john@example.com. Phone: +49 123 45678.
Protected: My name is [PERSON]. Email: [EMAIL]. Phone: [PHONE].

2) basic test:

Original: I live in Cambridge
Protected: I live in [ADDRESS]

3) French test (multilingual):

Original: Mon e-mail est [jean.dupont@example.fr](mailto:jean.dupont@example.fr) et mon téléphone est +33 6 12 34 56 78.
Protected: Mon e-mail est [EMAIL] et mon téléphone est [PHONE].

So, we see that this model performs really well with a total accuracy of ~96%.

And: it's completely open-source like all my models. :D

If you want to try it out: https://huggingface.co/LH-Tech-AI/Shield-82M

Have fun with it. :-)

See you in the comments. Would really like to get some feedback from you.

submitted by /u/LH-Tech_AI
[link] [comments]