🛡️ Shield 82M: A PII stripping/filtering model 🛡️

Reddit r/LocalLLaMA / 4/25/2026

📰 NewsSignals & Early TrendsTools & Practical UsageModels & Research

共有:

Key Points

Shield 82M is a newly released open-source PII filtering model fine-tuned from distilroberta-base for removing personally identifiable information from text in any language.
The model replaces sensitive fields such as names, emails, phone numbers, and addresses with placeholders (e.g., [PERSON], [EMAIL], [PHONE], [ADDRESS]).
It demonstrates strong performance, reporting total accuracy of about 96%, including multilingual examples (e.g., French) alongside basic English tests.
The model is available for users to try via Hugging Face at the provided Shield 82M repository link.
The announcement invites community feedback from users who test the model on their own data and use cases.

I am finally back with a new model: 🛡️ Shield 82M

It's a finetuned version of distilroberta-base and it's able to filter out all types of PII (Personally identifiable information) of texts in any language.

Here are some examples:

1) Test with name ,email and phone:

Original: My name is John Doe. Email: john@example.com. Phone: +49 123 45678.
Protected: My name is [PERSON]. Email: [EMAIL]. Phone: [PHONE].

2) basic test:

Original: I live in Cambridge
Protected: I live in [ADDRESS]

3) French test (multilingual):

Original: Mon e-mail est [jean.dupont@example.fr](mailto:jean.dupont@example.fr) et mon téléphone est +33 6 12 34 56 78.
Protected: Mon e-mail est [EMAIL] et mon téléphone est [PHONE].

So, we see that this model performs really well with a total accuracy of ~96%.

And: it's completely open-source like all my models. :D

If you want to try it out: https://huggingface.co/LH-Tech-AI/Shield-82M

Have fun with it. :-)

See you in the comments. Would really like to get some feedback from you.

submitted by /u/LH-Tech_AI
[link] [comments]