Redacting Text in Amazon Kinesis Data Firehose

Amazon Kinesis Data Firehose is a managed streaming service designed to take large amounts of data from one place to another. For example, you can take data from sources such as Amazon CloudWatch, AWS IoT, and custom applications using the AWS SDK to destinations such as Amazon S3, Amazon Redshift, Amazon Elasticsearch, and other services. In this post we will use Amazon S3 as the firehose’s destination.

In some cases you may need to manipulate the data as it goes through the firehose to remove sensitive information. In this blog post we will show how Amazon Kinesis Firehose and AWS Lambda can be used in conjunction with Philter to remove sensitive information (PII and PHI) from the text as it travels through the firehose.

Philter is software that redacts PII, PHI, and other sensitive information from text. Philter runs entirely within your private cloud and does not require any external connectivity. Your data never leaves your private cloud and is not sent to any third-party. In fact, you can run Philter without any external network connectivity and we recommend doing so!

Prerequisites

You must have a running instance of Philter. If you don’t already have a running instance of Philter, you can launch one through the AWS Marketplace. There are CloudFormation and Terraform scripts for launching a single instance of Philter or a load-balanced, auto-scaled set of Philter instances.

It’s not required that the instance of Philter be running in AWS, but it is required that the instance of Philter be accessible from your AWS Lambda function. Running Philter and your AWS Lambda function in your own VPC allows the function to communicate with Philter locally. This keeps your sensitive information off the public internet and keeps the network traffic inside your VPC.

Setting up the Amazon Kinesis Firehose Transformation

There is no need to duplicate an excellent blog post on creating an Amazon Kinesis Firehose Data Transformation with AWS Lambda. Instead, refer to the linked page and substitute the Python 3 code below for the code in that blog post.

Configuring the Firehose and the Lambda Function

To start, create an Amazon Kinesis Data Firehose delivery stream and configure an AWS Lambda transformation. When creating the AWS Lambda function, select Python 3.7 and use the following code:

import base64

# botocore's vendored requests is available in the Python 3.7 Lambda runtime;
# in newer runtimes, package the requests library with your function instead.
from botocore.vendored import requests


def handler(event, context):

    output = []

    for record in event['records']:

        # Decode the incoming record's data.
        payload = base64.b64decode(record['data'])
        headers = {'Content-type': 'text/plain'}

        # Send the text to Philter for redaction. We are ignoring Philter's self-signed certificate here.
        r = requests.post("https://PHILTER_IP:8080/api/filter", verify=False, data=payload, headers=headers, timeout=20)
        filtered = r.text

        # Re-encode the redacted text and mark the record as successfully processed.
        output_record = {
            'recordId': record['recordId'],
            'result': 'Ok',
            'data': base64.b64encode(filtered.encode('utf-8') + b'\n').decode('utf-8')
        }

        output.append(output_record)

    return {'records': output}

The following Kinesis Firehose test event can be used to test the function:

{
  "invocationId":"invocationIdExample",
  "deliveryStreamArn":"arn:aws:kinesis:EXAMPLE",
  "region":"us-east-1",
  "records":[
    {
      "recordId":"49546986683135544286507457936321625675700192471156785154",
      "approximateArrivalTimestamp":1495072949453,
      "data":"R2VvcmdlIFdhc2hpbmd0b24gd2FzIHByZXNpZGVudCBhbmQgaGlzIHNzbiB3YXMgMTIzLTQ1LTY3ODkgYW5kIGhlIGxpdmVkIGF0IDkwMjEwLiBQYXRpZW50IGlkIDAwMDc2YSBhbmQgOTM4MjFhLiBIZSBpcyBvbiBiaW90aW4uIERpYWdub3NlZCB3aXRoIEEwMTAwLg=="
    },
    {
      "recordId":"49546986683135544286507457936321625675700192471156785154",
      "approximateArrivalTimestamp":1495072949453,
      "data":"R2VvcmdlIFdhc2hpbmd0b24gd2FzIHByZXNpZGVudCBhbmQgaGlzIHNzbiB3YXMgMTIzLTQ1LTY3ODkgYW5kIGhlIGxpdmVkIGF0IDkwMjEwLiBQYXRpZW50IGlkIDAwMDc2YSBhbmQgOTM4MjFhLiBIZSBpcyBvbiBiaW90aW4uIERpYWdub3NlZCB3aXRoIEEwMTAwLg=="
    }
  ]
}

This test event contains two records, each with base64 encoded sample text containing sensitive values such as an SSN and a ZIP code. When the test is executed, the redacted text produced for the records will look like:

[
  "He lived in {{{REDACTED-zip-code}}} and his SSN was {{{REDACTED-ssn}}}.",
  "He lived in {{{REDACTED-zip-code}}} and his SSN was {{{REDACTED-ssn}}}."
]

When running the test, the AWS Lambda function will extract the data from each record in the firehose, submit it to Philter for filtering, and place the base64 encoded redacted text back into the record. Note that in our Python function we are ignoring Philter’s self-signed certificate (verify=False). It is recommended that you use a valid signed certificate for Philter.
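
If you want to test with your own sample text, base64 encode it and place it in a record's data field. A quick way to produce the encoded value in Python:

import base64

# Encode your own sample text for use in the "data" field of a test record.
text = "He lived in 90210 and his SSN was 123-45-6789."
print(base64.b64encode(text.encode('utf-8')).decode('utf-8'))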

When data is now published to the Amazon Kinesis Data Firehose stream, the data will be processed by the AWS Lambda function and Philter prior to exiting the firehose at its configured destination.

Processing Data

We can use the AWS CLI to publish data to our Amazon Kinesis Firehose stream called sensitive-text:

aws firehose put-record --delivery-stream-name sensitive-text --record 'Data=He lived in 90210 and his SSN was 123-45-6789.'

Check the destination S3 bucket and you will have a single object with the following line:

He lived in {{{REDACTED-zip-code}}} and his SSN was {{{REDACTED-ssn}}}.

Conclusion

In this blog post we have created an Amazon Kinesis Data Firehose pipeline that uses an AWS Lambda function to remove PII and PHI from the text in the streaming pipeline.

Philter is available from the AWS Marketplace. Not using AWS? Philter is also available from the Google Cloud Marketplace and the Microsoft Azure Marketplace.

Airlock Provides Protection Against Disclosure of Sensitive Information in AI-Generated Text

 

Airlock is available in the AWS, Google Cloud, and Microsoft Azure cloud marketplaces for turnkey deployment.

In the age of artificial intelligence, the use of AI-generated text has become increasingly prevalent in many industries. However, with the rise of this technology comes the risk of sensitive information being disclosed unintentionally. To address this issue, the team at Philterd, LLC, has developed Airlock, software designed to prevent the disclosure of sensitive information in AI-generated text.

Airlock utilizes advanced algorithms and machine learning techniques to scan and analyze AI-generated text for any potential sensitive information. This includes personal data and other sensitive details that could pose a risk if disclosed. The software can automatically redact or modify the identified information, providing guardrails for AI applications.

“We are excited to make Airlock available. The inadvertent disclosure of sensitive information, such as PII and PHI, is an important consideration that should not be overlooked when creating AI-enabled applications,” said Jeff Zemerick, founder of Philterd, LLC. “AI-generated text brings a new dimension to safeguarding PII and PHI, and we look forward to helping users with this challenge.”

The need for such software has become more apparent in recent years, with numerous incidents of sensitive information being leaked through AI-generated text. This has not only caused harm to individuals and businesses but has also raised concerns about the ethical use of AI. With Airlock, these concerns can be addressed, and the risk of sensitive information disclosure can be significantly reduced. Airlock builds on Philterd’s open source de-identification and redaction software.

Airlock is available on the Amazon Web Services, Google Cloud, and Microsoft Azure marketplaces for deployment into users’ cloud environments. To learn more about Airlock and its features, visit https://www.philterd.ai or contact support@philterd.ai.

About Philterd, LLC

Philterd specializes in helping keep your sensitive information safe. Learn more at www.philterd.ai.

Philter as an AI Policy Layer

A policy layer is an important part of every source of AI-generated text.

An AI policy layer is an important part of every source of AI-generated text because it inspects the AI-generated text to prevent sensitive information from being exposed. A policy layer can help remove information such as names, addresses, and telephone numbers from responses.

In this blog post we will describe the function of an AI policy layer and how Philter is well-suited for the role. Philter is available on the AWS Marketplace, Google Cloud Marketplace, and the Microsoft Azure Marketplace.

What is an AI policy layer and why is it needed?

As Cassie Kozyrkov wrote in her blog post linked below, "If you care about AI safety, you’ll insist that every AI-based system should have policy layers built on top of it. Think of policy layers as the AI version of human etiquette." - AI Bias: Good intentions can lead to nasty results

An AI policy layer is a part of your AI architecture that sits between your chat bot (or other source of AI-generated text) and your end-user. The role of an AI policy layer is to inspect the AI-generated text for sensitive information and remove it before sending the text to the user.

An AI policy layer is needed because it can be extremely difficult to know what data an AI model was trained on. Even when due diligence is done and care is taken, sensitive information can find its way into training data and it can be hard to detect simply due to the vast size of the training data.

How can Philter be used as an AI policy layer?

Philter was designed to integrate into virtually all types of applications. Philter's API is very simple and can be called from any application. With its text-in and redacted text-out operation, Philter can receive your AI generated text, inspect it for sensitive information based on your configuration, and redact any that is found.
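
As an example of how simple the integration can be, the following Python sketch sends AI-generated text to Philter's filter API and prints the redacted result. The endpoint address is a placeholder; replace PHILTER_IP with the address of your own Philter instance.

import requests

# Hypothetical AI-generated response that may contain sensitive information.
generated_text = "Sure! You can reach John at 123-45-6789."

# Send the text to Philter and receive the redacted text back.
response = requests.post("https://PHILTER_IP:8080/api/filter",
                         data=generated_text.encode('utf-8'),
                         headers={'Content-Type': 'text/plain'},
                         verify=False,  # use a valid signed certificate in production
                         timeout=20)

print(response.text)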

Can the AI policy layer be customized to my industry?

Yes! How Philter finds and redacts sensitive information is defined in a file called a filter profile. A filter profile can be thought of as a policy because it lets you specify what types of sensitive information should be redacted. You can create as many filter profiles as you need.

Automatically Redacting PII and PHI from Files in Amazon S3 using Amazon Macie and Philter

Amazon Macie is "a data security service that discovers sensitive data using machine learning and pattern matching." With Amazon Macie you can find potentially sensitive information in files in your Amazon S3 buckets, but what do you do when Amazon Macie finds a file that contains an SSN, phone number, or other piece of sensitive information?

Philter is software that redacts PII, PHI, and other sensitive information from text. Philter runs entirely within your private cloud and does not require any external connectivity. Your data never leaves your private cloud and is not sent to any third-party. In fact, you can run Philter without any external network connectivity and we recommend doing so!

In this blog post we will show how you can use Philter alongside Amazon Macie, Amazon EventBridge, and AWS Lambda to find and redact PII, PHI, or other sensitive information in your files in Amazon S3. If you are setting this up for your organization and need help, feel free to reach out!

How it Works

Here's how it will work:

  1. Amazon Macie will look for files in Amazon S3 buckets that contain potentially sensitive information.

  2. When Amazon Macie identifies a file, it will be sent as an event to Amazon EventBridge.

  3. An Amazon EventBridge rule that detects events from Amazon Macie will invoke an AWS Lambda function.

  4. The AWS Lambda function will use Philter to redact the file.

Setting it Up

Configuring Amazon Macie

The first thing we will do is enable Amazon Macie. It's easiest to follow the steps in the Amazon Macie documentation to enable it in your account - it's just a few clicks. Once you have Amazon Macie configured, come back here to continue!

Creating the AWS Lambda Function

Next, we want to create an AWS Lambda function. This function will be invoked whenever a file in an Amazon S3 bucket is found to contain sensitive information. Our function will be provided the name of the bucket and the object's key. With that information, our function can retrieve the file, use Philter to redact the sensitive information, and either overwrite the existing file or write the redacted file to a new object.

The Lambda function will receive a JSON object that contains the details of the files identified by Amazon Macie. It will look like this:

{
  "version": "0",
  "id": "event ID",
  "detail-type": "Macie Finding",
  "source": "aws.macie",
  "account": "AWS account ID (string)",
  "time": "event timestamp (string)",
  "region": "AWS Region (string)",
  "resources": [
    <-- ARNs of the resources involved in the event -->
  ],
  "detail": {
    <-- Details of a policy or sensitive data finding -->
  },
  "policyDetails": null,
  "sample": Boolean,
  "archived": Boolean
}

You can find more about the schema of the event here. What's most important to us is the name of the bucket and the key of the object identified by Amazon Macie. In the detail section of the above JSON object, there will be an s3Object that contains that information:

"s3Object":{
  "bucketArn":"arn:aws:s3:::my-bucket",
  "key":"sensitive.txt",
  "path":"my-bucket/sensitive.txt",
  "extension":"txt",
  "lastModified":"2023-10-05T01:32:21.000Z",
  "versionId":"",
    "serverSideEncryption":{
    "encryptionType":"AES256",
    "kmsMasterKeyId":"None"
  },
  "size":807,
  "storageClass":"STANDARD",
  "tags":[
  ],
  "publicAccess":false,
  "etag":"accdb2c550e3aa13610cbd87b91e3ec7"
}

This information gives the location of the identified file! It is s3://my-bucket/sensitive.txt. Now we can use Philter to redact this file!

You have a few choices here. You can have your AWS Lambda function grab that file from S3, redact it using Philter, and then overwrite the existing file. Or, you can choose to write it to a new file in S3 and preserve the original file. Which you do is up to you and your business requirements!

Redacting the File with Philter

To use Philter you must have an instance of it running! You can quickly launch Philter as an Amazon EC2 instance via the AWS Marketplace. In under 5 minutes you will have a running Philter instance ready to redact text via its API.

With Philter's API, you can use any programming language you like. There are client SDKs available for Java, .NET, and Go, but the Philter API is simple and easily callable from other languages like Python. You just need to be able to access Philter's API from your Lambda function at an endpoint like https://<philter-ip>:8080.

You just need to decide how you want to redact the file. Redaction in Philter is done via a policy, and you can tailor the policy to your business needs. Perhaps you want to mask social security numbers, shift dates, redact email addresses, and replace people's names with randomly generated names. You can create a Philter policy to do just that and apply it when calling Philter's API. See the Philter documentation to learn more about policies and to see some sample policies.
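
As a rough sketch of what the function might look like, the following lambda_function.py pulls the bucket and key out of the Macie finding, retrieves the object from S3, sends the text to Philter, and writes the redacted text back. The Philter endpoint, the exact nesting of the s3Object within the event detail, and the choice to overwrite the original object are assumptions to adapt to your environment.

import urllib3

import boto3

s3 = boto3.client('s3')

# Ignoring a self-signed certificate here; use a valid signed certificate in production.
http = urllib3.PoolManager(cert_reqs='CERT_NONE')

# Assumed address of your Philter instance; replace with your Philter IP or load balancer.
PHILTER_URL = 'https://PHILTER_IP:8080/api/filter'


def lambda_handler(event, context):

    # The Macie finding arrives via EventBridge; the s3Object is assumed to be
    # nested under resourcesAffected in the event's detail section.
    s3_object = event['detail']['resourcesAffected']['s3Object']
    bucket = s3_object['bucketArn'].split(':::')[1]
    key = s3_object['key']

    # Retrieve the identified file from S3.
    text = s3.get_object(Bucket=bucket, Key=key)['Body'].read()

    # Send the text to Philter for redaction.
    response = http.request('POST', PHILTER_URL, body=text, headers={'Content-Type': 'text/plain'})
    redacted = response.data.decode('utf-8')

    # Overwrite the original object with the redacted text, or change the key here
    # to write a new object and preserve the original file.
    s3.put_object(Bucket=bucket, Key=key, Body=redacted.encode('utf-8'))

    return {'bucket': bucket, 'key': key}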

Once you have your AWS Lambda function and Philter policy the way you want it, you can deploy the Lambda function:

aws lambda create-function --function-name redact-with-philter \
  --runtime python3.11 --handler lambda_function.lambda_handler \
  --role arn:aws:iam::accountId:role/service-role/my-lambda-role \
  --zip-file fileb://code.zip

Just update the values in that command as needed. Don't forget to set your AWS account ID in the role's ARN!

Configuring Amazon EventBridge

To create the Amazon EventBridge rule:

aws events put-rule --name MacieFindings --event-pattern "{\"source\":[\"aws.macie\"]}"

MacieFindings is the name that you want to give the rule. The response will be an ARN - note it because you will need it.
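
The response should look something like this (the region and account ID shown are placeholders):

{
    "RuleArn": "arn:aws:events:us-east-1:123456789012:rule/MacieFindings"
}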

Now we want to specify the AWS Lambda function that will be invoked by our EventBridge rule:

aws events put-targets \
  --rule MacieFindings \
  --targets Id=1,Arn=arn:aws:lambda:region:accountId:function:redact-with-philter

Just replace the values in the function's ARN with the details of your AWS Lambda function. Lastly, we just need to give EventBridge permissions to invoke the Lambda function:

aws lambda add-permission \
  --function-name redact-with-philter \
  --statement-id Sid \
  --action lambda:InvokeFunction \
  --principal events.amazonaws.com \
  --source-arn arn:aws:events:region:accountId:rule/MacieFindings

Again, update the ARN as appropriate.

Now, when Amazon Macie runs and finds potentially sensitive information in an object in one of your Amazon S3 buckets, an event will be sent to EventBridge, where the rule we created will invoke our Lambda function. The file will be sent to Philter where it will be redacted. The redacted text will then be returned to the Lambda function.

Summary

In this blog post we have provided the framework for using Philter alongside Amazon Macie, Amazon EventBridge, and AWS Lambda to redact PII, PHI, and other sensitive information from files in Amazon S3 buckets.

If you need help setting this up please reach out! We can help you through the steps.

Philter is available from the AWS Marketplace. Not using AWS? Philter is also available from the Google Cloud Marketplace and the Microsoft Azure Marketplace.

Phileas in Graylog - Removing PII from Logs

We are very excited to share with you that Graylog has integrated Phileas, the open source PII/PHI redaction engine, into their centralized log management solution. With this new integration, Graylog now has the ability to identify and redact different types of PII (personally identifiable information) present in logs.

The presence of PII in logs is a serious concern. Even careful application developers can find it difficult to prevent all PII from being included in logs. Error messages and stack traces can inadvertently include PII, exposing the business to risk and liability.

Phileas is the heart of Philter, an API-based redaction engine. Philter, also open source, provides users with a centralized tool for finding and manipulating PII and PHI in text. With Philter, sensitive information can be redacted, anonymized, or replaced. Philter is available on the AWS, Google Cloud, and Microsoft Azure marketplaces for deployment into your private cloud. Philter requires no outside internet access so your sensitive data never needs to leave your network to be redacted.

Because Phileas is licensed under the business-friendly open source Apache license, organizations are able to bring Phileas' ability to find and redact PII into their own applications. To learn more about Phileas or to get started integrating Phileas into your applications, visit the Phileas repository on GitHub.