Private Identifiable Information
Understand Galileo's PII Metric
Definition: Identify PII spans within a sample (both input and output). The current model detects the following precisely defined categories:
Account Information: account numbers, BIC and IBAN.
Address: must contain at least a street name and number, and may contain extra elements such as city, zip code, state, etc.
Credit Card: credit card number, CVV and expiration date.
Date of Birth: must contain a day, month and year.
Email.
Name: must contain first and last name (and optionally middle name).
Network Information: IPv4, IPv6 and MAC addresses.
Personal Identification: personal IDs not included in other categories. In particular: PIN, IMEI, VIN, VRM, Driver license.
Password.
Phone Number.
Social Security Number (SSN).
Username.
Calculation: We leverage a Small Language Model (SML) trained on proprietary datasets.
Usefulness: Automatically identify PII occurrences in any part of the workflow (user input, chains, model output, etc), and respond accordingly by implementing guardrails or other preventative measures.
Explainability: To highlight which parts of the text were detected as PII, click on the 👁️ icon next to the PII metric value. The type of PII detected along with the model's confidence will be shown on the input or output text.
Last updated