Anonymizer

Anonymizer is a toolkit to help you to automate production data anonymization for test environments and compliancy.

Features:

Scramble data by column
Maintains referential integrity
Consistent output state on multiple runs of same source data
Truncate entire tables
Drop tables or individual columns
Store configuration (DSN + Filename) in a .env file

Usage

Scan your application's database schema and decide which columns contain sensitive information. For example: user.email or request.ip, etc.
Create a configuration file (example below) for your application that lists all the sensitive columns with a method for anonymization.
Run anonymizer on your test database:

vendor/bin/anonymizer

anonymizer.yml format

This file defines which columns needs to be anonymized, and using which method. Additionally you truncate or drop entire tables or columns.

Here's an example:

---
columns:
    user.email:
      method: faker
      arguments:
        formatter: email
      cascades:
        - user_email.address
        - comment.email

    request.ip:
      method: faker
      arguments:
        formatter: ipv5
      cascades:
        - exception.ip
truncate:
  - table1
  - table2

drop:
  - user.ip
  - request.agent
  - tmp*
  - *.password

All columns are listed in tableName.columnName format. For each column a method is defined, with some optional arguments. Most common is the faker method, that takes a formatter as an argument (i.e. email, userName, city, ipv4, etc - see the faker docs for more)

If you have any columns in other tables that reference this column, you can list them in the cascades key (optional). This will ensure that the external columns are updated with the same new value so their references still work.

Wildcards

You can use wildcards (*, ?) in tablenames of the drop list.

Configuration

You can use the environment (or a .env file) to pass ANONYMIZER_DSN (or PDO) and ANONYMIZER_FILENAME values to the anonymizer run command. These values will be used to connect to the database, and read the specified configuration yaml file.

About the "randomly" generated data

The faker is initialized with the same seed every run (0). This ensures that multiple runs of anonymizer on the same source data result in the same anonymized data.
The faker method ensures all generated values are unique within a single table. This prevents problems with references etc
If you list cascades that contain values that are not defined in the source table, they will be updated to NULL. This prevents sensitive data lingering around in cascades accidentally. In a properly integrity-checked database this scenario would not happen.

Verbosity / schema

anonymizer analyzes the schema (tablenames + columnnames) before running. This information is used by the wildcard functionality. You can run anonymizer with -v to increase the verbosity to view this data. This can help you to verify if you're not seeing any tables or columns that should be dropped.

License

MIT. Please refer to the license file for details.

Brought to you by the LinkORB Engineering team

Check out our other projects at linkorb.com/engineering.

Btw, we're hiring!

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
bin		bin
src		src
.editorconfig		.editorconfig
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
composer.json		composer.json
composer.lock		composer.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Anonymizer

Features:

Usage

anonymizer.yml format

Wildcards

Configuration

About the "randomly" generated data

Verbosity / schema

License

Brought to you by the LinkORB Engineering team

About

Releases

Packages

Languages

License

prajapati-kaushik/anonymizer

Folders and files

Latest commit

History

Repository files navigation

Anonymizer

Features:

Usage

anonymizer.yml format

Wildcards

Configuration

About the "randomly" generated data

Verbosity / schema

License

Brought to you by the LinkORB Engineering team

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages