I assume that the use case is - create a snapshot of production database for testing purposes (functional, performance, integration testing), right?
What about GDPR then? I remember that production data can’t be used for testing purposes because it contains sensitive data.
j
Jeremy Kreutzbender
08/02/2023, 3:20 PM
We're very close to shipping a feature that will allow people to integrate with https://www.tonic.ai/ to cleanse the PII. The process would be, Snapshot from AWS -> Create database -> Cleanse with Tonic -> New cleansed snapshot -> Instant Datasets uses the cleansed snapshot to make databases which people can check out
Jeremy Kreutzbender
08/02/2023, 3:22 PM
We're also exploring a much more simple version (free / open source) of data cleansing where maybe you might just need to truncate tables or replace credit card numbers with a random set of digits. Tonic excels in this area and works to maintain the realism of the data, but that level of detail may not be required for everyone
j
Jevgeni Demidov
08/02/2023, 3:23 PM
I think GDPR and different ISO/SOC certifications requirements are higher than just removing credit cards
j
Jeremy Kreutzbender
08/02/2023, 3:25 PM
Credit cards is an example of PII that may need to be removed. It would be up to the team using this feature to determine what data needs to be cleansed / removed