Are you wondering if Statice has the right synthetic data solution for your needs? In this post, we discuss some of the advantages of working with our software. From integration to evaluation, our data anonymization solution has everything to fit your team’s requirements.
2. The idea and the technology behind the Statice data
anonymization software
Original data
Artificially-generated
new data points
Privacy-preserving
synthetic data
Privacy guarantee to
prevent re-identification
3. What does your data team need to know about Statice's
solution?
Get started immediately
Your entire data team can use it
Plug in the most data sources
Use various types of structured data
Integrate in modern infrastructure
Customize the synthetization
Assess utility with evaluations
Get privacy guarantees
Work with large-scale datasets
1
2
3
4
5
6
7
8
9
4. 1. You can integrate the Statice software in any modern
infrastructure.
You deploy the SDK on-premise, in a private cloud,
or your local infrastructure. You can also deploy on
any major public cloud providers such as Google
Cloud, AWS, or Azure, and data analytics platforms
like Databricks or JupyterHub.
5. 2. You can get started immediately.
Deploying Statice software is not only simple, but it's
also fast. It took less than two hours for one of our
clients in the financial industry to install and run
their first dataset synthesization. Besides, our team
provides you with full support and extensive
documentation.
6. 3. Your entire data team can use the Statice software.
The software comes with a programmatic
interface and a command-line interface (CLI).
Therefore, the Statice software is not dedicated only
to developers and data scientists but also, through
the CLI, to users with basic programming skills.
7. 4. You can plug in the most commonly used data sources.
The software supports any data in tabular form:
from .csv files to database exports (Postgres, MySQL,
MongoDB). It is also possible to get custom data
formats on request.
8. 5. You can generate synthetic data from various types of
structured data.
The software can generate synthetic data from
most structured data types, including primitive
types like categorical, continuous, or discrete. It can
also input non-primitive types such as
geolocation, temporal, DateTime, and transactional
data types.
9. 6. You can work with large-scale datasets.
Statice software can handle large amounts of data.
Users successfully processed datasets with tens of
millions of entries and over 500 dimensions.
10. 7. You can customize the synthetization process.
The software is highly customizable to fit your
project's needs. You can manually extend the types
of supported data and fine-tune the synthetization
process by adjusting the parameters' values. You
can also use table lookup to replace PII, which is
removed from the data with user-provided "fake"
information like.
11. 8. You get evaluations to assess the utility of your synthetic
data.
The software compares the conditional
distributions, the pairwise dependencies, and the
original dataset's marginal distributions to the
synthetic dataset to ensure that utility is preserved.
12. 9. You get guarantees about the privacy of your synthetic
data.
In addition to the Statice models being trained to
satisfy differential privacy, several mechanisms
ensure that your synthetic data is truly anonymous.
For instance, the software simulates privacy
attacks to ensure the generated synthetic data's
full anonymity.