Skip to content

matheusfsa/fake-dataset

Repository files navigation

Fake Dataset

image

Toolkit for generating fake datasets.

How to Install

pip install fake-dataset

Usage

>>> from fake_dataset import columns, generators

>>> data_gen = generators.DataGenerator(
...    vehicle=columns.CategoricalRandomColumn(categories=["car", "bus", "bicycle"], missing_rate=(0.2, 0.5), na_value="NA"),
...    year=columns.IntegerRandomColumn(values_range=(1950, 2010), missing_rate=(0.1, 0.2)),
...    value=columns.FloatRandomColumn(values_range=(10e4, 10e5), missing_rate=(0.0, 0.0)),
...    )

>>> data_gen.sample(3)
           value vehicle  year
0  823994.355388     car  2002
1  903007.903927      NA  1952
2  435372.320886      NA  None

Credits

This package was created with Cookiecutter and the giswqs/pypackage project template.

About

Toolkit for generating fake datasets.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages