syndi.sampler

Sampler

Sampler(task_instance, train_data, generator)

Sampler.sample_data()

Sampler._store_data(synthetic_data)

Sampler._sample_original()

Sampler._sample_uniform()

Sampler._get_class_to_sample_size(data)

Sampler._sample_uniform_classification()

Sampler._sample_in_interval(sample_size, …)

Sampler._sample_uniform_regression()

Convert self.train_data[self.task.target] continuous column into a discrete binned distribution from max to min value of the continuous columns then run retry sampling (sample a bunch of elements and only select ones in interval) to balance the target column using synthetic data.

Sampler._sample_baseline()