Nguyen
SRToolkit.dataset.nguyen
Nguyen symbolic regression benchmark.
Nguyen
Nguyen(dataset_directory: str = os.path.join(user_data_dir('SRToolkit'), 'nguyen'), n_samples: int = 10000, seed: Optional[int] = 42, force_generate: bool = False)
Bases: SR_benchmark
The Nguyen symbolic regression benchmark.
Contains 10 expressions without constant parameters (first 4 are polynomials, first 8 use
one variable, last 2 use two variables). The benchmark ships with pre-generated data. If the
download fails, data is generated from the stored per-variable samplers using n_samples
points and the given seed.
References
Examples:
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
dataset_directory
|
str
|
Directory where dataset files are stored or will be downloaded to.
Defaults to the platform-appropriate user data directory (e.g. |
join(user_data_dir('SRToolkit'), 'nguyen')
|
n_samples
|
int
|
Number of samples to generate per dataset when falling back to sampler-based
data generation (i.e. when the download fails or |
10000
|
seed
|
Optional[int]
|
Random seed used for sampler-based data generation. Defaults to |
42
|
force_generate
|
bool
|
If |
False
|