skdim.datasets.BenchmarkManifolds¶
-
class
skdim.datasets.BenchmarkManifolds(random_state: int = None, noise_type: str = 'uniform')[source]¶ Generates a commonly used benchmark set of synthetic manifolds with known intrinsic dimension described by Hein et al. and Campadelli et al. [Campadelli2015]
Parameters: - noise_type : str, 'uniform' or 'gaussian'
Type of noise to generate
Methods
generate(name, n, dim, d, noise)Generates all datasets. -
generate(name: str = 'all', n: int = 2500, dim: int = None, d: int = None, noise: float = 0.0)[source]¶ Generates all datasets. A ground truth dict of intrinsic dimension and embedding dimension is in BenchmarkManifolds.dict_truth.keys()
Parameters: - n : int
The number of sample points
- dim : int
If generating a single dataset, choose the embedding dimension. Note that some datasets have restrictions on the chosen embedding dimension
- d : int
If generating a single dataset, choose the intrinsic dimension. Note that some datasets have restrictions on the chosen intrinsic dimension
- noise : float, optional(default=0.0)
The value of noise in data
Returns: data (a dict of np.arrays or a single np.array with shape (n, dim)) – Generated data