skdatasets.utils.experiment.create_experiments¶
- skdatasets.utils.experiment.create_experiments(*, datasets: Mapping[str, Union[Bunch, Callable[[...], Bunch], Tuple[Callable[[...], Bunch], Union[Mapping[str, Any], str]]]], estimators: Mapping[str, Union[EstimatorProtocol[Any, Any], Callable[[...], EstimatorProtocol[Any, Any]], Tuple[Callable[[...], EstimatorProtocol[Any, Any]], Union[Mapping[str, Any], str]]]], storage: sacred.observers.base.RunObserver | str, config: Optional[Union[Mapping[str, Any], str]] = None, inner_cv: Union[CVSplitter, Iterable[Tuple[ndarray[Any, dtype[int]], ndarray[Any, dtype[int]]]], int, None, Literal[False, 'dataset']] = False, outer_cv: Union[CVSplitter, Iterable[Tuple[ndarray[Any, dtype[int]], ndarray[Any, dtype[int]]]], int, None, Literal[False, 'dataset']] = None, save_estimator: bool = False, save_train: bool = False) Sequence[Experiment] [source]¶
Create several Sacred experiments.
It receives a set of estimators and datasets, and create Sacred experiment objects for them.
- Parameters:
datasets (Mapping) –
Mapping where each key is the name for a dataset and each value is either:
A
sklearn.utils.Bunch
with the fields explained in Dataset structure. Onlydata
andtarget
are mandatory.A function receiving arbitrary config values and returning a
sklearn.utils.Bunch
object like the one explained above.A tuple with such a function and additional configuration (either a mapping or a filename).
estimators (Mapping) –
Mapping where each key is the name for a estimator and each value is either:
A scikit-learn compatible estimator.
A function receiving arbitrary config values and returning a scikit-learn compatible estimator.
A tuple with such a function and additional configuration (either a mapping or a filename).
storage (
sacred.observers.RunObserver
orstr
) – Where the experiments will be stored. Either a Sacred observer, for example to store in a Mongo database, or the name of a directory, to use a file observer.config (Mapping,
str
orNone
, defaultNone
) – A mapping or filename with additional configuration for the experiment.inner_cv (CV-like object,
"datasets"
orFalse
, defaultFalse
) –For estimators that perform cross validation (they have a
cv
parameter) this sets the cross validation strategy, as follows:If
False
the original value ofcv
is unchanged.If
"dataset"
, thesklearn.utils.Bunch
objects for the datasets must have ainner_cv
attribute, which will be the one used.Otherwise,
cv
is changed to this value.
outer_cv (CV-like object,
"datasets"
orFalse
, defaultNone
) –The strategy used to evaluate different partitions of the data, as follows:
If
False
use only one partition: the one specified in the dataset. Thus thesklearn.utils.Bunch
objects for the datasets should have defined at least a train and a test partition.If
"dataset"
, thesklearn.utils.Bunch
objects for the datasets must have aouter_cv
attribute, which will be the one used.Otherwise, this will be passed to
sklearn.model_selection.check_cv()
and the resulting cross validator will be used to define the partitions.
save_estimator (bool, default
False
) – Whether to save the fitted estimator. This is useful for debugging and for obtaining extra information in some cases, but for some estimators it could consume much storage.save_train (bool, default
False
) – IfTrue
, compute and store also the score over the train data.
- Returns:
experiments – Sequence of Sacred experiments, ready to be run.
- Return type:
Sequence of
sacred.Experiment
See also