pydvl.valuation.utility.classwise
¶
This module defines the utility used by class-wise Shapley valuation methods.
See the documentation for more information.
ClasswiseModelUtility
¶
ClasswiseModelUtility(
model: SupervisedModel,
scorer: ClasswiseSupervisedScorer,
*,
catch_errors: bool = False,
show_warnings: bool = False,
cache_backend: CacheBackend | None = None,
cached_func_options: CachedFuncConfig | None = None,
clone_before_fit: bool = True,
)
Bases: ModelUtility[ClasswiseSample, SupervisedModel]
ModelUtility class that is specific to class-wise shapley valuation.
It expects a class-wise scorer and a classification task.
PARAMETER | DESCRIPTION |
---|---|
model
|
Any supervised model. Typical choices can be found in the sci-kit learn documentation.
TYPE:
|
scorer
|
A class-wise scoring object. |
catch_errors
|
set to
TYPE:
|
show_warnings
|
Set to
TYPE:
|
cache_backend
|
Optional instance of CacheBackend used to wrap the _utility method of the Utility instance. By default, this is set to None and that means that the utility evaluations will not be cached.
TYPE:
|
cached_func_options
|
Optional configuration object for cached utility evaluation.
TYPE:
|
clone_before_fit
|
If
TYPE:
|
Source code in src/pydvl/valuation/utility/classwise.py
cache_stats
property
¶
cache_stats: CacheStats | None
Cache statistics are gathered when cache is enabled. See CacheStats for all fields returned.
training_data
property
¶
training_data: Dataset | None
Retrieves the training data used by this utility.
This property is read-only. In order to set it, use with_dataset().
__call__
¶
__call__(sample: SampleT | None) -> float
PARAMETER | DESCRIPTION |
---|---|
sample
|
contains a subset of valid indices for the
TYPE:
|
Source code in src/pydvl/valuation/utility/modelutility.py
__str__
¶
Returns a string representation of the utility. Subclasses should override this method to provide a more informative string
sample_to_data
¶
sample_to_data(sample: SampleT) -> tuple
Returns the raw data corresponding to a sample.
Subclasses can override this e.g. to do reshaping of tensors. Be careful not to
rely on self.training_data
not changing between calls to this method. For
manipulations to it, use the with_dataset()
method.
PARAMETER | DESCRIPTION |
---|---|
sample
|
contains a subset of valid indices for the
TYPE:
|
Returns: Tuple of the training data and labels corresponding to the sample indices.
Source code in src/pydvl/valuation/utility/modelutility.py
with_dataset
¶
Returns the utility, or a copy of it, with the given dataset. Args: data: The dataset to use for utility fitting (training data) copy: Whether to copy the utility object or not. Valuation methods should always make copies to avoid unexpected side effects. Returns: The utility object.