pydvl.influence.torch.util ¶

TorchTensorContainerType `module-attribute` ¶

TorchTensorContainerType = Union[
    Tensor, Collection[Tensor], Mapping[str, Tensor]
]

Type for a PyTorch tensor or a container thereof.

BlockMode ¶

Bases: Enum

Enumeration for different modes of grouping model parameters.

ATTRIBUTE	DESCRIPTION
`LAYER_WISE`	Groups parameters by layers of the model.
`PARAMETER_WISE`	Groups parameters individually.
`FULL`	Groups all parameters together.

EkfacRepresentation `dataclass` ¶

EkfacRepresentation(
    layer_names: Iterable[str],
    layers_module: Iterable[Module],
    evecs_a: Iterable[Tensor],
    evecs_g: Iterable[Tensor],
    diags: Iterable[Tensor],
)

Container class for the EKFAC representation of the Hessian. It can be iterated over to get the layers names and their corresponding module, eigenvectors and diagonal elements of the factorized Hessian matrix.

PARAMETER	DESCRIPTION
`layer_names`	Names of the layers. TYPE: `Iterable[str]`
`layers_module`	The layers. TYPE: `Iterable[Module]`
`evecs_a`	The a eigenvectors of the ekfac representation. TYPE: `Iterable[Tensor]`
`evecs_g`	The g eigenvectors of the ekfac representation. TYPE: `Iterable[Tensor]`
`diags`	The diagonal elements of the factorized Hessian matrix. TYPE: `Iterable[Tensor]`

get_layer_evecs ¶

get_layer_evecs() -> Tuple[Dict[str, Tensor], Dict[str, Tensor]]

It returns two dictionaries, one for the a eigenvectors and one for the g eigenvectors, with the layer names as keys. The eigenvectors are in the same order as the layers in the model.

Source code in src/pydvl/influence/torch/util.py

def get_layer_evecs(
    self,
) -> Tuple[Dict[str, torch.Tensor], Dict[str, torch.Tensor]]:
    """
    It returns two dictionaries, one for the a eigenvectors and one for the g
    eigenvectors, with the layer names as keys. The eigenvectors are in the same
    order as the layers in the model.
    """
    evecs_a_dict = {layer_name: evec_a for layer_name, (_, evec_a, _, _) in self}
    evecs_g_dict = {layer_name: evec_g for layer_name, (_, _, evec_g, _) in self}
    return evecs_a_dict, evecs_g_dict

ModelInfoMixin ¶

ModelInfoMixin(model: Module)

A mixin class for classes that contain information about a model.

Source code in src/pydvl/influence/torch/util.py

def __init__(self, model: torch.nn.Module):
    self.model = model

ModelParameterDictBuilder `dataclass` ¶

ModelParameterDictBuilder(model: Module, detach: bool = True)

A builder class for creating ordered dictionaries of model parameters based on specified block modes or custom blocking structures.

ATTRIBUTE	DESCRIPTION
`model`	The neural network model. TYPE: `Module`
`detach`	Whether to detach the parameters from the computation graph. TYPE: `bool`

build ¶

build(
    block_structure: OrderedDict[str, List[str]],
) -> Dict[str, Dict[str, Parameter]]

Builds an ordered dictionary of model parameters based on the specified block structure represented by an ordered dictionary, where the keys are block identifiers and the values are lists of model parameter names contained in this block.

PARAMETER	DESCRIPTION
`block_structure`	The block structure specifying how to group the parameters. TYPE: `OrderedDict[str, List[str]]`

RETURNS	DESCRIPTION
`Dict[str, Dict[str, Parameter]]`	An ordered dictionary of ordered dictionaries, where the outer dictionary's
`Dict[str, Dict[str, Parameter]]`	keys are block identifiers and the inner dictionaries map parameter names
`Dict[str, Dict[str, Parameter]]`	to parameters.

Source code in src/pydvl/influence/torch/util.py

def build(
    self, block_structure: OrderedDict[str, List[str]]
) -> Dict[str, Dict[str, torch.nn.Parameter]]:
    """
    Builds an ordered dictionary of model parameters based on the specified block
    structure represented by an ordered dictionary, where the keys are block
    identifiers and the values are lists of model parameter names contained in
    this block.

    Args:
        block_structure: The block structure specifying how to group the parameters.

    Returns:
        An ordered dictionary of ordered dictionaries, where the outer dictionary's
        keys are block identifiers and the inner dictionaries map parameter names
        to parameters.
    """
    parameter_dict = {}

    for block_name, parameter_names in block_structure.items():
        inner_ordered_dict = {}
        for parameter_name in parameter_names:
            parameter = self._extract_parameter_by_name(parameter_name)
            if parameter.requires_grad:
                inner_ordered_dict[parameter_name] = self._optional_detach(
                    parameter
                )
            else:
                warnings.warn(
                    f"The parameter {parameter_name} from the block "
                    f"{block_name} is mark as not trainable in the model "
                    f"and will be excluded from the computation."
                )
        parameter_dict[block_name] = inner_ordered_dict

    return parameter_dict

build_from_block_mode ¶

build_from_block_mode(block_mode: BlockMode) -> Dict[str, Dict[str, Parameter]]

Builds an ordered dictionary of model parameters based on the specified block mode or custom blocking structure represented by an ordered dictionary, where the keys are block identifiers and the values are lists of model parameter names contained in this block.

PARAMETER	DESCRIPTION
`block_mode`	The block mode specifying how to group the parameters. TYPE: `BlockMode`

RETURNS	DESCRIPTION
`Dict[str, Dict[str, Parameter]]`	An ordered dictionary of ordered dictionaries, where the outer dictionary's
`Dict[str, Dict[str, Parameter]]`	keys are block identifiers and the inner dictionaries map parameter names
`Dict[str, Dict[str, Parameter]]`	to parameters.

Source code in src/pydvl/influence/torch/util.py

def build_from_block_mode(
    self, block_mode: BlockMode
) -> Dict[str, Dict[str, torch.nn.Parameter]]:
    """
    Builds an ordered dictionary of model parameters based on the specified block
    mode or custom blocking structure represented by an ordered dictionary, where
    the keys are block identifiers and the values are lists of model parameter names
    contained in this block.

    Args:
        block_mode: The block mode specifying how to group the parameters.

    Returns:
        An ordered dictionary of ordered dictionaries, where the outer dictionary's
        keys are block identifiers and the inner dictionaries map parameter names
        to parameters.
    """

    block_mode_mapping = {
        BlockMode.FULL: self._build_full,
        BlockMode.PARAMETER_WISE: self._build_parameter_wise,
        BlockMode.LAYER_WISE: self._build_layer_wise,
    }

    parameter_dict_func = block_mode_mapping.get(block_mode, None)

    if parameter_dict_func is None:
        raise ValueError(f"Unknown block mode {block_mode}.")

    return self.build(parameter_dict_func())

NestedTorchCatAggregator ¶

Bases: NestedSequenceAggregator[Tensor]

An aggregator that concatenates tensors using PyTorch's torch.cat function. Concatenation is done along the first two dimensions of the chunks.

call ¶

__call__(nested_sequence_of_tensors: NestedLazyChunkSequence[Tensor]) -> Tensor

Aggregates tensors from a nested generator structure into a single tensor by concatenating. Each inner generator is first concatenated along dimension 1 into a tensor, and then these tensors are concatenated along dimension 0 together to form the final tensor.

PARAMETER	DESCRIPTION
`nested_sequence_of_tensors`	Object wrapping a generator of generators, where each inner generator yields `torch.Tensor` objects. TYPE: `NestedLazyChunkSequence[Tensor]`

RETURNS	DESCRIPTION
`Tensor`	A single tensor formed by concatenating all tensors from the nested
`Tensor`	generators.

Source code in src/pydvl/influence/torch/util.py

def __call__(
    self, nested_sequence_of_tensors: NestedLazyChunkSequence[torch.Tensor]
) -> torch.Tensor:
    """
    Aggregates tensors from a nested generator structure into a single tensor by
    concatenating. Each inner generator is first concatenated along dimension 1 into
    a tensor, and then these tensors are concatenated along dimension 0 together to
    form the final tensor.

    Args:
        nested_sequence_of_tensors: Object wrapping a generator of generators,
            where each inner generator yields `torch.Tensor` objects.

    Returns:
        A single tensor formed by concatenating all tensors from the nested
        generators.

    """

    outer_gen = cast(
        Iterator[Iterator[torch.Tensor]],
        nested_sequence_of_tensors.generator_factory(),
    )
    len_outer_generator = nested_sequence_of_tensors.len_outer_generator
    if len_outer_generator is not None:
        outer_gen = cast(
            Iterator[Iterator[torch.Tensor]],
            tqdm(outer_gen, total=len_outer_generator, desc="Row blocks"),
        )

    return torch.cat(
        list(
            map(
                lambda tensor_gen: torch.cat(list(tensor_gen), dim=1),
                outer_gen,
            )
        )
    )

TorchCatAggregator ¶

Bases: SequenceAggregator[Tensor]

An aggregator that concatenates tensors using PyTorch's torch.cat function. Concatenation is done along the first dimension of the chunks.

call ¶

__call__(tensor_sequence: LazyChunkSequence[Tensor]) -> Tensor

Aggregates tensors from a single-level generator into a single tensor by concatenating them. This method is a straightforward way to combine a sequence of tensors into one larger tensor.

PARAMETER	DESCRIPTION
`tensor_sequence`	Object wrapping a generator that yields `torch.Tensor` objects. TYPE: `LazyChunkSequence[Tensor]`

RETURNS	DESCRIPTION
`Tensor`	A single tensor formed by concatenating all tensors from the generator. The concatenation is performed along the default dimension (0).

Source code in src/pydvl/influence/torch/util.py

def __call__(
    self,
    tensor_sequence: LazyChunkSequence[torch.Tensor],
) -> torch.Tensor:
    """
    Aggregates tensors from a single-level generator into a single tensor by
    concatenating them. This method is a straightforward way to combine a sequence
    of tensors into one larger tensor.

    Args:
        tensor_sequence: Object wrapping a generator that yields `torch.Tensor`
            objects.

    Returns:
        A single tensor formed by concatenating all tensors from the generator.
            The concatenation is performed along the default dimension (0).
    """
    t_gen = cast(Iterator[torch.Tensor], tensor_sequence.generator_factory())
    len_generator = tensor_sequence.len_generator
    if len_generator is not None:
        t_gen = cast(
            Iterator[torch.Tensor], tqdm(t_gen, total=len_generator, desc="Blocks")
        )

    return torch.cat(list(t_gen))

TorchLinalgEighException ¶

TorchLinalgEighException(original_exception: RuntimeError)

Bases: Exception

Exception to wrap a RunTimeError raised by torch.linalg.eigh, when used with large matrices, see https://github.com/pytorch/pytorch/issues/92141

Source code in src/pydvl/influence/torch/util.py

def __init__(self, original_exception: RuntimeError):
    func = torch.linalg.eigh
    err_msg = (
        f"A RunTimeError occurred in '{func.__module__}.{func.__qualname__}'. "
        "This might be related to known issues with "
        "[torch.linalg.eigh][torch.linalg.eigh] on certain matrix sizes.\n "
        "For more details, refer to "
        "https://github.com/pytorch/pytorch/issues/92141. \n"
        "In this case, consider to use a different implementation, which does not "
        "depend on the usage of [torch.linalg.eigh][torch.linalg.eigh].\n"
        f" Inspect the original exception message: \n{str(original_exception)}"
    )
    super().__init__(err_msg)

TorchNumpyConverter ¶

TorchNumpyConverter(device: Optional[device] = None)

Bases: NumpyConverter[Tensor]

Helper class for converting between torch.Tensor and numpy.ndarray

PARAMETER	DESCRIPTION
`device`	Optional device parameter to move the resulting torch tensors to the specified device TYPE: `Optional[device]` DEFAULT: `None`

Source code in src/pydvl/influence/torch/util.py

def __init__(self, device: Optional[torch.device] = None):
    self.device = device

from_numpy ¶

from_numpy(x: NDArray) -> Tensor

Convert a numpy.ndarray to torch.Tensor and optionally move it to a provided device

Source code in src/pydvl/influence/torch/util.py

def from_numpy(self, x: NDArray) -> torch.Tensor:
    """
    Convert a [numpy.ndarray][numpy.ndarray] to [torch.Tensor][torch.Tensor] and
    optionally move it to a provided device
    """
    t = torch.from_numpy(x)
    if self.device is not None:
        t = t.to(self.device)
    return t

to_numpy ¶

to_numpy(x: Tensor) -> NDArray

Convert a detached torch.Tensor to numpy.ndarray

Source code in src/pydvl/influence/torch/util.py

def to_numpy(self, x: torch.Tensor) -> NDArray:
    """
    Convert a detached [torch.Tensor][torch.Tensor] to
    [numpy.ndarray][numpy.ndarray]
    """
    arr: NDArray = x.cpu().numpy()
    return arr

align_structure ¶

align_structure(
    source: Mapping[str, Tensor], target: TorchTensorContainerType
) -> Dict[str, Tensor]

This function transforms target to have the same structure as source, i.e., it should be a dictionary with the same keys as source and each corresponding value in target should have the same shape as the value in source.

PARAMETER	DESCRIPTION
`source`	The reference dictionary containing PyTorch tensors. TYPE: `Mapping[str, Tensor]`
`target`	The input to be harmonized. It can be a dictionary, tuple, or tensor. TYPE: `TorchTensorContainerType`

RETURNS	DESCRIPTION
`Dict[str, Tensor]`	The harmonized version of `target`.

RAISES	DESCRIPTION
`ValueError`	If `target` cannot be harmonized to match `source`.

Source code in src/pydvl/influence/torch/util.py

def align_structure(
    source: Mapping[str, torch.Tensor],
    target: TorchTensorContainerType,
) -> Dict[str, torch.Tensor]:
    """
    This function transforms `target` to have the same structure as `source`, i.e.,
    it should be a dictionary with the same keys as `source` and each corresponding
    value in `target` should have the same shape as the value in `source`.

    Args:
        source: The reference dictionary containing PyTorch tensors.
        target: The input to be harmonized. It can be a dictionary, tuple, or tensor.

    Returns:
        The harmonized version of `target`.

    Raises:
        ValueError: If `target` cannot be harmonized to match `source`.
    """

    tangent_dict: Dict[str, torch.Tensor]

    if isinstance(target, dict):
        if list(target.keys()) != list(source.keys()):
            raise ValueError("The keys in 'target' do not match the keys in 'source'.")

        if [v.shape for v in target.values()] != [v.shape for v in source.values()]:
            raise ValueError(
                "The shapes of the values in 'target' do not match the shapes "
                "of the values in 'source'."
            )

        tangent_dict = target

    elif isinstance(target, tuple) or isinstance(target, list):
        if [v.shape for v in target] != [v.shape for v in source.values()]:
            raise ValueError(
                "'target' is a tuple/list but its elements' shapes do not match "
                "the shapes of the values in 'source'."
            )

        tangent_dict = dict(zip(source.keys(), target))

    elif isinstance(target, torch.Tensor):
        try:
            tangent_dict = dict(
                zip(
                    source.keys(),
                    reshape_vector_to_tensors(
                        target, [p.shape for p in source.values()]
                    ),
                )
            )
        except Exception as e:
            raise ValueError(
                f"'target' is a tensor but cannot be reshaped to match 'source'. "
                f"Original error: {e}"
            )

    else:
        raise ValueError(f"'target' is of type {type(target)} which is not supported.")

    return tangent_dict

align_with_model ¶

align_with_model(
    x: TorchTensorContainerType, model: Module
) -> Dict[str, Tensor]

Aligns an input to the model's parameter structure, i.e. transforms it into a dict with the same keys as model.named_parameters() and matching tensor shapes

PARAMETER	DESCRIPTION
`x`	The input to be aligned. It can be a dictionary, tuple, or tensor. TYPE: `TorchTensorContainerType`
`model`	model to use for alignment TYPE: `Module`

RETURNS	DESCRIPTION
`Dict[str, Tensor]`	The aligned version of `x`.

RAISES	DESCRIPTION
`ValueError`	If `x` cannot be aligned to match the model's parameters .

Source code in src/pydvl/influence/torch/util.py

def align_with_model(
    x: TorchTensorContainerType, model: torch.nn.Module
) -> Dict[str, torch.Tensor]:
    """
    Aligns an input to the model's parameter structure, i.e. transforms it into a dict
    with the same keys as model.named_parameters() and matching tensor shapes

    Args:
        x: The input to be aligned. It can be a dictionary, tuple, or tensor.
        model: model to use for alignment

    Returns:
        The aligned version of `x`.

    Raises:
        ValueError: If `x` cannot be aligned to match the model's parameters .

    """
    model_params = get_model_parameters(model, detach=False)
    return align_structure(model_params, x)

empirical_cross_entropy_loss_fn ¶

empirical_cross_entropy_loss_fn(
    model_output: Tensor, *args, **kwargs
) -> Tensor

Computes the empirical cross entropy loss of the model output. This is the cross entropy loss of the model output without the labels. The function takes all the usual arguments and keyword arguments of the cross entropy loss function, so that it is compatible with the PyTorch cross entropy loss function. However, it ignores everything except the first argument, which is the model output.

PARAMETER	DESCRIPTION
`model_output`	The output of the model. TYPE: `Tensor`

Source code in src/pydvl/influence/torch/util.py

def empirical_cross_entropy_loss_fn(
    model_output: torch.Tensor, *args, **kwargs
) -> torch.Tensor:
    """
    Computes the empirical cross entropy loss of the model output. This is the
    cross entropy loss of the model output without the labels. The function takes
    all the usual arguments and keyword arguments of the cross entropy loss
    function, so that it is compatible with the PyTorch cross entropy loss
    function. However, it ignores everything except the first argument, which is
    the model output.

    Args:
        model_output: The output of the model.
    """
    probs_ = torch.softmax(model_output, dim=1)
    log_probs_ = torch.log(probs_)
    log_probs_ = torch.where(
        torch.isfinite(log_probs_), log_probs_, torch.zeros_like(log_probs_)
    )
    return torch.sum(log_probs_ * probs_.detach() ** 0.5)

flatten_dimensions ¶

flatten_dimensions(
    tensors: Iterable[Tensor],
    shape: Optional[Tuple[int, ...]] = None,
    concat_at: int = -1,
) -> Tensor

Flattens the dimensions of each tensor in the given iterable and concatenates them along a specified dimension.

This function takes an iterable of PyTorch tensors and flattens each tensor. Optionally, each tensor can be reshaped to a specified shape before concatenation. The concatenation is performed along the dimension specified by concat_at.

PARAMETER	DESCRIPTION
`tensors`	An iterable containing PyTorch tensors to be flattened and concatenated. TYPE: `Iterable[Tensor]`
`shape`	A tuple representing the desired shape to which each tensor is reshaped before concatenation. If None, tensors are flattened to 1D. TYPE: `Optional[Tuple[int, ...]]` DEFAULT: `None`
`concat_at`	The dimension along which to concatenate the tensors. TYPE: `int` DEFAULT: `-1`

RETURNS	DESCRIPTION
`Tensor`	A single tensor resulting from the concatenation of the input tensors,
`Tensor`	each either flattened or reshaped as specified.

Example

>>> tensors = [torch.tensor([[1, 2], [3, 4]]), torch.tensor([[5, 6], [7, 8]])]
>>> flatten_dimensions(tensors)
tensor([1, 2, 3, 4, 5, 6, 7, 8])

>>> flatten_dimensions(tensors, shape=(2, 2), concat_at=0)
tensor([[1, 2],
        [3, 4],
        [5, 6],
        [7, 8]])

Source code in src/pydvl/influence/torch/util.py

def flatten_dimensions(
    tensors: Iterable[torch.Tensor],
    shape: Optional[Tuple[int, ...]] = None,
    concat_at: int = -1,
) -> torch.Tensor:
    """
    Flattens the dimensions of each tensor in the given iterable and concatenates them
    along a specified dimension.

    This function takes an iterable of PyTorch tensors and flattens each tensor.
    Optionally, each tensor can be reshaped to a specified shape before concatenation.
    The concatenation is performed along the dimension specified by `concat_at`.

    Args:
        tensors: An iterable containing PyTorch tensors to be flattened
            and concatenated.
        shape: A tuple representing the desired shape to which each tensor is reshaped
            before concatenation. If None, tensors are flattened to 1D.
        concat_at: The dimension along which to concatenate the tensors.

    Returns:
        A single tensor resulting from the concatenation of the input tensors,
        each either flattened or reshaped as specified.

    ??? Example
        ```pycon
        >>> tensors = [torch.tensor([[1, 2], [3, 4]]), torch.tensor([[5, 6], [7, 8]])]
        >>> flatten_dimensions(tensors)
        tensor([1, 2, 3, 4, 5, 6, 7, 8])

        >>> flatten_dimensions(tensors, shape=(2, 2), concat_at=0)
        tensor([[1, 2],
                [3, 4],
                [5, 6],
                [7, 8]])
        ```
    """
    return torch.cat(
        [t.reshape(-1) if shape is None else t.reshape(*shape) for t in tensors],
        dim=concat_at,
    )

get_model_parameters ¶

get_model_parameters(
    model: Module, detach: bool = True, require_grad_only: bool = True
) -> Dict[str, Tensor]

Returns a dictionary of model parameters, optionally restricted to parameters requiring gradients and optionally detaching them from the computation graph.

PARAMETER	DESCRIPTION
`model`	The neural network model. TYPE: `Module`
`detach`	Whether to detach the parameters from the computation graph. TYPE: `bool` DEFAULT: `True`
`require_grad_only`	Whether to include only parameters that require gradients. TYPE: `bool` DEFAULT: `True`

RETURNS	DESCRIPTION
`Dict[str, Tensor]`	A dict of named model parameters.

Source code in src/pydvl/influence/torch/util.py

def get_model_parameters(
    model: torch.nn.Module, detach: bool = True, require_grad_only: bool = True
) -> Dict[str, torch.Tensor]:
    """
    Returns a dictionary of model parameters, optionally restricted to parameters
    requiring gradients and optionally detaching them from the computation
    graph.

    Args:
        model: The neural network model.
        detach: Whether to detach the parameters from the computation graph.
        require_grad_only: Whether to include only parameters that require gradients.

    Returns:
        A dict of named model parameters.
    """

    parameter_dict = {}
    for k, p in model.named_parameters():
        if require_grad_only and not p.requires_grad:
            continue
        parameter_dict[k] = p.detach() if detach else p

    return parameter_dict

reshape_vector_to_tensors ¶

reshape_vector_to_tensors(
    input_vector: Tensor, target_shapes: Iterable[Tuple[int, ...]]
) -> Tuple[Tensor, ...]

Reshape a 1D tensor into multiple tensors with specified shapes.

This function takes a 1D tensor (input_vector) and reshapes it into a series of tensors with shapes given by 'target_shapes'. The reshaped tensors are returned as a tuple in the same order as their corresponding shapes.

Note

The total number of elements in 'input_vector' must be equal to the sum of the products of the shapes in 'target_shapes'.

PARAMETER	DESCRIPTION
`input_vector`	The 1D tensor to be reshaped. Must be 1D. TYPE: `Tensor`
`target_shapes`	An iterable of tuples. Each tuple defines the shape of a tensor to be reshaped from the 'input_vector'. TYPE: `Iterable[Tuple[int, ...]]`

RETURNS	DESCRIPTION
`Tuple[Tensor, ...]`	A tuple of reshaped tensors.

RAISES	DESCRIPTION
`ValueError`	If 'input_vector' is not a 1D tensor or if the total number of elements in 'input_vector' does not match the sum of the products of the shapes in 'target_shapes'.

Source code in src/pydvl/influence/torch/util.py

def reshape_vector_to_tensors(
    input_vector: torch.Tensor, target_shapes: Iterable[Tuple[int, ...]]
) -> Tuple[torch.Tensor, ...]:
    """
    Reshape a 1D tensor into multiple tensors with specified shapes.

    This function takes a 1D tensor (input_vector) and reshapes it into a series of
    tensors with shapes given by 'target_shapes'.
    The reshaped tensors are returned as a tuple in the same order
    as their corresponding shapes.

    Note:
        The total number of elements in 'input_vector' must be equal to the
            sum of the products of the shapes in 'target_shapes'.

    Args:
        input_vector: The 1D tensor to be reshaped. Must be 1D.
        target_shapes: An iterable of tuples. Each tuple defines the shape of a tensor
            to be reshaped from the 'input_vector'.

    Returns:
        A tuple of reshaped tensors.

    Raises:
        ValueError: If 'input_vector' is not a 1D tensor or if the total
            number of elements in 'input_vector' does not
            match the sum of the products of the shapes in 'target_shapes'.
    """

    if input_vector.dim() != 1:
        raise ValueError("Input vector must be a 1D tensor")

    total_elements = sum(math.prod(shape) for shape in target_shapes)

    if total_elements != input_vector.shape[0]:
        raise ValueError(
            f"The total elements in shapes {total_elements} "
            f"does not match the vector length {input_vector.shape[0]}"
        )

    tensors = []
    start = 0
    for shape in target_shapes:
        size = math.prod(shape)  # compute the total size of the tensor with this shape
        tensors.append(
            input_vector[start : start + size].view(shape)
        )  # slice the vector and reshape it
        start += size
    return tuple(tensors)

safe_torch_linalg_eigh ¶

safe_torch_linalg_eigh(*args: Any, **kwargs: Any) -> tuple[Tensor, Tensor]

A wrapper around torch.linalg.eigh that safely handles potential runtime errors by raising a custom TorchLinalgEighException with more context, especially related to the issues reported in https://github.com/pytorch/pytorch/issues/92141.

PARAMETER	DESCRIPTION
`*args`	Positional arguments passed to `torch.linalg.eigh`. TYPE: `Any` DEFAULT: `()`
`**kwargs`	Keyword arguments passed to `torch.linalg.eigh`. TYPE: `Any` DEFAULT: `{}`

RETURNS	DESCRIPTION
`tuple[Tensor, Tensor]`	The result of calling `torch.linalg.eigh` with the provided arguments.

RAISES	DESCRIPTION
`TorchLinalgEighException`	If a `RuntimeError` occurs during the execution of `torch.linalg.eigh`.

Source code in src/pydvl/influence/torch/util.py

@catch_and_raise_exception(RuntimeError, lambda e: TorchLinalgEighException(e))
def safe_torch_linalg_eigh(
    *args: Any, **kwargs: Any
) -> tuple[torch.Tensor, torch.Tensor]:
    """
    A wrapper around `torch.linalg.eigh` that safely handles potential runtime errors
    by raising a custom `TorchLinalgEighException` with more context,
    especially related to the issues reported in
    [https://github.com/pytorch/pytorch/issues/92141](
    https://github.com/pytorch/pytorch/issues/92141).

    Args:
        *args: Positional arguments passed to `torch.linalg.eigh`.
        **kwargs: Keyword arguments passed to `torch.linalg.eigh`.

    Returns:
        The result of calling `torch.linalg.eigh` with the provided arguments.

    Raises:
        TorchLinalgEighException: If a `RuntimeError` occurs during the execution of
            `torch.linalg.eigh`.
    """
    return cast(tuple[torch.Tensor, torch.Tensor], torch.linalg.eigh(*args, **kwargs))

to_model_device ¶

to_model_device(x: Tensor, model: Module) -> Tensor

Returns the tensor x moved to the device of the model, if device of model is set

PARAMETER	DESCRIPTION
`x`	The tensor to be moved to the device of the model. TYPE: `Tensor`
`model`	The model whose device will be used to move the tensor. TYPE: `Module`

RETURNS	DESCRIPTION
`Tensor`	The tensor `x` moved to the device of the `model`, if device of model is set.

Source code in src/pydvl/influence/torch/util.py

def to_model_device(x: torch.Tensor, model: torch.nn.Module) -> torch.Tensor:
    """
    Returns the tensor `x` moved to the device of the `model`, if device of model is set

    Args:
        x: The tensor to be moved to the device of the model.
        model: The model whose device will be used to move the tensor.

    Returns:
        The tensor `x` moved to the device of the `model`, if device of model is set.
    """
    device = next(model.parameters()).device
    return x.to(device)

torch_dataset_to_dask_array ¶

torch_dataset_to_dask_array(
    dataset: Dataset,
    chunk_size: int,
    total_size: Optional[int] = None,
    resulting_dtype: Type[number] = float32,
) -> Tuple[Array, ...]

Construct tuple of dask arrays from a PyTorch dataset, using dask.delayed

PARAMETER	DESCRIPTION
`dataset`	A PyTorch dataset TYPE: `Dataset`
`chunk_size`	The size of the chunks for the resulting Dask arrays. TYPE: `int`
`total_size`	If the dataset does not implement len, provide the length via this parameter. If None the length of the dataset is inferred via accessing the dataset once. TYPE: `Optional[int]` DEFAULT: `None`
`resulting_dtype`	The dtype of the resulting dask.array.Array TYPE: `Type[number]` DEFAULT: `float32`

Example

import torch
from torch.utils.data import TensorDataset
x = torch.rand((20, 3))
y = torch.rand((20, 1))
dataset = TensorDataset(x, y)
da_x, da_y = torch_dataset_to_dask_array(dataset, 4)

RETURNS	DESCRIPTION
`Tuple[Array, ...]`	Tuple of Dask arrays corresponding to each tensor in the dataset.

Source code in src/pydvl/influence/torch/util.py

def torch_dataset_to_dask_array(
    dataset: Dataset,
    chunk_size: int,
    total_size: Optional[int] = None,
    resulting_dtype: Type[np.number] = np.float32,
) -> Tuple[da.Array, ...]:
    """
    Construct tuple of dask arrays from a PyTorch dataset, using dask.delayed

    Args:
        dataset: A PyTorch [dataset][torch.utils.data.Dataset]
        chunk_size: The size of the chunks for the resulting Dask arrays.
        total_size: If the dataset does not implement len, provide the length
            via this parameter. If None
            the length of the dataset is inferred via accessing the dataset once.
        resulting_dtype: The dtype of the resulting [dask.array.Array][dask.array.Array]

    ??? Example
        ```python
        import torch
        from torch.utils.data import TensorDataset
        x = torch.rand((20, 3))
        y = torch.rand((20, 1))
        dataset = TensorDataset(x, y)
        da_x, da_y = torch_dataset_to_dask_array(dataset, 4)
        ```

    Returns:
        Tuple of Dask arrays corresponding to each tensor in the dataset.
    """

    def _infer_data_len(d_set: Dataset):
        try:
            n_data = len(d_set)  # type:ignore
            if total_size is not None and n_data != total_size:
                raise ValueError(
                    f"The number of samples in the dataset ({n_data}), derived "
                    f"from calling ´len´, does not match the provided "
                    f"total number of samples ({total_size}). "
                    f"Call the function without total_size."
                )
            return n_data
        except TypeError as e:
            err_msg = (
                f"Could not infer the number of samples in the dataset from "
                f"calling ´len´. Original error: {e}."
            )
            if total_size is not None:
                logger.warning(
                    err_msg
                    + f" Using the provided total number of samples {total_size}."
                )
                return total_size
            else:
                logger.warning(
                    err_msg + " Infer the number of samples from the dataset, "
                    "via iterating the dataset once. "
                    "This might induce severe overhead, so consider"
                    "providing total_size, if you know the number of samples "
                    "beforehand."
                )
                idx = 0
                while True:
                    try:
                        t = d_set[idx]
                        if all(_t.numel() == 0 for _t in t):
                            return idx
                        idx += 1

                    except IndexError:
                        return idx

    sample = dataset[0]
    if not isinstance(sample, tuple):
        sample = (sample,)

    def _get_chunk(
        start_idx: int, stop_idx: int, d_set: Dataset
    ) -> Tuple[torch.Tensor, ...]:
        try:
            t = d_set[start_idx:stop_idx]
            if not isinstance(t, tuple):
                t = (t,)
            return t  # type:ignore
        except Exception:
            nested_tensor_list = [
                [d_set[idx][k] for idx in range(start_idx, stop_idx)]
                for k in range(len(sample))
            ]
            return tuple(map(torch.stack, nested_tensor_list))

    n_samples = _infer_data_len(dataset)
    chunk_indices = [
        (i, min(i + chunk_size, n_samples)) for i in range(0, n_samples, chunk_size)
    ]
    delayed_dataset = dask.delayed(dataset)
    delayed_chunks = [
        dask.delayed(partial(_get_chunk, start, stop))(delayed_dataset)
        for (start, stop) in chunk_indices
    ]

    delayed_arrays_dict: Dict[int, List[da.Array]] = {k: [] for k in range(len(sample))}

    for chunk, (start, stop) in zip(delayed_chunks, chunk_indices):
        for tensor_idx, sample_tensor in enumerate(sample):
            delayed_tensor = da.from_delayed(
                dask.delayed(lambda t: t.cpu().numpy())(chunk[tensor_idx]),
                shape=(stop - start, *sample_tensor.shape),
                dtype=resulting_dtype,
            )

            delayed_arrays_dict[tensor_idx].append(delayed_tensor)

    return tuple(
        da.concatenate(array_list) for array_list in delayed_arrays_dict.values()
    )

pydvl.influence.torch.util ¶

TorchTensorContainerType module-attribute ¶

BlockMode ¶

EkfacRepresentation dataclass ¶

get_layer_evecs ¶

ModelInfoMixin ¶

ModelParameterDictBuilder dataclass ¶

build ¶

build_from_block_mode ¶

NestedTorchCatAggregator ¶

__call__ ¶

TorchCatAggregator ¶

__call__ ¶

TorchLinalgEighException ¶

TorchNumpyConverter ¶

from_numpy ¶

to_numpy ¶

align_structure ¶

align_with_model ¶

empirical_cross_entropy_loss_fn ¶

flatten_dimensions ¶

get_model_parameters ¶

reshape_vector_to_tensors ¶

safe_torch_linalg_eigh ¶

to_model_device ¶

torch_dataset_to_dask_array ¶

TorchTensorContainerType `module-attribute` ¶

EkfacRepresentation `dataclass` ¶

ModelParameterDictBuilder `dataclass` ¶

call ¶

call ¶