Counterfactual explanation is a technique used in AI to explain why a model made a particular choice. In the context of recommender systems, it can be described as “a set of minimal actions performed by the user that, if removed, changes the recommendation to a different item”[^1]. Generating a counterfactual sequence, i.e. changing the minimal number of items needed to obtain a different prediction, hints at which items caused the prediction in the first place, effectively providing an explanation.
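As a minimal sketch of how such a counterfactual can be searched for in a black-box setting, the snippet below greedily drops past interactions until the top recommendation flips; `recommend(history)` is a hypothetical stand-in for the model, and the result is a valid counterfactual but not guaranteed to be minimal.

```python
def greedy_counterfactual(history, recommend):
    """Drop interactions (most recent first) until the top-1 recommendation
    changes; the dropped set is a counterfactual explanation."""
    original = recommend(history)
    remaining = list(history)
    removed = []
    for item in reversed(history):
        remaining.remove(item)   # tentatively drop this interaction
        removed.append(item)
        new_top = recommend(remaining)
        if new_top != original:
            return removed, new_top  # recommendation flipped: done
    return None, original  # no counterfactual found within the history
```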
Counterfactual reasoning has been used not only to provide explanations, but also for data augmentation[^2] to counteract the high sparsity of recommendation datasets; to reduce popularity bias[^3][^4]; or as a data augmentation technique for contrastive learning[^5]. The first and last cases work because counterfactual data is data the user might plausibly have interacted with, so it can be thought of as sampled from the same distribution as the real data, which makes it preferable to other kinds of augmentation.
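As a toy illustration of the augmentation idea (not the exact method of Chen et al.), one can swap a single interaction for an item the user might plausibly have chosen instead; `plausible_alternatives(item)` is a hypothetical helper, e.g. nearest neighbours in an item-embedding space:

```python
import random

def counterfactual_augment(sequence, plausible_alternatives, n_samples=3):
    """Build augmented sequences by replacing one interaction with a
    plausible alternative, keeping the rest of the history intact."""
    augmented = []
    for _ in range(n_samples):
        pos = random.randrange(len(sequence))  # pick an interaction to replace
        alt = random.choice(plausible_alternatives(sequence[pos]))
        augmented.append(sequence[:pos] + [alt] + sequence[pos + 1:])
    return augmented
```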
Counterfactual sequences have been applied to both sequential and non-sequential recommender systems, and they have been generated with perturbation models such as VAEs[^6], with reinforcement learning techniques[^2], or with graph-based techniques[^1].
Techniques can be model-agnostic (black-box) if they do not require knowing anything about the model beyond its outputs; gray-box, if some information about the model has to be known, such as its gradients; or white-box, where the model has to be completely open and its architecture has to be changed in order to provide an explanation.
Zhou et al.[^7] implemented all three types of models, using attention (even though Attention is not Explanation![^8]) for the white-box, adversarial perturbation for the gray-box, and counterfactual perturbation for the black-box.
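To make the gray-box case concrete, here is a rough PyTorch sketch (an assumption, not Zhou et al.'s exact procedure): with gradient access, each past interaction can be scored by the gradient of the recommended item's score with respect to that interaction's embedding, which is also the starting point for adversarial perturbations.

```python
import torch

def gradient_saliency(model, item_embeddings, target_item):
    """Score each interaction by how strongly the recommended item's score
    reacts to its embedding (hypothetical `model`: embeddings -> item scores)."""
    emb = item_embeddings.clone().detach().requires_grad_(True)
    scores = model(emb)             # shape: (num_items,)
    scores[target_item].backward()  # gradient of the recommended item's score
    return emb.grad.norm(dim=-1)    # one influence score per interaction
```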
Counterfactual sequences are evaluated in two ways:
- Model fidelity, which expresses the percentage of instances that the technique is able to explain, if it is used for explanation (see the sketch after this list);
- Standard performance evaluation of the recommendation model with and without the counterfactual augmentation, if counterfactual reasoning is used as an augmentation technique.
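A minimal sketch of the first metric, assuming `explanations` maps each instance to the counterfactual found for it (`None` when the search failed):

```python
def model_fidelity(explanations):
    """Fraction of instances for which a counterfactual explanation exists."""
    explained = sum(1 for e in explanations.values() if e is not None)
    return explained / len(explanations)

# e.g. model_fidelity({"u1": ["i3"], "u2": None, "u3": ["i1"]}) == 2 / 3
```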
tags: #ai-explainability/counterfactual #recommender-systems
Footnotes

[^1]: Ghazimatin et al. - PRINCE: Provider-side Interpretability with Counterfactual Explanations in Recommender Systems
[^2]: Chen et al. - Data Augmented Sequential Recommendation Based on Counterfactual Thinking
[^3]: Wei et al. - Model-Agnostic Counterfactual Reasoning for Eliminating Popularity Bias in Recommender System
[^4]: Ren et al. - Disentangled Counterfactual Reasoning for Unbiased Sequential Recommendation
[^5]: Wang et al. - Explanation Guided Contrastive Learning for Sequential Recommendation
[^6]: Xu et al. - Learning Causal Explanations for Recommendation
[^7]: Zhou et al. - From Intrinsic to Counterfactual: On the Explainability of Contextualized Recommender Systems
[^8]: Jain and Wallace - Attention is not Explanation