RaCT

Reference:

Sam Lobel et al. “RaCT: Towards Amortized Ranking-Critical Training for Collaborative Filtering.” in ICLR 2020.

class recbole.model.general_recommender.ract.RaCT(config, dataset)[source]

Bases: recbole.model.abstract_recommender.GeneralRecommender, recbole.model.abstract_recommender.AutoEncoderMixin

RaCT is a collaborative filtering model which uses methods based on actor-critic reinforcement learning for training.

We implement the RaCT model with only user dataloader.

calculate_ac_loss(interaction)[source]
calculate_actor_loss(interaction)[source]
calculate_critic_loss(interaction)[source]
calculate_loss(interaction)[source]

Calculate the training loss for a batch data.

Parameters

interaction (Interaction) – Interaction class of the batch.

Returns

Training loss, shape: []

Return type

torch.Tensor

calculate_ndcg(predict_matrix, true_matrix, input_matrix, k)[source]
construct_critic_input(actor_loss)[source]
construct_critic_layers(layer_dims)[source]
critic_forward(actor_loss)[source]
forward(rating_matrix)[source]

Defines the computation performed at every call.

Should be overridden by all subclasses.

Note

Although the recipe for forward pass needs to be defined within this function, one should call the Module instance afterwards instead of this since the former takes care of running the registered hooks while the latter silently ignores them.

full_sort_predict(interaction)[source]

full sort prediction function. Given users, calculate the scores between users and all candidate items.

Parameters

interaction (Interaction) – Interaction class of the batch.

Returns

Predicted scores for given users and all candidate items, shape: [n_batch_users * n_candidate_items]

Return type

torch.Tensor

input_type = 2
mlp_layers(layer_dims)[source]
predict(interaction)[source]

Predict the scores between users and items.

Parameters

interaction (Interaction) – Interaction class of the batch.

Returns

Predicted scores for given users and items, shape: [batch_size]

Return type

torch.Tensor

reparameterize(mu, logvar)[source]
training: bool