recbole.model.layers¶

Common Layers in recommender system

class recbole.model.layers.AttLayer(in_dim, att_dim)[source]¶

Bases: Module

Calculate the attention signal(weight) according the input tensor.

Parameters:: infeatures (torch.FloatTensor) – A 3D input tensor with shape of[batch_size, M, embed_dim].
Returns:: Attention weight of input. shape of [batch_size, M].
Return type:: torch.FloatTensor

forward(infeatures)[source]¶

Defines the computation performed at every call.

Should be overridden by all subclasses.

Note

Although the recipe for forward pass needs to be defined within this function, one should call the Module instance afterwards instead of this since the former takes care of running the registered hooks while the latter silently ignores them.

training: bool¶

class recbole.model.layers.BaseFactorizationMachine(reduce_sum=True)[source]¶

Bases: Module

Calculate FM result over the embeddings

Parameters:: reduce_sum – bool, whether to sum the result, default is True.

Input:: input_x: tensor, A 3D tensor with shape:(batch_size,field_size,embed_dim).
Output: output: tensor, A 3D tensor with shape: (batch_size,1) or (batch_size, embed_dim).

forward(input_x)[source]¶

Defines the computation performed at every call.

Should be overridden by all subclasses.

Note

Although the recipe for forward pass needs to be defined within this function, one should call the Module instance afterwards instead of this since the former takes care of running the registered hooks while the latter silently ignores them.

training: bool¶

class recbole.model.layers.BiGNNLayer(in_dim, out_dim)[source]¶

Bases: Module

Propagate a layer of Bi-interaction GNN

\[output = (L+I)EW_1 + LE \otimes EW_2\]

forward(lap_matrix, eye_matrix, features)[source]¶

Defines the computation performed at every call.

Should be overridden by all subclasses.

Note

Although the recipe for forward pass needs to be defined within this function, one should call the Module instance afterwards instead of this since the former takes care of running the registered hooks while the latter silently ignores them.

training: bool¶

class recbole.model.layers.CNNLayers(channels, kernels, strides, activation='relu', init_method=None)[source]¶

Bases: Module

Parameters:

channels (-) – a list contains the channels of each layer in cnn layers
kernel (-) – a list contains the kernels of each layer in cnn layers
strides (-) – a list contains the channels of each layer in cnn layers
activation (-) – activation function after each layer in mlp layers. Default: ‘relu’ candidates: ‘sigmoid’, ‘tanh’, ‘relu’, ‘leekyrelu’, ‘none’

Shape:

Input: \((N, C_{in}, H_{in}, W_{in})\)
Output: \((N, C_{out}, H_{out}, W_{out})\) where

\[H_{out} = \left\lfloor\frac{H_{in} + 2 \times \text{padding}[0] - \text{dilation}[0] \times (\text{kernel\_size}[0] - 1) - 1}{\text{stride}[0]} + 1\right\rfloor\]

\[W_{out} = \left\lfloor\frac{W_{in} + 2 \times \text{padding}[1] - \text{dilation}[1] \times (\text{kernel\_size}[1] - 1) - 1}{\text{stride}[1]} + 1\right\rfloor\]

Examples:

>>> m = CNNLayers([1, 32, 32], [2,2], [2,2], 'relu')
>>> input = torch.randn(128, 1, 64, 64)
>>> output = m(input)
>>> print(output.size())
>>> torch.Size([128, 32, 16, 16])

forward(input_feature)[source]¶

Defines the computation performed at every call.

Should be overridden by all subclasses.

Note

Although the recipe for forward pass needs to be defined within this function, one should call the Module instance afterwards instead of this since the former takes care of running the registered hooks while the latter silently ignores them.

init_weights(module)[source]¶

training: bool¶

class recbole.model.layers.ContextSeqEmbAbstractLayer[source]¶

Bases: Module

For Deep Interest Network and feature-rich sequential recommender systems, return features embedding matrices.

embed_float_fields(float_fields, type, embed=True)[source]¶

Get the embedding of float fields. In the following three functions(“embed_float_fields” “embed_token_fields” “embed_token_seq_fields”) when the type is user, [batch_size, max_item_length] should be recognised as [batch_size]

Parameters:

float_fields (torch.Tensor) – [batch_size, max_item_length, num_float_field]
type (str) – user or item
embed (bool) – embed or not

Returns:

float fields embedding. [batch_size, max_item_length, num_float_field, embed_dim]

Return type:

torch.Tensor

embed_float_seq_fields(float_seq_fields, type)[source]¶

Embed the float sequence feature columns

Parameters:

float_seq_fields (torch.FloatTensor) – The input tensor. shape of [batch_size, seq_len, 2]
mode (str) – How to aggregate the embedding of feature in this field. default=mean

Returns:

The result embedding tensor of float sequence columns.

Return type:

torch.FloatTensor

embed_input_fields(user_idx, item_idx)[source]¶

Get the embedding of user_idx and item_idx

Parameters:

user_idx (torch.Tensor) – interaction[‘user_id’]
item_idx (torch.Tensor) – interaction[‘item_id_list’]

Returns:

embedding of user feature and item feature

Return type:

dict

embed_token_fields(token_fields, type)[source]¶

Get the embedding of token fields

Parameters:

token_fields (torch.Tensor) – input, [batch_size, max_item_length, num_token_field]
type (str) – user or item

Returns:

token fields embedding, [batch_size, max_item_length, num_token_field, embed_dim]

Return type:

torch.Tensor

embed_token_seq_fields(token_seq_fields, type)[source]¶

Get the embedding of token_seq fields.

Parameters:

token_seq_fields (torch.Tensor) – input, [batch_size, max_item_length, seq_len]`
type (str) – user or item
mode (str) – mean/max/sum

Returns:

result [batch_size, max_item_length, num_token_seq_field, embed_dim]

Return type:

torch.Tensor

forward(user_idx, item_idx)[source]¶

Defines the computation performed at every call.

Should be overridden by all subclasses.

Note

Although the recipe for forward pass needs to be defined within this function, one should call the Module instance afterwards instead of this since the former takes care of running the registered hooks while the latter silently ignores them.

get_embedding()[source]¶: get embedding of all features.

get_fields_name_dim()[source]¶: get user feature field and item feature field.

training: bool¶

class recbole.model.layers.ContextSeqEmbLayer(dataset, embedding_size, pooling_mode, device)[source]¶

Bases: ContextSeqEmbAbstractLayer

For Deep Interest Network, return all features (including user features and item features) embedding matrices.

training: bool¶

class recbole.model.layers.Dice(emb_size)[source]¶

Bases: Module

Dice activation function

\[f(s)=p(s) \cdot s+(1-p(s)) \cdot \alpha s\]

\[p(s)=\frac{1} {1 + e^{-\frac{s-E[s]} {\sqrt {Var[s] + \epsilon}}}}\]

forward(score)[source]¶

Defines the computation performed at every call.

Should be overridden by all subclasses.

Note

Although the recipe for forward pass needs to be defined within this function, one should call the Module instance afterwards instead of this since the former takes care of running the registered hooks while the latter silently ignores them.

training: bool¶

class recbole.model.layers.FLEmbedding(field_dims, offsets, embed_dim)[source]¶

Bases: Module

Embedding for float fields.

Parameters:

field_dims – list, the number of float in each float fields
offsets – list, the dimension offset of each float field
embed_dim – int, the dimension of output embedding vectors

Input:: input_x: tensor, A 3D tensor with shape:(batch_size,field_size,2).

Returns:: tensor, A 3D tensor with shape: (batch_size,field_size,embed_dim).
Return type:: output

forward(input_x)[source]¶

Defines the computation performed at every call.

Should be overridden by all subclasses.

Note

Although the recipe for forward pass needs to be defined within this function, one should call the Module instance afterwards instead of this since the former takes care of running the registered hooks while the latter silently ignores them.

training: bool¶

class recbole.model.layers.FMEmbedding(field_dims, offsets, embed_dim)[source]¶

Bases: Module

Embedding for token fields.

Parameters:

field_dims – list, the number of tokens in each token fields
offsets – list, the dimension offset of each token field
embed_dim – int, the dimension of output embedding vectors

Input:: input_x: tensor, A 3D tensor with shape:(batch_size,field_size).

Returns:: tensor, A 3D tensor with shape: (batch_size,field_size,embed_dim).
Return type:: output

forward(input_x)[source]¶

Defines the computation performed at every call.

Should be overridden by all subclasses.

Note

Although the recipe for forward pass needs to be defined within this function, one should call the Module instance afterwards instead of this since the former takes care of running the registered hooks while the latter silently ignores them.

training: bool¶

class recbole.model.layers.FMFirstOrderLinear(config, dataset, output_dim=1)[source]¶

Bases: Module

Calculate the first order score of the input features. This class is a member of ContextRecommender, you can call it easily when inherit ContextRecommender.

embed_float_fields(float_fields)[source]¶

Embed the float feature columns

Parameters:

float_fields (torch.FloatTensor) – The input dense tensor. shape of [batch_size, num_float_field, 2]
embed (bool) – Return the embedding of columns or just the columns itself. Defaults to True.

Returns:

The result embedding tensor of float columns.

Return type:

torch.FloatTensor

embed_float_seq_fields(float_seq_fields, mode='mean')[source]¶

Embed the float sequence feature columns

Parameters:

float_seq_fields (torch.LongTensor) – The input tensor. shape of [batch_size, seq_len, 2]
mode (str) – How to aggregate the embedding of feature in this field. default=mean

Returns:

The result embedding tensor of float sequence columns.

Return type:

torch.FloatTensor

embed_token_fields(token_fields)[source]¶

Calculate the first order score of token feature columns

Parameters:: token_fields (torch.LongTensor) – The input tensor. shape of [batch_size, num_token_field]
Returns:: The first order score of token feature columns
Return type:: torch.FloatTensor

embed_token_seq_fields(token_seq_fields)[source]¶

Calculate the first order score of token sequence feature columns

Parameters:: token_seq_fields (torch.LongTensor) – The input tensor. shape of [batch_size, seq_len]
Returns:: The first order score of token sequence feature columns
Return type:: torch.FloatTensor

forward(interaction)[source]¶

Defines the computation performed at every call.

Should be overridden by all subclasses.

Note

Although the recipe for forward pass needs to be defined within this function, one should call the Module instance afterwards instead of this since the former takes care of running the registered hooks while the latter silently ignores them.

training: bool¶

class recbole.model.layers.FeatureSeqEmbLayer(dataset, embedding_size, selected_features, pooling_mode, device)[source]¶

Bases: ContextSeqEmbAbstractLayer

For feature-rich sequential recommenders, return item features embedding matrices according to selected features.

training: bool¶

class recbole.model.layers.FeedForward(hidden_size, inner_size, hidden_dropout_prob, hidden_act, layer_norm_eps)[source]¶

Bases: Module

Point-wise feed-forward layer is implemented by two dense layers.

Parameters:: input_tensor (torch.Tensor) – the input of the point-wise feed-forward layer
Returns:: the output of the point-wise feed-forward layer
Return type:: hidden_states (torch.Tensor)

forward(input_tensor)[source]¶

Defines the computation performed at every call.

Should be overridden by all subclasses.

Note

Although the recipe for forward pass needs to be defined within this function, one should call the Module instance afterwards instead of this since the former takes care of running the registered hooks while the latter silently ignores them.

gelu(x)[source]¶

Implementation of the gelu activation function.

For information: OpenAI GPT’s gelu is slightly different (and gives slightly different results):

0.5 * x * (1 + torch.tanh(math.sqrt(2 / math.pi) * (x + 0.044715 * torch.pow(x, 3))))

Also see https://arxiv.org/abs/1606.08415

get_hidden_act(act)[source]¶

swish(x)[source]¶

training: bool¶

class recbole.model.layers.ItemToInterestAggregation(seq_len, hidden_size, k_interests=5)[source]¶

Bases: Module

forward(input_tensor)[source]¶

Defines the computation performed at every call.

Should be overridden by all subclasses.

Note

Although the recipe for forward pass needs to be defined within this function, one should call the Module instance afterwards instead of this since the former takes care of running the registered hooks while the latter silently ignores them.

training: bool¶

class recbole.model.layers.LightMultiHeadAttention(n_heads, k_interests, hidden_size, seq_len, hidden_dropout_prob, attn_dropout_prob, layer_norm_eps)[source]¶

Bases: Module

forward(input_tensor, pos_emb)[source]¶

Defines the computation performed at every call.

Should be overridden by all subclasses.

Note

Although the recipe for forward pass needs to be defined within this function, one should call the Module instance afterwards instead of this since the former takes care of running the registered hooks while the latter silently ignores them.

training: bool¶

transpose_for_scores(x)[source]¶

class recbole.model.layers.LightTransformerEncoder(n_layers=2, n_heads=2, k_interests=5, hidden_size=64, seq_len=50, inner_size=256, hidden_dropout_prob=0.5, attn_dropout_prob=0.5, hidden_act='gelu', layer_norm_eps=1e-12)[source]¶

Bases: Module

One LightTransformerEncoder consists of several LightTransformerLayers.

Parameters:

n_layers (num) – num of transformer layers in transformer encoder. Default: 2
n_heads (num) – num of attention heads for multi-head attention layer. Default: 2
hidden_size (num) – the input and output hidden size. Default: 64
inner_size (num) – the dimensionality in feed-forward layer. Default: 256
hidden_dropout_prob (float) – probability of an element to be zeroed. Default: 0.5
attn_dropout_prob (float) – probability of an attention score to be zeroed. Default: 0.5
hidden_act (str) – activation function in feed-forward layer. Default: ‘gelu’. candidates: ‘gelu’, ‘relu’, ‘swish’, ‘tanh’, ‘sigmoid’
layer_norm_eps (float) – a value added to the denominator for numerical stability. Default: 1e-12

forward(hidden_states, pos_emb, output_all_encoded_layers=True)[source]¶

Parameters:

hidden_states (torch.Tensor) – the input of the TrandformerEncoder
attention_mask (torch.Tensor) – the attention mask for the input hidden_states
output_all_encoded_layers (Bool) – whether output all transformer layers’ output

Returns:

if output_all_encoded_layers is True, return a list consists of all transformer layers’ output, otherwise return a list only consists of the output of last transformer layer.

Return type:

all_encoder_layers (list)

training: bool¶

class recbole.model.layers.LightTransformerLayer(n_heads, k_interests, hidden_size, seq_len, intermediate_size, hidden_dropout_prob, attn_dropout_prob, hidden_act, layer_norm_eps)[source]¶

Bases: Module

One transformer layer consists of a multi-head self-attention layer and a point-wise feed-forward layer.

Parameters:

hidden_states (torch.Tensor) – the input of the multi-head self-attention sublayer
attention_mask (torch.Tensor) – the attention mask for the multi-head self-attention sublayer

Returns:

the output of the point-wise feed-forward sublayer, is the output of the transformer layer

Return type:

feedforward_output (torch.Tensor)

forward(hidden_states, pos_emb)[source]¶

Defines the computation performed at every call.

Should be overridden by all subclasses.

Note

Although the recipe for forward pass needs to be defined within this function, one should call the Module instance afterwards instead of this since the former takes care of running the registered hooks while the latter silently ignores them.

training: bool¶

class recbole.model.layers.MLPLayers(layers, dropout=0.0, activation='relu', bn=False, init_method=None, last_activation=True)[source]¶

Bases: Module

Parameters:

layers (-) – a list contains the size of each layer in mlp layers
dropout (-) – probability of an element to be zeroed. Default: 0
activation (-) – activation function after each layer in mlp layers. Default: ‘relu’. candidates: ‘sigmoid’, ‘tanh’, ‘relu’, ‘leekyrelu’, ‘none’

Shape:

Input: (\(N\), *, \(H_{in}\)) where * means any number of additional dimensions \(H_{in}\) must equal to the first value in layers

Output: (\(N\), *, \(H_{out}\)) where \(H_{out}\) equals to the last value in layers

Examples:

>>> m = MLPLayers([64, 32, 16], 0.2, 'relu')
>>> input = torch.randn(128, 64)
>>> output = m(input)
>>> print(output.size())
>>> torch.Size([128, 16])

forward(input_feature)[source]¶

Defines the computation performed at every call.

Should be overridden by all subclasses.

Note

Although the recipe for forward pass needs to be defined within this function, one should call the Module instance afterwards instead of this since the former takes care of running the registered hooks while the latter silently ignores them.

init_weights(module)[source]¶

training: bool¶

class recbole.model.layers.MultiHeadAttention(n_heads, hidden_size, hidden_dropout_prob, attn_dropout_prob, layer_norm_eps)[source]¶

Bases: Module

Multi-head Self-attention layers, a attention score dropout layer is introduced.

Parameters:

input_tensor (torch.Tensor) – the input of the multi-head self-attention layer
attention_mask (torch.Tensor) – the attention mask for input tensor

Returns:

the output of the multi-head self-attention layer

Return type:

hidden_states (torch.Tensor)

forward(input_tensor, attention_mask)[source]¶

Defines the computation performed at every call.

Should be overridden by all subclasses.

Note

Although the recipe for forward pass needs to be defined within this function, one should call the Module instance afterwards instead of this since the former takes care of running the registered hooks while the latter silently ignores them.

training: bool¶

transpose_for_scores(x)[source]¶

class recbole.model.layers.SequenceAttLayer(mask_mat, att_hidden_size=(80, 40), activation='sigmoid', softmax_stag=False, return_seq_weight=True)[source]¶

Bases: Module

Attention Layer. Get the representation of each user in the batch.

Parameters:

queries (torch.Tensor) – candidate ads, [B, H], H means embedding_size * feat_num
keys (torch.Tensor) – user_hist, [B, T, H]
keys_length (torch.Tensor) – mask, [B]

Returns:

result

Return type:

torch.Tensor

forward(queries, keys, keys_length)[source]¶

Defines the computation performed at every call.

Should be overridden by all subclasses.

Note

Although the recipe for forward pass needs to be defined within this function, one should call the Module instance afterwards instead of this since the former takes care of running the registered hooks while the latter silently ignores them.

training: bool¶

class recbole.model.layers.SparseDropout(p=0.5)[source]¶

Bases: Module

This is a Module that execute Dropout on Pytorch sparse tensor.

forward(x)[source]¶

Defines the computation performed at every call.

Should be overridden by all subclasses.

Note

Although the recipe for forward pass needs to be defined within this function, one should call the Module instance afterwards instead of this since the former takes care of running the registered hooks while the latter silently ignores them.

training: bool¶

class recbole.model.layers.TransformerEncoder(n_layers=2, n_heads=2, hidden_size=64, inner_size=256, hidden_dropout_prob=0.5, attn_dropout_prob=0.5, hidden_act='gelu', layer_norm_eps=1e-12)[source]¶

Bases: Module

One TransformerEncoder consists of several TransformerLayers.

Parameters:

n_layers (num) – num of transformer layers in transformer encoder. Default: 2
n_heads (num) – num of attention heads for multi-head attention layer. Default: 2
hidden_size (num) – the input and output hidden size. Default: 64
inner_size (num) – the dimensionality in feed-forward layer. Default: 256
hidden_dropout_prob (float) – probability of an element to be zeroed. Default: 0.5
attn_dropout_prob (float) – probability of an attention score to be zeroed. Default: 0.5
hidden_act (str) – activation function in feed-forward layer. Default: ‘gelu’ candidates: ‘gelu’, ‘relu’, ‘swish’, ‘tanh’, ‘sigmoid’
layer_norm_eps (float) – a value added to the denominator for numerical stability. Default: 1e-12

forward(hidden_states, attention_mask, output_all_encoded_layers=True)[source]¶

Parameters:

hidden_states (torch.Tensor) – the input of the TransformerEncoder
attention_mask (torch.Tensor) – the attention mask for the input hidden_states
output_all_encoded_layers (Bool) – whether output all transformer layers’ output

Returns:

if output_all_encoded_layers is True, return a list consists of all transformer layers’ output, otherwise return a list only consists of the output of last transformer layer.

Return type:

all_encoder_layers (list)

training: bool¶

class recbole.model.layers.TransformerLayer(n_heads, hidden_size, intermediate_size, hidden_dropout_prob, attn_dropout_prob, hidden_act, layer_norm_eps)[source]¶

Bases: Module

One transformer layer consists of a multi-head self-attention layer and a point-wise feed-forward layer.

Parameters:

hidden_states (torch.Tensor) – the input of the multi-head self-attention sublayer
attention_mask (torch.Tensor) – the attention mask for the multi-head self-attention sublayer

Returns:

The output of the point-wise feed-forward sublayer,: is the output of the transformer layer.

Return type:

feedforward_output (torch.Tensor)

forward(hidden_states, attention_mask)[source]¶

Defines the computation performed at every call.

Should be overridden by all subclasses.

Note

Although the recipe for forward pass needs to be defined within this function, one should call the Module instance afterwards instead of this since the former takes care of running the registered hooks while the latter silently ignores them.

training: bool¶

class recbole.model.layers.VanillaAttention(hidden_dim, attn_dim)[source]¶

Bases: Module

Vanilla attention layer is implemented by linear layer.

Parameters:: input_tensor (torch.Tensor) – the input of the attention layer
Returns:: the outputs of the attention layer weights (torch.Tensor): the attention weights
Return type:: hidden_states (torch.Tensor)

forward(input_tensor)[source]¶

Defines the computation performed at every call.

Should be overridden by all subclasses.

Note

Although the recipe for forward pass needs to be defined within this function, one should call the Module instance afterwards instead of this since the former takes care of running the registered hooks while the latter silently ignores them.

training: bool¶

recbole.model.layers.activation_layer(activation_name='relu', emb_dim=None)[source]¶

Construct activation layers

Parameters:

activation_name – str, name of activation function
emb_dim – int, used for Dice activation

Returns:

activation layer

Return type:

activation