Wide & Deep#

class libreco.algorithms.WideDeep(task, data_info=None, loss_type='cross_entropy', embed_size=16, n_epochs=20, lr=None, lr_decay=False, epsilon=1e-05, reg=None, batch_size=256, sampler='random', num_neg=1, use_bn=True, dropout_rate=None, hidden_units=(128, 64, 32), multi_sparse_combiner='sqrtn', seed=42, lower_upper_bound=None, tf_sess_config=None)[source]#

Bases: TfBase

Wide & Deep algorithm.

Parameters:
  • task ({'rating', 'ranking'}) – Recommendation task. See Task.

  • data_info (DataInfo object) – Object that contains useful information for training and inference.

  • loss_type ({'cross_entropy', 'focal'}, default: 'cross_entropy') – Loss for model training.

  • embed_size (int, default: 16) – Vector size of embeddings.

  • n_epochs (int, default: 20) – Number of epochs for training.

  • lr (dict, default: {"wide": 0.01, "deep": 1e-4}) – Learning rate for training. The parameter should be a dict containing the learning rates of the wide and deep parts.

  • lr_decay (bool, default: False) – Whether to use learning rate decay.

  • epsilon (float, default: 1e-5) – A small constant added to the denominator to improve numerical stability in the Adam optimizer. According to the official comment, the default value of 1e-8 for epsilon is generally not good, so 1e-5 is chosen here. Users can try tuning this hyperparameter if training is unstable.

  • reg (float or None, default: None) – Regularization parameter, must be non-negative or None.

  • batch_size (int, default: 256) – Batch size for training.

  • sampler ({'random', 'unconsumed', 'popular'}, default: 'random') –

    Negative sampling strategy.

    • 'random' means random sampling.

    • 'unconsumed' samples items that the target user did not consume before.

    • 'popular' has a higher probability to sample popular items as negative samples.

    New in version 1.1.0.

  • num_neg (int, default: 1) – Number of negative samples for each positive sample, only used in ranking task.

  • use_bn (bool, default: True) – Whether to use batch normalization.

  • dropout_rate (float or None, default: None) – Probability of an element being zeroed. If it is None, dropout is not used.

  • hidden_units (int, list of int or tuple of (int,), default: (128, 64, 32)) –

    Number of layers and corresponding layer size in MLP.

    Changed in version 1.0.0: Accept type of int, list or tuple, instead of str.

  • multi_sparse_combiner ({'normal', 'mean', 'sum', 'sqrtn'}, default: 'sqrtn') – Options for combining multi_sparse features.

  • seed (int, default: 42) – Random seed.

  • lower_upper_bound (tuple or None, default: None) – Lower and upper score bound for rating task.

  • tf_sess_config (dict or None, default: None) – Optional TensorFlow session config, see ConfigProto options.

Notes

According to the original paper, the Wide part uses FTRL with L1 regularization as the optimizer, so we’ll also adopt it here. Note this may not be suitable for your specific task.

References

Heng-Tze Cheng et al. Wide & Deep Learning for Recommender Systems.
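A minimal construction sketch follows. It assumes the usual DatasetFeat preprocessing from the library's examples; the toy DataFrame, its column names, and all hyperparameter values are illustrative, not part of this reference.

```python
import pandas as pd
from libreco.data import DatasetFeat
from libreco.algorithms import WideDeep

# Toy interaction data; "user", "item" and "label" columns are required,
# the feature columns ("age", "genre") are made up for illustration.
data = pd.DataFrame({
    "user": [1, 2, 3, 1],
    "item": [10, 11, 10, 12],
    "label": [1, 1, 1, 1],
    "age": [25, 31, 22, 25],
    "genre": ["a", "b", "a", "c"],
})

sparse_col, dense_col = ["genre"], ["age"]
user_col, item_col = ["age"], ["genre"]
train_data, data_info = DatasetFeat.build_trainset(
    data, user_col, item_col, sparse_col, dense_col
)

model = WideDeep(
    task="ranking",
    data_info=data_info,
    loss_type="cross_entropy",
    embed_size=16,
    n_epochs=2,
    lr={"wide": 0.01, "deep": 1e-4},   # separate learning rates for the two parts
    batch_size=256,
    use_bn=True,
    hidden_units=(128, 64, 32),
)
```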

fit(train_data, neg_sampling, verbose=1, shuffle=True, eval_data=None, metrics=None, k=10, eval_batch_size=8192, eval_user_num=None, num_workers=0)#

Fit TF model on the training data.

Parameters:
  • train_data (TransformedSet object) – Data object used for training.

  • neg_sampling (bool) –

    Whether to perform negative sampling for training or evaluating data.

    New in version 1.1.0.

    Note

    Negative sampling is needed if your data is implicit (i.e., the task is ranking) and ONLY contains positive labels. Otherwise, it should be False.

  • verbose (int, default: 1) –

    Print verbosity.

    • verbose <= 0: Print nothing.

    • verbose == 1: Print progress bar and training time.

    • verbose > 1: Print evaluation metrics if eval_data is provided.

  • shuffle (bool, default: True) – Whether to shuffle the training data.

  • eval_data (TransformedSet object, default: None) – Data object used for evaluating.

  • metrics (list or None, default: None) – List of metrics for evaluating.

  • k (int, default: 10) – Parameter of metrics, e.g. recall at k, ndcg at k.

  • eval_batch_size (int, default: 8192) – Batch size for evaluating.

  • eval_user_num (int or None, default: None) – Number of users for evaluating. Setting it to a positive number will sample users randomly from eval data.

  • num_workers (int, default: 0) –

    How many subprocesses to use for training data loading. 0 means that the data will be loaded in the main process, which is slower than multiprocessing.

    New in version 1.1.0.

    Caution

    Using multiprocessing (num_workers > 0) may consume more memory than single processing. See Multi-process data loading.

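A hedged training sketch, continuing the construction example above (the held-out eval_df DataFrame and the metric list are illustrative assumptions):

```python
# Build an evaluation set with the same columns as the training data.
eval_df = pd.DataFrame({
    "user": [1, 2],
    "item": [11, 12],
    "label": [1, 1],
    "age": [25, 31],
    "genre": ["b", "c"],
})
eval_data = DatasetFeat.build_evalset(eval_df)

model.fit(
    train_data,
    neg_sampling=True,   # implicit data that only contains positive labels
    verbose=2,           # also print evaluation metrics per epoch
    shuffle=True,
    eval_data=eval_data,
    metrics=["loss", "roc_auc", "precision", "recall", "ndcg"],
    k=10,
    eval_batch_size=8192,
    num_workers=0,
)
```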
classmethod load(path, model_name, data_info, manual=True)#

Load saved TF model for inference.

Parameters:
  • path (str) – File folder path to save model.

  • model_name (str) – Name of the saved model file.

  • data_info (DataInfo object) – Object that contains some useful information.

  • manual (bool, default: True) – Whether to load model variables using numpy. If the model was saved with manual=True, it should also be loaded with manual=True.

Returns:

model – Loaded TF model.

Return type:

type(cls)

See also

save
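A hedged loading sketch; it assumes the model and its DataInfo were previously saved under "model_path" with the name "wide_deep" (both names are illustrative), and that DataInfo.load restores the saved data information as in the library's serialization examples:

```python
from libreco.data import DataInfo
from libreco.algorithms import WideDeep

loaded_data_info = DataInfo.load("model_path", model_name="wide_deep")
loaded_model = WideDeep.load(
    path="model_path",
    model_name="wide_deep",
    data_info=loaded_data_info,
    manual=True,   # must match the `manual` flag used when saving
)
```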

predict(user, item, feats=None, cold_start='average', inner_id=False)#

Make prediction(s) on given user(s) and item(s).

Parameters:
  • user (int or str or array_like) – User id or batch of user ids.

  • item (int or str or array_like) – Item id or batch of item ids.

  • feats (dict or pandas.Series or None, default: None) – Extra features used in prediction.

  • cold_start ({'popular', 'average'}, default: 'average') –

    Cold start strategy.

    • 'popular' will sample from popular items.

    • 'average' will use the average of all the user/item embeddings as the representation of the cold-start user/item.

  • inner_id (bool, default: False) – Whether to use inner_id defined in libreco. For library users, inner_id may never be used.

Returns:

prediction – Predicted scores for each user-item pair.

Return type:

float or numpy.ndarray
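A short prediction sketch, continuing the setup above; the ids and the feature values passed through feats are illustrative:

```python
# Single pair and batch prediction.
score = model.predict(user=1, item=10)
scores = model.predict(user=[1, 2], item=[10, 11], cold_start="average")

# Prediction with extra feature values supplied at inference time.
score_with_feats = model.predict(user=1, item=10, feats={"age": 30, "genre": "b"})
```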

rebuild_model(path, model_name, full_assign=True)#

Assign the saved model variables to the newly initialized model.

This method is used before retraining the new model, in order to avoid training from scratch every time we get some new data.

Parameters:
  • path (str) – File folder path for the saved model variables.

  • model_name (str) – Name of the saved model file.

  • full_assign (bool, default: True) – Whether to also restore the variables of Adam optimizer.
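A hedged retraining sketch. It assumes the old model and its DataInfo were saved under "model_path" as "wide_deep"; new_train_data stands for a TransformedSet built from newly collected interactions (e.g. with the dataset-merging utilities shown in the library's retraining example, which are not reproduced here):

```python
from libreco.data import DataInfo
from libreco.algorithms import WideDeep

# Reload the DataInfo saved alongside the previous model.
data_info = DataInfo.load("model_path", model_name="wide_deep")

# Initialize a fresh model, assign the saved variables, then continue training
# on `new_train_data` (assumed to be built from the new data).
new_model = WideDeep(task="ranking", data_info=data_info, lr={"wide": 0.01, "deep": 1e-4})
new_model.rebuild_model(path="model_path", model_name="wide_deep", full_assign=True)
new_model.fit(new_train_data, neg_sampling=True, verbose=2)
```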

recommend_user(user, n_rec, user_feats=None, seq=None, cold_start='average', inner_id=False, filter_consumed=True, random_rec=False)#

Recommend a list of items for given user(s).

If both user_feats and seq are None, the model will use the stored features for recommendation, and the cold_start strategy will be used for unknown users.

If either user_feats or seq is provided, the model will use them for recommendation. In this case, if the user is unknown, it will be set to padding id, which means the cold_start strategy will not be applied. This situation is common when one wants to recommend for an unknown user based on user features or behavior sequence.

Parameters:
  • user (int or str or array_like) – User id or batch of user ids to recommend.

  • n_rec (int) – Number of recommendations to return.

  • user_feats (dict or None, default: None) – Extra user features for recommendation.

  • seq (list or numpy.ndarray or None, default: None) –

    Extra item sequence for recommendation. If the sequence length is larger than recent_num hyperparameter specified in the model, it will be truncated. If smaller, it will be padded.

    New in version 1.1.0.

  • cold_start ({'popular', 'average'}, default: 'average') –

    Cold start strategy.

    • 'popular' will sample from popular items.

    • 'average' will use the average of all the user/item embeddings as the representation of the cold-start user/item.

  • inner_id (bool, default: False) – Whether to use inner_id defined in libreco. For library users, inner_id may never be used.

  • filter_consumed (bool, default: True) – Whether to filter out items that a user has previously consumed.

  • random_rec (bool, default: False) – Whether to sample recommended items randomly according to their prediction scores, instead of always returning the top-scored items.

Returns:

recommendation – Recommendation result with user ids as keys and array_like recommended items as values.

Return type:

dict of {Union[int, str, array_like] : numpy.ndarray}
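A short recommendation sketch, continuing the setup above; the user ids and feature values are illustrative:

```python
# Top-3 recommendations for a single user and for a batch of users.
rec = model.recommend_user(user=1, n_rec=3)
batch_rec = model.recommend_user(user=[1, 2], n_rec=3, filter_consumed=True)

# Recommending for an unknown user based only on provided features,
# so the cold_start strategy is bypassed (id 999 was never seen in training).
feat_rec = model.recommend_user(user=999, n_rec=3, user_feats={"age": 30})
```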

save(path, model_name, manual=True, inference_only=False)#

Save TF model for inference or retraining.

Parameters:
  • path (str) – File folder path to save model.

  • model_name (str) – Name of the saved model file.

  • manual (bool, default: True) – Whether to save model variables using numpy.

  • inference_only (bool, default: False) – Whether to save model variables only for inference.

See also

load
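A hedged saving sketch; the folder "model_path" and the name "wide_deep" are illustrative, and saving the accompanying DataInfo via data_info.save follows the library's serialization examples:

```python
import os

os.makedirs("model_path", exist_ok=True)
# Save variables with numpy for inference only (smaller files, no retraining).
model.save("model_path", "wide_deep", manual=True, inference_only=True)
# Save the DataInfo as well so that `load` can restore the model later.
data_info.save(path="model_path", model_name="wide_deep")
```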