TransformedSet

class libreco.data.TransformedSet(user_indices=None, item_indices=None, labels=None, sparse_indices=None, dense_values=None, train=True)

Dataset after transformation.

A TransformedSet is usually generated by calling methods of DatasetPure or DatasetFeat, and is then used for model training.

Parameters:
  • user_indices (numpy.ndarray) – All user rows in data, represented as inner ids.

  • item_indices (numpy.ndarray) – All item rows in data, represented as inner ids.

  • labels (numpy.ndarray) – All labels in data.

  • sparse_indices (numpy.ndarray or None, default: None) – All sparse rows in data, represented as inner ids.

  • dense_values (numpy.ndarray or None, default: None) – All dense rows in data.

  • train (bool, default: True) – Whether this is training data.
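
As a rough illustration of what "inner id" arrays look like, the sketch below maps raw user and item identifiers to contiguous integers with NumPy. The raw ids, the ratings, and the choice of np.unique for the mapping are all assumptions made for this example; the library builds its own mapping when DatasetPure or DatasetFeat transforms the data, and that mapping may assign ids in a different order.

```python
import numpy as np

# Hypothetical raw interaction log (names and values are made up).
raw_users = np.array(["u7", "u3", "u7", "u9"])
raw_items = np.array(["book", "pen", "pen", "book"])
labels = np.array([5.0, 3.0, 4.0, 1.0], dtype=np.float32)

# np.unique returns the sorted distinct values; with return_inverse=True it
# also returns, for every element, its position in that sorted array.
# This is one simple way to get contiguous integer ids, not necessarily
# the scheme the library uses.
unique_users, user_indices = np.unique(raw_users, return_inverse=True)
unique_items, item_indices = np.unique(raw_items, return_inverse=True)

# One entry per interaction, aligned across the three arrays.
assert len(user_indices) == len(item_indices) == len(labels)
```

Three aligned one-dimensional arrays of this kind correspond to the user_indices, item_indices and labels parameters listed above.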

build_negative_samples(data_info, num_neg=1, item_gen_mode='random', seed=42)

Perform negative sampling on all the data contained.

Deprecated since version 1.1.0: Use the neg_sampling parameter instead of calling this method explicitly for negative sampling. See Negative Sampling.

Parameters:
  • data_info (DataInfo) – Object that contains data information.

  • num_neg (int, default: 1) – Number of negative samples for each positive sample.

  • item_gen_mode (str, default: 'random') – Sampling strategy. Currently only 'random' is supported.

  • seed (int, default: 42) – Random seed.
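
To make the num_neg and seed parameters concrete, here is a minimal sketch of uniform random negative sampling written with NumPy only. It is not the library's implementation: the helper name sample_negatives, the n_items argument, the 1.0/0.0 label convention, and the decision to skip items the user has already interacted with are all assumptions made for this example.

```python
import numpy as np

def sample_negatives(user_indices, item_indices, n_items, num_neg=1, seed=42):
    """Illustrative random negative sampling (hypothetical helper).

    Assumes every user has at least one unseen item; otherwise the
    rejection loop below would never terminate.
    """
    rng = np.random.default_rng(seed)

    # Items each user has interacted with; these are never drawn as negatives.
    seen = {}
    for u, i in zip(user_indices.tolist(), item_indices.tolist()):
        seen.setdefault(u, set()).add(i)

    users, items, labels = [], [], []
    for u, i in zip(user_indices.tolist(), item_indices.tolist()):
        # Keep the positive sample.
        users.append(u)
        items.append(i)
        labels.append(1.0)
        # Draw num_neg random items the user has not interacted with.
        for _ in range(num_neg):
            j = int(rng.integers(n_items))
            while j in seen[u]:
                j = int(rng.integers(n_items))
            users.append(u)
            items.append(j)
            labels.append(0.0)

    return np.array(users), np.array(items), np.array(labels, dtype=np.float32)

# Three positive interactions, two negatives each -> nine rows in total.
pos_users = np.array([0, 0, 1])
pos_items = np.array([0, 1, 2])
users, items, labels = sample_negatives(pos_users, pos_items, n_items=5, num_neg=2)
```

With num_neg=2 each positive row is followed by two sampled rows labelled 0.0, so the output is three times the size of the input. Fixing seed makes the draw reproducible.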

property user_indices

All user rows in data.

property item_indices

All item rows in data.

property sparse_indices

All sparse rows in data.

property dense_values

All dense rows in data.

property labels

All labels in data.

property sparse_interaction

User-item interaction data, in scipy.sparse.csr_matrix format.
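
For readers unfamiliar with the format, the sketch below builds a user-item matrix in scipy.sparse.csr_matrix form from index and label arrays. The toy data and the choice to store labels as the matrix values are assumptions for illustration; the property may populate its matrix differently.

```python
import numpy as np
from scipy.sparse import csr_matrix

# Toy interactions in inner-id form (values are made up).
user_indices = np.array([0, 0, 1, 2])
item_indices = np.array([1, 3, 0, 3])
labels = np.array([1.0, 1.0, 1.0, 1.0], dtype=np.float32)
n_users, n_items = 3, 4

# Rows are users, columns are items; only observed pairs are stored.
interaction = csr_matrix(
    (labels, (user_indices, item_indices)), shape=(n_users, n_items)
)

# In CSR, the column indices of row r live in
# indices[indptr[r]:indptr[r + 1]], which gives the items of user r.
start, end = interaction.indptr[0], interaction.indptr[1]
user0_items = interaction.indices[start:end]
```

The CSR layout makes per-user lookups like this cheap, which is one reason it is a common choice for interaction data.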