:: DeveloperApi :: Statistics of a block in ALS computation.
:: DeveloperApi :: Given an RDD of ratings, number of user blocks, and number of product blocks, computes the statistics of each block in ALS computation.
:: DeveloperApi :: Given an RDD of ratings, number of user blocks, and number of product blocks, computes the statistics of each block in ALS computation. This is useful for estimating cost and diagnosing load balance.
an RDD of ratings
number of user blocks
number of product blocks
statistics of user blocks and product blocks
Train a matrix factorization model given an RDD of ratings given by users to some products, in the form of (userID, productID, rating) pairs.
Train a matrix factorization model given an RDD of ratings given by users to some products,
in the form of (userID, productID, rating) pairs. We approximate the ratings matrix as the
product of two lower-rank matrices of a given rank (number of features). To solve for these
features, we run a given number of iterations of ALS. The level of parallelism is determined
automatically based on the number of partitions in ratings
.
RDD of (userID, productID, rating) pairs
number of features to use
number of iterations of ALS (recommended: 10-20)
Train a matrix factorization model given an RDD of ratings given by users to some products, in the form of (userID, productID, rating) pairs.
Train a matrix factorization model given an RDD of ratings given by users to some products,
in the form of (userID, productID, rating) pairs. We approximate the ratings matrix as the
product of two lower-rank matrices of a given rank (number of features). To solve for these
features, we run a given number of iterations of ALS. The level of parallelism is determined
automatically based on the number of partitions in ratings
.
RDD of (userID, productID, rating) pairs
number of features to use
number of iterations of ALS (recommended: 10-20)
regularization factor (recommended: 0.01)
Train a matrix factorization model given an RDD of ratings given by users to some products, in the form of (userID, productID, rating) pairs.
Train a matrix factorization model given an RDD of ratings given by users to some products,
in the form of (userID, productID, rating) pairs. We approximate the ratings matrix as the
product of two lower-rank matrices of a given rank (number of features). To solve for these
features, we run a given number of iterations of ALS. This is done using a level of
parallelism given by blocks
.
RDD of (userID, productID, rating) pairs
number of features to use
number of iterations of ALS (recommended: 10-20)
regularization factor (recommended: 0.01)
level of parallelism to split computation into
Train a matrix factorization model given an RDD of ratings given by users to some products, in the form of (userID, productID, rating) pairs.
Train a matrix factorization model given an RDD of ratings given by users to some products,
in the form of (userID, productID, rating) pairs. We approximate the ratings matrix as the
product of two lower-rank matrices of a given rank (number of features). To solve for these
features, we run a given number of iterations of ALS. This is done using a level of
parallelism given by blocks
.
RDD of (userID, productID, rating) pairs
number of features to use
number of iterations of ALS (recommended: 10-20)
regularization factor (recommended: 0.01)
level of parallelism to split computation into
random seed
Train a matrix factorization model given an RDD of 'implicit preferences' ratings given by users to some products, in the form of (userID, productID, rating) pairs.
Train a matrix factorization model given an RDD of 'implicit preferences' ratings given by
users to some products, in the form of (userID, productID, rating) pairs. We approximate the
ratings matrix as the product of two lower-rank matrices of a given rank (number of features).
To solve for these features, we run a given number of iterations of ALS. The level of
parallelism is determined automatically based on the number of partitions in ratings
.
Model parameters alpha
and lambda
are set to reasonable default values
RDD of (userID, productID, rating) pairs
number of features to use
number of iterations of ALS (recommended: 10-20)
Train a matrix factorization model given an RDD of 'implicit preferences' given by users to some products, in the form of (userID, productID, preference) pairs.
Train a matrix factorization model given an RDD of 'implicit preferences' given by users to
some products, in the form of (userID, productID, preference) pairs. We approximate the
ratings matrix as the product of two lower-rank matrices of a given rank (number of features).
To solve for these features, we run a given number of iterations of ALS. The level of
parallelism is determined automatically based on the number of partitions in ratings
.
RDD of (userID, productID, rating) pairs
number of features to use
number of iterations of ALS (recommended: 10-20)
regularization factor (recommended: 0.01)
Train a matrix factorization model given an RDD of 'implicit preferences' given by users to some products, in the form of (userID, productID, preference) pairs.
Train a matrix factorization model given an RDD of 'implicit preferences' given by users
to some products, in the form of (userID, productID, preference) pairs. We approximate the
ratings matrix as the product of two lower-rank matrices of a given rank (number of features).
To solve for these features, we run a given number of iterations of ALS. This is done using
a level of parallelism given by blocks
.
RDD of (userID, productID, rating) pairs
number of features to use
number of iterations of ALS (recommended: 10-20)
regularization factor (recommended: 0.01)
level of parallelism to split computation into
confidence parameter (only applies when immplicitPrefs = true)
Train a matrix factorization model given an RDD of 'implicit preferences' given by users to some products, in the form of (userID, productID, preference) pairs.
Train a matrix factorization model given an RDD of 'implicit preferences' given by users
to some products, in the form of (userID, productID, preference) pairs. We approximate the
ratings matrix as the product of two lower-rank matrices of a given rank (number of features).
To solve for these features, we run a given number of iterations of ALS. This is done using
a level of parallelism given by blocks
.
RDD of (userID, productID, rating) pairs
number of features to use
number of iterations of ALS (recommended: 10-20)
regularization factor (recommended: 0.01)
level of parallelism to split computation into
confidence parameter (only applies when immplicitPrefs = true)
random seed
Top-level methods for calling Alternating Least Squares (ALS) matrix factorization.