public class MatrixFactorizationModel extends Object implements Saveable, scala.Serializable, Logging
Note: If you create the model directly using the constructor, be aware that fast prediction requires cached user/product features and their associated partitioners.
Parameters:
rank - Rank for the features in this model.
userFeatures - RDD of tuples where each tuple represents the userId and the features computed for this user.
productFeatures - RDD of tuples where each tuple represents the productId and the features computed for this product.
Constructor and Description |
---|
MatrixFactorizationModel(int rank, RDD<scala.Tuple2<Object,double[]>> userFeatures, RDD<scala.Tuple2<Object,double[]>> productFeatures) |
Modifier and Type | Method and Description |
---|---|
static MatrixFactorizationModel | load(SparkContext sc, String path) Load a model from the given path. |
double | predict(int user, int product) Predict the rating of one user for one product. |
JavaRDD<Rating> | predict(JavaPairRDD<Integer,Integer> usersProducts) Java-friendly version of MatrixFactorizationModel.predict. |
RDD<Rating> | predict(RDD<scala.Tuple2<Object,Object>> usersProducts) Predict the rating of many users for many products. |
RDD<scala.Tuple2<Object,double[]>> | productFeatures() |
int | rank() |
Rating[] | recommendProducts(int user, int num) Recommends products to a user. |
RDD<scala.Tuple2<Object,Rating[]>> | recommendProductsForUsers(int num) Recommends topK products for all users. |
Rating[] | recommendUsers(int product, int num) Recommends users to a product. |
RDD<scala.Tuple2<Object,Rating[]>> | recommendUsersForProducts(int num) Recommends topK users for all products. |
void | save(SparkContext sc, String path) Save this model to the given path. |
RDD<scala.Tuple2<Object,double[]>> | userFeatures() |
Methods inherited from class Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Methods inherited from interface Logging
initializeIfNecessary, initializeLogging, isTraceEnabled, log_, log, logDebug, logDebug, logError, logError, logInfo, logInfo, logName, logTrace, logTrace, logWarning, logWarning
public static MatrixFactorizationModel load(SparkContext sc, String path)
The model should have been saved by Saveable.save.
Parameters:
sc - Spark context used for loading model files.
path - Path specifying the directory to which the model was saved.

public int rank()
public RDD<scala.Tuple2<Object,double[]>> userFeatures()
public RDD<scala.Tuple2<Object,double[]>> productFeatures()
public double predict(int user, int product)
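In a matrix factorization model, the predicted rating for a (user, product) pair is the dot product of the two feature vectors, each of length rank. The following is a minimal sketch of that computation only, not Spark's implementation: plain in-memory maps stand in for the userFeatures and productFeatures RDDs, and the class PredictSketch, its methods, and the exception thrown for an unknown ID are all hypothetical choices made for illustration.

```java
import java.util.Map;

// Hypothetical stand-in for the model: maps replace the feature RDDs.
final class PredictSketch {

    // Dot product of two feature vectors of equal length (the model's rank).
    static double dot(double[] a, double[] b) {
        if (a.length != b.length) {
            throw new IllegalArgumentException("feature vectors must have the same rank");
        }
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            sum += a[i] * b[i];
        }
        return sum;
    }

    // Looks up both feature vectors and returns their dot product.
    // Rejecting unknown IDs with an exception is this sketch's own choice.
    static double predict(Map<Integer, double[]> userFeatures,
                          Map<Integer, double[]> productFeatures,
                          int user, int product) {
        double[] u = userFeatures.get(user);
        double[] p = productFeatures.get(product);
        if (u == null || p == null) {
            throw new IllegalArgumentException("unknown user or product ID");
        }
        return dot(u, p);
    }

    public static void main(String[] args) {
        Map<Integer, double[]> users = Map.of(1, new double[] {1.0, 2.0});
        Map<Integer, double[]> products = Map.of(10, new double[] {3.0, 4.0});
        // 1.0 * 3.0 + 2.0 * 4.0 = 11.0
        System.out.println(predict(users, products, 1, 10));
    }
}
```

With RDD-backed features, each single-pair prediction needs a lookup in both feature sets, which is why the class note stresses cached features and their partitioners.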
public RDD<Rating> predict(RDD<scala.Tuple2<Object,Object>> usersProducts)
Parameters:
usersProducts - RDD of (user, product) pairs.

public JavaRDD<Rating> predict(JavaPairRDD<Integer,Integer> usersProducts)
Java-friendly version of MatrixFactorizationModel.predict.
Parameters:
usersProducts - (undocumented)

public Rating[] recommendProducts(int user, int num)
Parameters:
user - the user to recommend products to
num - how many products to return. The number returned may be less than this.
Returns:
Rating objects, each of which contains the given user ID, a product ID, and a "score" in the rating field. Each represents one recommended product, and they are sorted by score, decreasing. The first returned is the one predicted to be most strongly recommended to the user. The score is an opaque value that indicates how strongly recommended the product is.

public Rating[] recommendUsers(int product, int num)
Parameters:
product - the product to recommend users to
num - how many users to return. The number returned may be less than this.
Returns:
Rating objects, each of which contains a user ID, the given product ID, and a "score" in the rating field. Each represents one recommended user, and they are sorted by score, decreasing. The first returned is the one predicted to be most strongly recommended to the product. The score is an opaque value that indicates how strongly recommended the user is.

public void save(SparkContext sc, String path)
This saves:
- human-readable (JSON) model metadata to path/metadata/
- Parquet formatted data to path/data/
The model may be loaded using Loader.load.
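Because save writes to the two subdirectories named above, a caller may want a quick sanity check that a directory has that layout before passing it to load. The sketch below assumes the model was saved to the local filesystem (a model saved to HDFS or another store would need that store's API instead); the class SavedModelLayoutSketch and the method looksLikeSavedModel are hypothetical helpers, not part of Spark, and the check does not validate the files inside either directory.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical helper: checks only the directory layout described for save.
final class SavedModelLayoutSketch {

    // True if modelDir contains both a metadata/ and a data/ subdirectory.
    static boolean looksLikeSavedModel(Path modelDir) {
        return Files.isDirectory(modelDir.resolve("metadata"))
            && Files.isDirectory(modelDir.resolve("data"));
    }

    public static void main(String[] args) throws IOException {
        // Build a throwaway directory to exercise the check.
        Path dir = Files.createTempDirectory("mf-model");
        System.out.println(looksLikeSavedModel(dir)); // empty directory: false

        Files.createDirectories(dir.resolve("metadata"));
        Files.createDirectories(dir.resolve("data"));
        System.out.println(looksLikeSavedModel(dir)); // both present: true
    }
}
```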
public RDD<scala.Tuple2<Object,Rating[]>> recommendProductsForUsers(int num)
Parameters:
num - how many products to return for every user.

public RDD<scala.Tuple2<Object,Rating[]>> recommendUsersForProducts(int num)
Parameters:
num - how many users to return for every product.
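The ranking contract documented for recommendProducts (at most num results, sorted by decreasing score, each carrying the user ID, a product ID, and the score) can be illustrated with a small in-memory sketch. It is not how Spark computes recommendations at scale: it scores every product with a dot product and sorts the full list. The class TopKSketch, the nested Scored type (a stand-in for Rating), and the map of product features are hypothetical names introduced for illustration.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.Map;

// Hypothetical in-memory illustration of the top-num ranking contract.
final class TopKSketch {

    // Stand-in for Rating: a user ID, a product ID, and a score.
    static final class Scored {
        final int user;
        final int product;
        final double rating;

        Scored(int user, int product, double rating) {
            this.user = user;
            this.product = product;
            this.rating = rating;
        }
    }

    static double dot(double[] a, double[] b) {
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            sum += a[i] * b[i];
        }
        return sum;
    }

    // Scores every product for one user, then keeps the num highest scores.
    // Fewer than num results are returned when there are fewer products.
    static List<Scored> recommendProducts(int user, double[] userVector,
                                          Map<Integer, double[]> productFeatures,
                                          int num) {
        List<Scored> all = new ArrayList<>();
        for (Map.Entry<Integer, double[]> e : productFeatures.entrySet()) {
            all.add(new Scored(user, e.getKey(), dot(userVector, e.getValue())));
        }
        // Sort by score, decreasing, so the strongest recommendation is first.
        all.sort(Comparator.comparingDouble((Scored s) -> s.rating).reversed());
        return new ArrayList<>(all.subList(0, Math.min(num, all.size())));
    }

    public static void main(String[] args) {
        Map<Integer, double[]> products = Map.of(
            10, new double[] {1.0, 0.0},   // score 2.0 for the user below
            20, new double[] {0.0, 1.0},   // score 1.0
            30, new double[] {1.0, 1.0});  // score 3.0
        for (Scored s : recommendProducts(7, new double[] {2.0, 1.0}, products, 2)) {
            System.out.println(s.product + " " + s.rating);
        }
        // prints "30 3.0" then "10 2.0"
    }
}
```

Applying the same routine once per user gives the shape of recommendProductsForUsers: one (user ID, ranked results) pair per user. Swapping the roles of users and products gives the shape of recommendUsersForProducts.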