Understanding Spark MLlib ALS.trainImplicit input format

Question

Understanding Spark MLlib ALS.trainImplicit input format

395 views Asked by Maria At 28 December 2016 at 11:25

I`m trying to make a recommender system based on purchase history using trainImplicit. My input is in domain [1, +inf) (the sum of views and purchases).

So the element of my input RDD looks like this: [(user_id,item_id),rating] --> [(123,5564),6] - the user(id = 123) interacted with the item(id=5564) 6 times.

Should I add to my RDD elements such as [(user_id,item_id),rating] --> [(123,2222),0], meaning that given user has never interacted with given item or the ALS.implicitTrain does this implicitly?

Original Q&A

There are 1 answers

**user7337271** · Accepted Answer · 2016-12-28T16:48:33+00:00

user7337271 On 28 December 2016 at 16:48 BEST ANSWER

It it not necessary (for implicit) and shouldn't be done (for explicit) so in this case bass only data you actually have.

TechQA.

Understanding Spark MLlib ALS.trainImplicit input format

There are 1 answers

Related Questions in PYTHON

Related Questions in PYSPARK

Related Questions in COLLABORATIVE-FILTERING

Popular Questions

Popular Tags

Trending Questions