I'm using Mahout with the Pearson Correlation algorithm to compare and find similar users based on their preferences for several items. The problem I'm running into is that Mahout and/or Pearson is ignoring users that select the same preference for every item. Does anyone know if there is a way to configure Mahout to NOT ignore people that select the same preference value for every item.
Apache Mahout + Pearson Correlation Ignores Users With Same Preference For Every Item
959 views Asked by SGT Grumpy Pants At
1
There are 1 answers
Related Questions in MAHOUT
- Dealing with Pearson Similarity returning 0 for users with equal item counts - Mahout
- Running Mahout in Hadoop Cluster - java.lang.ClassNotFoundException
- Why does the result of ItemSimilarityJob lack some similarities of itemId-pair?
- hadoop "Can not create a Path from an empty string"
- How to resolve job_1634335400729_0001 failed with state FAILED due to: Application application_1634335400729_0001 failed 2 times error on hadoop
- Does Hadoop 3 support Mahout?
- Off-line clustering using solr?
- Apache Mahout Vs Apache Spark in local mode with nutch data
- Linux pointing to wrong version of Mahout
- Mahout custom data
- Errors caused by adding Mahout Dependency to Gradle
- MojoExecution Exception in mahout library
- Error: Could not find or load main class org.apache.mahout.driver
- When I run k-Means by Mahout,always show this tip
- How to implement a trending recommender in mahout
Related Questions in PEARSON
- Dealing with Pearson Similarity returning 0 for users with equal item counts - Mahout
- R Function for Relative Frequency to all columns
- How to find a Pearson correlation starting from two column Pandas DataFrame?
- Do a correlation test between two datasets of different sizes (one has 24, the other one 25 values) in R
- How to use for loop to perform Pearson correlation in r
- Is Pearson correlation faster than Spearman correlation in R?
- Calculating Correlation between genes of different treatments
- R k-means cluster with pearson (ClusterR)
- replacement length error which doing correlation in R using cor function
- Cross-lagged Pearson correlation in R
- How to slice and calculate the pearson correlation coefficient between one big and small array with "overlapping" windows arrays
- Memory error calculating Pearson correlation on huge dask.dataframe
- performance of calculating pearson coefficient of one vector with n vectors
- pearson correlation for genes in gene expression data
- Problem with creating Pearson correlation coefficient in python
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
It is not a question of configuration. The Pearson correlation is undefined in this case, so there can be no similarity computed between them using this metric.
Essentially -- Pearson is the ratio of the two preference series' covariance to the product of their standard deviations. But when one or both sequences are identical, the standard deviation is 0, as is the covariance, so the correlation is 0/0.
(This and a few other Pearson gotchas are covered in Chapter 4 of Mahout in Action, and I'm author of this part of the book and code.)