[jira] Created: (MAHOUT-341) org.apache.mahout.cf.taste.hadoop.slopeone is not complete

classic Classic list List threaded Threaded
4 messages Options
Reply | Threaded
Open this post in threaded view
|

[jira] Created: (MAHOUT-341) org.apache.mahout.cf.taste.hadoop.slopeone is not complete

JIRA jira@apache.org
org.apache.mahout.cf.taste.hadoop.slopeone is not complete
----------------------------------------------------------

                 Key: MAHOUT-341
                 URL: https://issues.apache.org/jira/browse/MAHOUT-341
             Project: Mahout
          Issue Type: Improvement
          Components: Collaborative Filtering
    Affects Versions: 0.3
            Reporter: Hui Wen Han
             Fix For: 0.4


slopeone arithmetic using hadoop is not complete .
can not use it do recommendation for dataset that has rating.

hope to complete it and gave  a full solution

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Updated: (MAHOUT-341) org.apache.mahout.cf.taste.hadoop.slopeone could have an off-line implementation

JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/MAHOUT-341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sean Owen updated MAHOUT-341:
-----------------------------

    Priority: Minor  (was: Major)
     Summary: org.apache.mahout.cf.taste.hadoop.slopeone could have an off-line implementation  (was: org.apache.mahout.cf.taste.hadoop.slopeone is not complete)

It is not advertised as a complete recommender -- it is what it says. It does the precomputation phase for slope-one, and produces diffs. It is used as input to the on-line slope-one recommender.

You can use the 'pseudo-distributed' Hadoop recommender to run these slope-one recommenders on Hadoop, on top of the diffs computed by this Hadoop job. That's fairly complete.

You could write a new special-purpose Hadoop-based job to finish the recommender computation too. It could be even better. If you mean you'd like to implement that I can leave this open to track this.

> org.apache.mahout.cf.taste.hadoop.slopeone could have an off-line implementation
> --------------------------------------------------------------------------------
>
>                 Key: MAHOUT-341
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-341
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Collaborative Filtering
>    Affects Versions: 0.3
>            Reporter: Hui Wen Han
>            Priority: Minor
>             Fix For: 0.4
>
>
> slopeone arithmetic using hadoop is not complete .
> can not use it do recommendation for dataset that has rating.
> hope to complete it and gave  a full solution

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Commented: (MAHOUT-341) org.apache.mahout.cf.taste.hadoop.slopeone could have an off-line implementation

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/MAHOUT-341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12849680#action_12849680 ]

Hui Wen Han commented on MAHOUT-341:
------------------------------------

Thanks for your response and advice,
I plan to use SlopeOnePrefsToDiffs job and SlopeOneDiffsToAverages job to instead of ItemSimilarityEstimator and use the cooccurence  arithmetic to calculate  recommendation for item's had rating.

could you give me some advice ?

> org.apache.mahout.cf.taste.hadoop.slopeone could have an off-line implementation
> --------------------------------------------------------------------------------
>
>                 Key: MAHOUT-341
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-341
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Collaborative Filtering
>    Affects Versions: 0.3
>            Reporter: Hui Wen Han
>            Priority: Minor
>             Fix For: 0.4
>
>
> slopeone arithmetic using hadoop is not complete .
> can not use it do recommendation for dataset that has rating.
> hope to complete it and gave  a full solution

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Commented: (MAHOUT-341) org.apache.mahout.cf.taste.hadoop.slopeone could have an off-line implementation

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/MAHOUT-341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12850064#action_12850064 ]

Sean Owen commented on MAHOUT-341:
----------------------------------

You could use the output of this job as something like a co-occurrence matrix, with modification. You would want to consider the following modifications:

- Right now, it just outputs item-item / float pairs as its output. It would have to output item / user-vector pairs like the co-occurrence matrix in order to use with the rest of the co-occurrence-based jobs.
- You probably don't want to output the average diff, though that would work OK. Don't divide by 'count' in SlopeOneDiffsToAveragesReducer

But at that point, you're very close to outputting co-occurrence counts anyway! So I'd advise you to just use the 'item' implementation anyway.

You're talking about ItemSimilarityEstimator -- that's not part of the implementation I'm suggesting you use, in order to effectively use longs. Look to the 'item' package.


> org.apache.mahout.cf.taste.hadoop.slopeone could have an off-line implementation
> --------------------------------------------------------------------------------
>
>                 Key: MAHOUT-341
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-341
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Collaborative Filtering
>    Affects Versions: 0.3
>            Reporter: Hui Wen Han
>            Priority: Minor
>             Fix For: 0.4
>
>
> slopeone arithmetic using hadoop is not complete .
> can not use it do recommendation for dataset that has rating.
> hope to complete it and gave  a full solution

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.