Quantcast

Error in getting vector-ids from seqdumper output

Next Topic
 
classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|  
Report Content as Inappropriate

Error in getting vector-ids from seqdumper output

manisha dubey
This post has NOT been accepted by the mailing list yet.
This post was updated on .
I am using mahout for k-means clustering on a directory containing 12 documents. I am using following commands:

mahout seq2sparse -i /user/manisha1414/dir_001-seqfiles -o /user/manisha1414/dir_001-vectors --maxDFPercent 85 --namedVector

mahout seqdumper -i /user/manisha1414/dir_001-kmeans-clusters/clusteredPoints/part-m-00000 > ./dir_001-cluster-docs.txt

I am getting the foll0wing Output

Key: 0: Value: wt: 1.0 distance: 47.44299700930014  vec: [{"0":2.386},{"2":1.875},{"9":2.386},{"14":2.386.........
Key: 11: Value: wt: 1.0 distance: 217.4603558919857  vec: [{"0":2.386},{"2":1.875},{".........



I am not getting vector-ids in above seqdumper output.

Please help !!
Loading...