Re: TFIDF Transformation

2015-08-10 Thread pradyumnad
If you want to convert the hash to word, the very thought defies the usage of hashing. You may map the words with hashing, but that wouldn't be good. -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/TFIDF-Transformation-tp24086p24203.html Sent fro

Re: TFIDF Transformation

2015-08-04 Thread clark djilo kuissu
tp://apache-spark-user-list.1001560.n3.nabble.com/TFIDF-Transformation-tp24086.html Sent from the Apache Spark User List mailing list archive at Nabble.com. - To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional

Re: TFIDF Transformation

2015-08-04 Thread Yanbo Liang
f > transform(document: > Iterable[_]): > Vector > = > { blah blah blah} ———> This part of the code does the counting and spreads > the current array into two separate ones using Vectors.sparse. > > > Thanks in advance and I hope to hear from you soon! > Best, &

TFIDF Transformation

2015-07-30 Thread hans ziqiu li
es the counting and spreads the current array into two separate ones using Vectors.sparse. Thanks in advance and I hope to hear from you soon! Best, Hans -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/TFIDF-Transformation-tp24086.html Sent from the Apache

TFIDF Transformation

2015-07-29 Thread ziqiu.li
Hello spark users, I hope your week is going fantastic! I am having some troubles with the TFIDF in MLlib and was wondering if anyone can point me to the right direction. The data ingestion and the initial term frequency count code taken from the example works fine (I am using the first example