OK, so the solution to the issue was to add the following to my core-site.xml
fs.file.impl
org.apache.hadoop.fs.LocalFileSystem The
FileSystem for file: uris.
fs.hdfs.impl
org.apache.hadoop.hdfs.DistributedFileSystem The
FileSystem for hdfs: uris.
On Monday, March
Mahout is on Spark 1.1.0 (before last week) and 1.1.1 as of current master.
Running locally should use these but make sure these are installed if you run
with anything other than —master local
The next thing to try is see which versions of Hadoop both Mahout and Spark are
compiled for, they mus
Here is what I get with hadoop fs -ls
-rw-r--r-- 1 username supergroup 5510526 2015-03-09 11:10 transactions.csv
Yes, I am trying to run a local version of Spark (trying to run everything
local at the moment)
and when I run
./bin/mahout spark-itemsimilarity -i transactions.csv -o output -fc
From the command line can you run:
hadoop fs -ls
And see SomeDir/transactions.csv? It looks like HDFS is not accessible from
wherever you are running spark-itemsimilarity.
Are you trying to run a local version of Spark because the default is "--master
local” This can still access a cluster
bump...anybody???
On Wednesday, March 4, 2015 9:22 PM, Jeff Isenhart
wrote:
I am having issue getting a simple itemsimilarity example to work. I know
hadoop is up and functional (ran the example mapreduce program anyway)
But when I run either of these
./mahout spark-itemsimilarity -
I am having issue getting a simple itemsimilarity example to work. I know
hadoop is up and functional (ran the example mapreduce program anyway)
But when I run either of these
./mahout spark-itemsimilarity -i "SomeDir/transactions.csv" -o
"hdfs://localhost:9000/users/someuser/output" -fc 1 -ic 2