[ https://issues.apache.org/jira/browse/HIVE-9557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14603987#comment-14603987 ]
Nishant Kelkar commented on HIVE-9557:
--------------------------------------

Hi [~apivovarov],

I followed your instructions, and everything went fine until the step where I run TestCliDriver with 'mvn test'. I get the following exception in ./itests/qtest/tmp/log/hive.log:

{code}
2015-06-26 22:25:47,656 DEBUG [main]: util.Shell (Shell.java:checkHadoopHome(320)) - Failed to detect a valid hadoop home directory
java.io.IOException: HADOOP_HOME or hadoop.home.dir are not set.
    at org.apache.hadoop.util.Shell.checkHadoopHome(Shell.java:302)
    at org.apache.hadoop.util.Shell.<clinit>(Shell.java:327)
    at org.apache.hadoop.hive.conf.HiveConf$ConfVars.findHadoopBinary(HiveConf.java:2371)
    at org.apache.hadoop.hive.conf.HiveConf$ConfVars.<clinit>(HiveConf.java:366)
    at org.apache.hadoop.hive.conf.HiveConf.<clinit>(HiveConf.java:105)
    at org.apache.hadoop.hive.ql.QTestUtil.<init>(QTestUtil.java:354)
    at org.apache.hadoop.hive.cli.TestCliDriver.<clinit>(TestCliDriver.java:53)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:606)
    at org.junit.internal.runners.SuiteMethod.testFromSuiteMethod(SuiteMethod.java:35)
    at org.junit.internal.runners.SuiteMethod.<init>(SuiteMethod.java:24)
    at org.junit.internal.builders.SuiteMethodBuilder.runnerForClass(SuiteMethodBuilder.java:11)
    at org.junit.runners.model.RunnerBuilder.safeRunnerForClass(RunnerBuilder.java:59)
    at org.junit.internal.builders.AllDefaultPossibilitiesBuilder.runnerForClass(AllDefaultPossibilitiesBuilder.java:26)
    at org.junit.runners.model.RunnerBuilder.safeRunnerForClass(RunnerBuilder.java:59)
    at org.junit.internal.requests.ClassRequest.getRunner(ClassRequest.java:26)
    at org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:262)
    at org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:153)
    at org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:124)
    at org.apache.maven.surefire.booter.ForkedBooter.invokeProviderInSameClassLoader(ForkedBooter.java:200)
    at org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:153)
    at org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:103)
2015-06-26 22:25:47,669 DEBUG [main]: util.Shell (Shell.java:isSetsidSupported(392)) - setsid is not available on this machine. So not using it.
2015-06-26 22:25:47,669 DEBUG [main]: util.Shell (Shell.java:isSetsidSupported(396)) - setsid exited with exit code 0
2015-06-26 22:25:48,408 WARN [main]: conf.HiveConf (HiveConf.java:initialize(2802)) - HiveConf of name hive.dummyparam.test.server.specific.config.metastoresite does not exist
2015-06-26 22:25:48,409 WARN [main]: conf.HiveConf (HiveConf.java:initialize(2802)) - HiveConf of name hive.ql.log.PerfLogger.level does not exist
2015-06-26 22:25:48,409 WARN [main]: conf.HiveConf (HiveConf.java:initialize(2802)) - HiveConf of name hive.dummyparam.test.server.specific.config.hivesite does not exist
2015-06-26 22:25:48,409 WARN [main]: conf.HiveConf (HiveConf.java:initialize(2802)) - HiveConf of name hive.dummyparam.test.server.specific.config.override does not exist
2015-06-26 22:25:48,410 WARN [main]: conf.HiveConf (HiveConf.java:initialize(2802)) - HiveConf of name hive.metastore.metadb.dir does not exist
2015-06-26 22:25:48,477 INFO [main]: server.ZooKeeperServer (Environment.java:logEnv(100)) - Server environment:zookeeper.version=3.4.6-1569965, built on 02/20/2014 09:09 GMT
2015-06-26 22:25:48,477 INFO [main]: server.ZooKeeperServer (Environment.java:logEnv(100)) - Server environment:zookeeper.version=3.4.6-1569965, built on 02/20/2014 09:09 GMT
2015-06-26 22:25:48,477 INFO [main]: server.ZooKeeperServer (Environment.java:logEnv(100)) - Server environment:host.name=localhost
2015-06-26 22:25:48,477 INFO [main]: server.ZooKeeperServer (Environment.java:logEnv(100)) - Server environment:host.name=localhost
2015-06-26 22:25:48,477 INFO [main]: server.ZooKeeperServer (Environment.java:logEnv(100)) - Server environment:java.version=1.7.0_67
2015-06-26 22:25:48,477 INFO [main]: server.ZooKeeperServer (Environment.java:logEnv(100)) - Server environment:java.version=1.7.0_67
2015-06-26 22:25:48,477 INFO [main]: server.ZooKeeperServer (Environment.java:logEnv(100)) - Server environment:java.vendor=Oracle Corporation
2015-06-26 22:25:48,477 INFO [main]: server.ZooKeeperServer (Environment.java:logEnv(100)) - Server environment:java.vendor=Oracle Corporation
2015-06-26 22:25:48,477 INFO [main]: server.ZooKeeperServer (Environment.java:logEnv(100)) - Server environment:java.home=/Library/Java/JavaVirtualMachines/jdk1.7.0_67.jdk/Contents/Home/jre
2015-06-26 22:25:48,477 INFO [main]: server.ZooKeeperServer (Environment.java:logEnv(100)) - Server environment:java.home=/Library/Java/JavaVirtualMachines/jdk1.7.0_67.jdk/Contents/Home/jre
{code}

I'm on Mac OS X 10.9.5, by the way, and don't have Hadoop set up. I thought, however, that Hive spins up in memory and runs in isolation?

> create UDF to measure strings similarity using Cosine Similarity algo
> ---------------------------------------------------------------------
>
>                 Key: HIVE-9557
>                 URL: https://issues.apache.org/jira/browse/HIVE-9557
>             Project: Hive
>          Issue Type: Improvement
>          Components: UDF
>            Reporter: Alexander Pivovarov
>            Assignee: Nishant Kelkar
>              Labels: CosineSimilarity, SimilarityMetric, UDF
>         Attachments: udf_cosine_similarity-v01.patch
>
>
> algo description http://en.wikipedia.org/wiki/Cosine_similarity
> {code}
> --one word different, total 2 words
> str_sim_cosine('Test String1', 'Test String2') = (2 - 1) / 2 = 0.5f
> {code}
> reference implementation:
> https://github.com/Simmetrics/simmetrics/blob/master/src/uk/ac/shef/wit/simmetrics/similaritymetrics/CosineSimilarity.java

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
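
For reference, the 0.5 in the issue's example can be reproduced with a small token-set cosine similarity. This is only a sketch under stated assumptions: it is not the attached patch, the class and method names (CosineSketch, tokens, similarity) are hypothetical, and it assumes whitespace tokenization with the score defined as the number of shared distinct tokens divided by the square root of the product of the two token-set sizes. Whether the linked Simmetrics class or the proposed UDF handles empty input or tokenization the same way is not confirmed here.

```java
import java.util.HashSet;
import java.util.Set;

// Hypothetical illustration only; not the code from udf_cosine_similarity-v01.patch.
final class CosineSketch {

    // Split on whitespace and keep the distinct, non-empty tokens.
    static Set<String> tokens(String s) {
        Set<String> out = new HashSet<String>();
        for (String t : s.trim().split("\\s+")) {
            if (!t.isEmpty()) {
                out.add(t);
            }
        }
        return out;
    }

    // Assumed definition: shared tokens / sqrt(|A| * |B|) over distinct tokens.
    // Returning 0 for empty input is a choice made for this sketch.
    static float similarity(String a, String b) {
        Set<String> ta = tokens(a);
        Set<String> tb = tokens(b);
        if (ta.isEmpty() || tb.isEmpty()) {
            return 0.0f;
        }
        Set<String> common = new HashSet<String>(ta);
        common.retainAll(tb);
        double denominator = Math.sqrt((double) ta.size() * tb.size());
        return (float) (common.size() / denominator);
    }

    public static void main(String[] args) {
        // One shared token out of two on each side: 1 / sqrt(2 * 2) = 0.5
        System.out.println(similarity("Test String1", "Test String2"));
    }
}
```

With two tokens per string and one in common, this gives 1 / sqrt(2 * 2) = 0.5, the same value as the "(2 - 1) / 2" arithmetic in the description; the two expressions would not necessarily agree for other inputs.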