[ https://issues.apache.org/jira/browse/NUTCH-1924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14302984#comment-14302984 ]
Talat UYARER commented on NUTCH-1924:
-------------------------------------

Hi [~rrydziu] and [~lewismc],

First of all, thanks for this wonderful work. I have some comments, listed below:

- Do we need Hadoop 2.x in our Docker images? Docker containers are meant to be small, and Nutch can run in local mode. IMHO, building Hadoop and the other components is overkill.
- At present, Nutch 2.x reads from and writes to HBase, which runs on either Hadoop 1 or Hadoop 2. Nutch 2.x cannot run on Hadoop 2 MapReduce, because Nutch's Hadoop 1 dependencies conflict with a Hadoop 2 cluster. In our Dockerfile, HBase is compiled for Hadoop 2. AFAIK this is unnecessary: there is no difference between HBase 0.94 clients.
- Our Dockerfile is too big. We could update it this way: create a Dockerfile in which HBase and Nutch (local mode) run without Hadoop 2.

> Nutch + HBase Docker
> --------------------
>
>                 Key: NUTCH-1924
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1924
>             Project: Nutch
>          Issue Type: Sub-task
>          Components: build
>            Reporter: Lewis John McGibbney
>            Assignee: Radosław Stankiewicz
>             Fix For: 2.3.1
>
>         Attachments: NUTCH-1924.patch
>
>
> ZooKeeper 3.4.5
> Hadoop 0.20.204
> HBase 0.90.4
> Nutch 2.2.1
> https://registry.hub.docker.com/u/stankiewicz/hbase_hadoop_nutch/dockerfile/