[ 
https://issues.apache.org/jira/browse/NUTCH-1924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14302984#comment-14302984
 ] 

Talat UYARER commented on NUTCH-1924:
-------------------------------------

Hi [~rrydziu] and [~lewismc],

First of all Thanks for this wonderful stuff. I have some comments.  You can 
see below:
- Do we need Hadoop 2.x on our docker images ?  Dockers small containers. Nutch 
can run local mode. Building Hadoop and other things. IMHO they are overkill.  
- At the present Nutch 2.x write/read on Hbase that runs on Hadoop 2 or Hadoop 
1. Nutch 2.x can not run on Map Reduce of Hadoop 2. Because of Nutch's Hadoop 1 
dependecies confilct with Hadoop 2 cluster.  In our dockerfile Hbase is 
compiled for Hadoop 2. AFAIK This is unnecessary. There is not difference 
clients of Hbase 0.94
- Our docker file is too big. we can update it this way: We create a docker 
file which Hbase and Nutch (local mode) run on it without Hadoop 2.



> Nutch + HBase Docker
> --------------------
>
>                 Key: NUTCH-1924
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1924
>             Project: Nutch
>          Issue Type: Sub-task
>          Components: build
>            Reporter: Lewis John McGibbney
>            Assignee: Radosław Stankiewicz
>             Fix For: 2.3.1
>
>         Attachments: NUTCH-1924.patch
>
>
> ZooKeeper 3.4.5 Hadoop 0.20.204 HBase 0.90.4 Nutch 2.2.1
> https://registry.hub.docker.com/u/stankiewicz/hbase_hadoop_nutch/dockerfile/



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to