Nandakumar created HDFS-12213:
---------------------------------

             Summary: Ozone: Corona: Support for online mode
                 Key: HDFS-12213
                 URL: https://issues.apache.org/jira/browse/HDFS-12213
             Project: Hadoop HDFS
          Issue Type: Sub-task
          Components: ozone
            Reporter: Nandakumar
            Assignee: Nandakumar

This jira adds support for online mode in Corona. In online mode, Common Crawl data from AWS is used to populate Ozone with data. The default source is [CC-MAIN-2017-17/warc.paths.gz | https://commoncrawl.s3.amazonaws.com/crawl-data/CC-MAIN-2017-17/warc.paths.gz] (it contains the paths to the actual data segments); the user can override this with -source.

The following values are derived from the URL of the Common Crawl data:
* Domain will be used as Volume
* URL will be used as Bucket
* FileName will be used as Key
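
The URL-to-name mapping described in this issue could be sketched roughly as follows. This is an illustration only, not the Corona implementation: the helper names `segment_urls` and `derive_target`, the `OzoneTarget` type, and the `BASE_URL` prefix are hypothetical. It also assumes that "Domain" means the host part of the URL, "URL" means the directory portion of the path, and "FileName" means the last path component; the issue does not spell these out. No name sanitization is attempted, although a real volume or bucket name may need it.

```python
import gzip
import posixpath
from typing import List, NamedTuple
from urllib.parse import urlsplit

# Assumption: entries in warc.paths.gz are relative paths that resolve
# against the host of the default source URL quoted in this issue.
BASE_URL = "https://commoncrawl.s3.amazonaws.com/"


class OzoneTarget(NamedTuple):
    """Hypothetical holder for the three names derived from one URL."""
    volume: str
    bucket: str
    key: str


def segment_urls(paths_gz: bytes, base_url: str = BASE_URL) -> List[str]:
    """Decompress a warc.paths.gz payload and return one full URL per line."""
    text = gzip.decompress(paths_gz).decode("utf-8")
    return [base_url + line.strip() for line in text.splitlines() if line.strip()]


def derive_target(url: str) -> OzoneTarget:
    """Split a URL into (volume, bucket, key) under the stated assumptions."""
    parts = urlsplit(url)
    # Separate the directory portion of the path from the file name.
    directory, file_name = posixpath.split(parts.path)
    return OzoneTarget(
        volume=parts.hostname or "",   # Domain   -> Volume
        bucket=directory.strip("/"),   # URL path -> Bucket (assumed meaning)
        key=file_name,                 # FileName -> Key
    )
```

For example, calling `derive_target` on the default source URL above gives `commoncrawl.s3.amazonaws.com` as the volume, `crawl-data/CC-MAIN-2017-17` as the bucket, and `warc.paths.gz` as the key under this interpretation.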