Re: Use DTS instead of DFS for data warehouse

2010-02-04 Thread jian yi
Hi hammer, Thank you. I don't know Zebra, but it sounds very good. > From: Jeff Hammerbacher > Date: February 5, 2010, 4:33 AM > Subject: Re: Use DTS instead of DFS for data warehouse > To: hdfs-dev@hadoop.apache.org > > Hey 易剑, > > Your propo

Re: Use DTS instead of DFS for data warehouse

2010-02-04 Thread Jeff Hammerbacher
Hey 易剑, Your proposed system sounds quite a bit like Zebra, which is a contributed project under the Pig subproject: http://wiki.apache.org/pig/zebra. Have you taken a look at Zebra? Thanks, Jeff 2010/2/4 易剑 > *Glossary* > DTS: Distributed Table System, not a bigtable > DFS: Distributed File S

Use DTS instead of DFS for data warehouse

2010-02-04 Thread 易剑
*Glossary* DTS: Distributed Table System, not a bigtable DFS: Distributed File System DFS is better for unstructured data, but DTS is better for structured data. A data warehouse is structured, so I think a table is better than a file. DTS works as follows: 1. Break a big logical table into many physical sma