Thank you, Adriaan. Mark
On Tue, Sep 27, 2011 at 2:20 AM, Adriaan Tijsseling <adri...@tijsseling.com>wrote: > Use a regexserde to split the text in words. There's documentation on the > hive wiki. > But it might be better to use a script. See the post by Shouguo Li earlier > on this mailing list. > After all, when you use a Python script, for example, you could use the > Natural Language Processing Toolkit to get much better function that splits > text into a list of words. > > Adriaan > > On 2011/09/27, at 05:40, Mark Kerzner wrote: > > > Hi, > > > > a simple question - if I have a book as a text, and I want to load it > into a > > Hive table, with one word forming one entry, how should I do it? > > > > Thank you, > > Mark > >