When I've tried to create concurrent CLI sessions, I thought the second one failed with an error about not being able to lock the metadata store.
Is that error a real thing, or have I been mistaken this whole time?

--Tom

On Thursday, January 10, 2013, Qiang Wang wrote:

> The HWI will create a CLI session for each query through the Hive libs, so
> several queries can run concurrently.
>
>
> 2013/1/11 Tom Brown <tombrow...@gmail.com>
>
>> How is concurrency achieved with this solution?
>>
>>
>> On Thursday, January 10, 2013, Qiang Wang wrote:
>>
>>> I believe the HWI (Hive Web Interface) can give you a hand.
>>>
>>> https://github.com/anjuke/hwi
>>>
>>> You can use the HWI to submit and run queries concurrently.
>>> Partition management can be achieved by creating crontabs using the HWI.
>>>
>>> It's simple and easy to use. Hope it helps.
>>>
>>> Regards,
>>> Qiang
>>>
>>>
>>> 2013/1/11 Tom Brown <tombrow...@gmail.com>
>>>
>>>> All,
>>>>
>>>> I want to automate jobs against Hive (using an external table with
>>>> ever-growing partitions), and I'm running into a few challenges:
>>>>
>>>> Concurrency - If I run Hive as a thrift server, I can only safely run
>>>> one job at a time. As such, it seems like my best bet will be to run
>>>> it from the command line and set up a brand new instance for each job.
>>>> That's quite a bit of a hassle to solve a seemingly common problem, so
>>>> I want to know if there are any accepted patterns or best practices
>>>> for this?
>>>>
>>>> Partition management - New partitions will be added regularly. If I
>>>> have to set up multiple instances of Hive for each (potentially)
>>>> overlapping job, it will be difficult to keep track of the partitions
>>>> that have been added. In the context of the preceding question, what
>>>> is the best way to add metadata about new partitions?
>>>>
>>>> Thanks in advance!
>>>>
>>>> --Tom
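
For reference, the "one CLI instance per job" approach and the partition question from the original message could be sketched roughly as below. This is only an illustration, not something anyone in the thread posted: the helper names (`add_partition_sql`, `run_hive`, `run_jobs`), the table name, and the paths are all made up, and it assumes the `hive` CLI is on the PATH. Whether several CLI processes can actually share the metastore at once is exactly the open question above, so the concurrent part should be treated as untested. The `IF NOT EXISTS` clause is what lets overlapping jobs register the same partition without having to track what was already added.

```python
import subprocess
from concurrent.futures import ThreadPoolExecutor


def add_partition_sql(table, spec, location):
    """Build an idempotent ADD PARTITION statement (hypothetical helper)."""
    cols = ", ".join(f"{k}='{v}'" for k, v in spec.items())
    return (
        f"ALTER TABLE {table} ADD IF NOT EXISTS "
        f"PARTITION ({cols}) LOCATION '{location}'"
    )


def run_hive(sql):
    """Run one statement in its own Hive CLI process; return its exit code."""
    # Assumes `hive` is on the PATH; `-e` executes the given query string.
    return subprocess.run(["hive", "-e", sql]).returncode


def run_jobs(statements, max_workers=2):
    """Run each statement in a separate CLI process, a few at a time."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(run_hive, statements))
```

A job would then call something like `run_jobs([add_partition_sql("events", {"dt": "2013-01-10"}, "/data/events/dt=2013-01-10")])` before running its query, and check the returned exit codes for failures.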