1- Does partitioning improve performance? --Only if you make use of partitions in your queries (mostly in where clause to limit data to your query for a specific value of partitioned column)
2- Do i have to create partition table new or i can create partition on existing table by renaming that date column and add partition column event_date (the actual column name) ? you can not create partitions on already existing data unless the data is in partitioned directories on hdfs. I would recommend create a new table with partitioned columns. load data from old table into partitioned table dump old table 3- can i import data directly into partition table using sqoop command? you can import data directly into a partition. for exported data, you don't have to worry. it remains as it is On Tue, Jun 4, 2013 at 12:41 PM, Hamza Asad <hamza.asa...@gmail.com> wrote: > No i don't want to change my queries. I want that my queries work on same > table and partition does not change its schema. > and from schema i means schema on mysql (exported data). > > Few more things > 1- Does partitioning improve performance? > 2- Do i have to create partition table new or i can create partition on > existing table by renaming that date column and add partition column > event_date (the actual column name) ? > 3- can i import data directly into partition table using sqoop command? > > > > > On Tue, Jun 4, 2013 at 11:40 AM, Nitin Pawar <nitinpawar...@gmail.com>wrote: > >> partitioning of data in hive is more for the reasons on how you layout >> data in a well defined manner so that when you access your data , you >> request only for specific data by specifying the partition columns in where >> clause. >> >> to answer your question, >> do you have to change your queries? out of the box the queries should >> work as it is unless and until you are changing the table schema by >> removing/adding new columns. >> does the format change when you export data? if your select statement is >> not changing it will not change >> will table schema change? do you mean schema on hive or mysql ? >> >> >> On Tue, Jun 4, 2013 at 11:37 AM, Hamza Asad <hamza.asa...@gmail.com>wrote: >> >>> thats far more better :) .. >>> Please tell me few more things. Do i have to change my query if i create >>> table with partition on date? rest of the columns would be same as it is? >>> Also if i export that partitioned table to mysql, does schema of that table >>> would same as it was before partition? >>> >>> >>> On Tue, Jun 4, 2013 at 12:09 AM, Stephen Sprague <sprag...@gmail.com>wrote: >>> >>>> there is no delete semantic. >>>> >>>> you either partition on the data you want to drop and use drop >>>> partition (or drop table for the whole shebang) or you can do as Nitin >>>> suggests by selecting the inverse of the data you want to delete and store >>>> it back into the table itself. Not ideal but maybe it could work for your >>>> situation. >>>> >>>> Now here's another idea. This was just _recently_ discussed on this >>>> group as coincidence would have it. if you were to have scanned just a >>>> little of the groups messages you would have seen that and could then have >>>> added to the discussion! :) >>>> >>>> >>>> On Mon, Jun 3, 2013 at 2:19 AM, Hamza Asad <hamza.asa...@gmail.com>wrote: >>>> >>>>> Thanx for your response nitin. Anybody else have any better solution? >>>>> >>>>> >>>>> On Mon, Jun 3, 2013 at 1:27 PM, Nitin Pawar >>>>> <nitinpawar...@gmail.com>wrote: >>>>> >>>>>> hive does not give you a record level deletion as of now. >>>>>> >>>>>> so unless you have partitioned, other option is you overwrite the >>>>>> table with data which you want >>>>>> please wait for others to suggest you more options. this one is just >>>>>> mine and can be costly too >>>>>> >>>>>> >>>>>> On Mon, Jun 3, 2013 at 12:36 PM, Hamza Asad >>>>>> <hamza.asa...@gmail.com>wrote: >>>>>> >>>>>>> no, its not partitioned by date. >>>>>>> >>>>>>> >>>>>>> On Mon, Jun 3, 2013 at 11:19 AM, Nitin Pawar < >>>>>>> nitinpawar...@gmail.com> wrote: >>>>>>> >>>>>>>> how is the data laid out? >>>>>>>> is it partitioned data by the date? >>>>>>>> >>>>>>>> >>>>>>>> On Mon, Jun 3, 2013 at 11:20 AM, Hamza Asad <hamza.asa...@gmail.com >>>>>>>> > wrote: >>>>>>>> >>>>>>>>> Dear all, >>>>>>>>> How can i remove data of specific dates from HDFS >>>>>>>>> using hive query language? >>>>>>>>> >>>>>>>>> -- >>>>>>>>> *Muhammad Hamza Asad* >>>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> -- >>>>>>>> Nitin Pawar >>>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>> -- >>>>>>> *Muhammad Hamza Asad* >>>>>>> >>>>>> >>>>>> >>>>>> >>>>>> -- >>>>>> Nitin Pawar >>>>>> >>>>> >>>>> >>>>> >>>>> -- >>>>> *Muhammad Hamza Asad* >>>>> >>>> >>>> >>> >>> >>> -- >>> *Muhammad Hamza Asad* >>> >> >> >> >> -- >> Nitin Pawar >> > > > > -- > *Muhammad Hamza Asad* > -- Nitin Pawar