Hello:
I have analysis apache log from hive, but there is a problem
When I write hive command in Script file and use crontab for schedule it
The result is different with run in hive container
The attachment file is 2 way process detail
Hive_error.txt is run hive command in script
Hive_normal.txt is run hive command in hive container
Best Regard
MSN: [email protected]
SKYPE: chen0727
Mobil: 886-937545215
Tel: 886-2-8798-2988 #222
Fax:886-2-8751-5499
hadoop@hadoop-00:~$ uname
Linux
hadoop@hadoop-00:~$ cat /proc/version
Linux version 2.6.35-22-server (buildd@allspice) (gcc version 4.4.5
(Ubuntu/Linaro 4.4.4-14ubuntu4) ) #33-Ubuntu SMP Sun Sep 19 20:48:58 UTC 2010
/home/hadoop/hive-0.6.0/bin/hive -e "insert overwrite table varnish_data select
host,to_date(from_unixtime(unix_timestamp(regexp_extract(time, '^.([^:]*):.*',
1),'dd/MMM/yyyy'))),substr(regexp_extract(request, 'http://(\\S*)\/..', 1), 1,
instr(regexp_extract(request, 'http://(\\S*)\/..',
1),'/')-1),status,size,regexp_extract(referer, 'http://(\\S*)/', 1),agent FROM
varnish;"
hive> desc varnish_data;
OK
host string
time string
request string
status string
size int
referer string
agent string
Time taken: 0.073 seconds
hive> select * from varnish_data limit 5;
OK
115.87.233.142 2011-08-04 200 5012 "Mozilla/4.0
(compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; .NET CLR 2.0.50727; .NET
CLR 3.0.4506.2152; .NET CLR 3.5.30729; InfoPath.2)"
202.76.19.134 2011-08-04 200 397 "Mozilla/4.0
(compatible; MSIE 6.0; Windows NT 5.2; SV1; .NET CLR 1.1.4322)"
171.243.127.174 2011-08-04 200 941 "Mozilla/4.0
(compatible; MSIE 6.0; Windows NT 5.1; SV1; GTB7.1; MS Internet Explorer; .NET
CLR 2.0.50727)"
121.33.94.45 2011-08-04 200 4273 "Mozilla/4.0
(compatible; MSIE 6.0; Windows NT 5.1; SV1; .NET4.0C; .NET4.0E)"
118.68.36.97 2011-08-04 200 941 "Mozilla/4.0
(compatible; MSIE 6.0; Windows NT 5.1; SV1)"
Time taken: 0.491 seconds
hadoop@hadoop-00:~$ uname
Linux
hadoop@hadoop-00:~$ cat /proc/version
Linux version 2.6.35-22-server (buildd@allspice) (gcc version 4.4.5
(Ubuntu/Linaro 4.4.4-14ubuntu4) ) #33-Ubuntu SMP Sun Sep 19 20:48:58 UTC 2010
/home/hadoop/hive-0.6.0/bin/hive -e "insert overwrite table varnish_data select
host,to_date(from_unixtime(unix_timestamp(regexp_extract(time, '^.([^:]*):.*',
1),'dd/MMM/yyyy'))),substr(regexp_extract(request, 'http://(\\S*)\/..', 1), 1,
instr(regexp_extract(request, 'http://(\\S*)\/..',
1),'/')-1),status,size,regexp_extract(referer, 'http://(\\S*)/', 1),agent FROM
varnish;"
hive> desc varnish_data;
OK
host string
time string
request string
status string
size int
referer string
agent string
Time taken: 0.073 seconds
hive> insert overwrite table varnish_data select
host,to_date(from_unixtime(unix_timestamp(regexp_extract(time, '^.([^:]*):.*',
1),'dd/MMM/yyyy'))),substr(regexp_extract(request, 'http://(\\S*)\/..', 1), 1,
instr(regexp_extract(request, 'http://(\\S*)\/..',
1),'/')-1),status,size,regexp_extract(referer, 'http://(\\S*)/', 1),agent FROM
varnish;
hive> select * from varnish_data limit 5;
OK
115.87.233.142 2011-08-04 image.155551.com 200
5012 7uwak.155551.com "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT
5.1; Trident/4.0; .NET CLR 2.0.50727; .NET CLR 3.0.4506.2152; .NET CLR
3.5.30729; InfoPath.2)"
202.76.19.134 2011-08-04 image.8899988.com 200 397
dxiqd.8899988.com "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.2;
SV1; .NET CLR 1.1.4322)"
171.243.127.174 2011-08-04 image.333332.com 200 941
a5vpni7x.333332.com "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1;
SV1; GTB7.1; MS Internet Explorer; .NET CLR 2.0.50727)"
121.33.94.45 2011-08-04 image.8899988.com 200 4273
u1u1u.8899988.com "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1;
SV1; .NET4.0C; .NET4.0E)"
118.68.36.97 2011-08-04 image.333332.com 200 941
q9r8m.333332.com "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1;
SV1)"
Time taken: 0.418 seconds