On Tue, Jun 22, 2010 at 9:47 PM, Sharma, Ashish <ashish.shar...@hp.com> wrote:
> Hi,
>
> I have deployed an mail receiving postfix server combined with amavisd (with 
> clamAV and spamassassin) by using the reference: 
> http://wiki.centos.org/HowTos/Amavisd
>
> on a CentOS 5.5(64 bit) machine.
>
> Now I have deployed the FuzzyOCR module for image spam, but the test cases as 
> provided for fuzzy OCR are not getting passed.
>
> spamassassin version : 3.3.1
> Perl version: 5.8.8
> FuzzyOcr version: 3.6.0
> Postfix version: 3.6.5
>
> Following is one such debug output that I am getting:
>
> $ spamassassin --debug FuzzyOcr < 
> /mnt/fuzzyOCR/FuzzyOcr-3.6.0/samples/ocr-wrongext.eml > /dev/null
>
> Jun 22 15:44:48.653 [12849] dbg: FuzzyOcr: focr_bin_helper: 
> 'pnmnorm,pnminvert,convert,ppmtopgm,tesseract'
> Jun 22 15:44:48.653 [12849] info: FuzzyOcr: Adding <5> new helper apps
> Jun 22 15:44:48.665 [12849] info: FuzzyOcr: Starting preprocessor parser for 
> file "/etc/mail/spamassassin/FuzzyOcr.preps"...
> Jun 22 15:44:48.665 [12849] dbg: FuzzyOcr: line: preprocessor normalize {
> Jun 22 15:44:48.665 [12849] dbg: FuzzyOcr: line: command = pnmnorm
> Jun 22 15:44:48.665 [12849] dbg: FuzzyOcr: line: }
> Jun 22 15:44:48.665 [12849] dbg: FuzzyOcr: line: preprocessor invert {
> Jun 22 15:44:48.665 [12849] dbg: FuzzyOcr: line: command = pnminvert
> Jun 22 15:44:48.665 [12849] dbg: FuzzyOcr: line: }
> Jun 22 15:44:48.665 [12849] dbg: FuzzyOcr: line: preprocessor ppmtopgm {
> Jun 22 15:44:48.665 [12849] dbg: FuzzyOcr: line: command = ppmtopgm
> Jun 22 15:44:48.665 [12849] dbg: FuzzyOcr: line: }
> Jun 22 15:44:48.665 [12849] dbg: FuzzyOcr: line: preprocessor pamtopnm {
> Jun 22 15:44:48.665 [12849] dbg: FuzzyOcr: line: command = pamtopnm
> Jun 22 15:44:48.666 [12849] dbg: FuzzyOcr: line: }
> Jun 22 15:44:48.666 [12849] dbg: FuzzyOcr: line: preprocessor pamthreshold {
> Jun 22 15:44:48.666 [12849] dbg: FuzzyOcr: line: command = pamthreshold
> Jun 22 15:44:48.666 [12849] dbg: FuzzyOcr: line: args = -simple -threshold 0.5
> Jun 22 15:44:48.666 [12849] dbg: FuzzyOcr: line: }
> Jun 22 15:44:48.666 [12849] dbg: FuzzyOcr: line: preprocessor maketiff {
> Jun 22 15:44:48.666 [12849] dbg: FuzzyOcr: line: command = pnmtotiff
> Jun 22 15:44:48.666 [12849] dbg: FuzzyOcr: line: args = -color -truecolor
> Jun 22 15:44:48.666 [12849] dbg: FuzzyOcr: line: }
> Jun 22 15:44:48.666 [12849] info: FuzzyOcr: Starting scanset parser for file 
> "/etc/mail/spamassassin/FuzzyOcr.scansets"...
> Jun 22 15:44:48.666 [12849] dbg: FuzzyOcr: line scanset ocrad {
> Jun 22 15:44:48.666 [12849] dbg: FuzzyOcr: line command = $ocrad
> Jun 22 15:44:48.666 [12849] dbg: FuzzyOcr: line args = -s5 $input
> Jun 22 15:44:48.667 [12849] dbg: FuzzyOcr: line }
> Jun 22 15:44:48.667 [12849] dbg: FuzzyOcr: line scanset ocrad-invert {
> Jun 22 15:44:48.667 [12849] dbg: FuzzyOcr: line command = $ocrad
> Jun 22 15:44:48.667 [12849] dbg: FuzzyOcr: line args = -s5 -i $input
> Jun 22 15:44:48.667 [12849] dbg: FuzzyOcr: line }
> Jun 22 15:44:48.667 [12849] dbg: FuzzyOcr: line scanset 
> ocrad-decolorize-invert {
> Jun 22 15:44:48.667 [12849] dbg: FuzzyOcr: line preprocessors = ppmtopgm, 
> pamthreshold, pamtopnm
> Jun 22 15:44:48.667 [12849] dbg: FuzzyOcr: line command = $ocrad
> Jun 22 15:44:48.667 [12849] dbg: FuzzyOcr: line args = -s5 -i $input
> Jun 22 15:44:48.667 [12849] dbg: FuzzyOcr: line }
> Jun 22 15:44:48.667 [12849] dbg: FuzzyOcr: line scanset ocrad-decolorize {
> Jun 22 15:44:48.667 [12849] dbg: FuzzyOcr: line preprocessors = ppmtopgm, 
> pamthreshold, pamtopnm
> Jun 22 15:44:48.667 [12849] dbg: FuzzyOcr: line command = $ocrad
> Jun 22 15:44:48.667 [12849] dbg: FuzzyOcr: line args = -s5 $input
> Jun 22 15:44:48.667 [12849] dbg: FuzzyOcr: line }
> Jun 22 15:44:48.668 [12849] dbg: FuzzyOcr: line scanset gocr {
> Jun 22 15:44:48.668 [12849] dbg: FuzzyOcr: line command = $gocr
> Jun 22 15:44:48.668 [12849] dbg: FuzzyOcr: line args = -i $input
> Jun 22 15:44:48.668 [12849] dbg: FuzzyOcr: line }
> Jun 22 15:44:48.668 [12849] dbg: FuzzyOcr: line scanset gocr-180 {
> Jun 22 15:44:48.668 [12849] dbg: FuzzyOcr: line command = $gocr
> Jun 22 15:44:48.668 [12849] dbg: FuzzyOcr: line args = -l 180 -d 2 -i $input
> Jun 22 15:44:48.668 [12849] dbg: FuzzyOcr: line }
> Jun 22 15:44:49.009 [12849] info: FuzzyOcr: Searching in: 
> /usr/local/netpbm/bin
> Jun 22 15:44:49.009 [12849] info: FuzzyOcr: Searching in: /usr/local/bin
> Jun 22 15:44:49.009 [12849] info: FuzzyOcr: Searching in: /usr/bin
> Jun 22 15:44:49.010 [12849] info: FuzzyOcr: Using gifsicle => 
> /usr/bin/gifsicle
> Jun 22 15:44:49.010 [12849] info: FuzzyOcr: Using giffix => /usr/bin/giffix
> Jun 22 15:44:49.010 [12849] info: FuzzyOcr: Using giftext => /usr/bin/giftext
> Jun 22 15:44:49.010 [12849] info: FuzzyOcr: Using gifinter => 
> /usr/bin/gifinter
> Jun 22 15:44:49.010 [12849] info: FuzzyOcr: Using giftopnm => 
> /usr/bin/giftopnm
> Jun 22 15:44:49.010 [12849] info: FuzzyOcr: Using jpegtopnm => 
> /usr/bin/jpegtopnm
> Jun 22 15:44:49.010 [12849] info: FuzzyOcr: Using pngtopnm => 
> /usr/bin/pngtopnm
> Jun 22 15:44:49.010 [12849] info: FuzzyOcr: Using bmptopnm => 
> /usr/bin/bmptopnm
> Jun 22 15:44:49.010 [12849] info: FuzzyOcr: Using tifftopnm => 
> /usr/bin/tifftopnm
> Jun 22 15:44:49.010 [12849] info: FuzzyOcr: Using ppmhist => /usr/bin/ppmhist
> Jun 22 15:44:49.010 [12849] info: FuzzyOcr: Using pamfile => /usr/bin/pamfile
> Jun 22 15:44:49.010 [12849] info: FuzzyOcr: Using ocrad => /usr/bin/ocrad
> Jun 22 15:44:49.010 [12849] info: FuzzyOcr: Using gocr => /usr/bin/gocr
> Jun 22 15:44:49.011 [12849] info: FuzzyOcr: Using pnmnorm => /usr/bin/pnmnorm
> Jun 22 15:44:49.011 [12849] info: FuzzyOcr: Using pnminvert => 
> /usr/bin/pnminvert
> Jun 22 15:44:49.011 [12849] info: FuzzyOcr: Using convert => /usr/bin/convert
> Jun 22 15:44:49.011 [12849] info: FuzzyOcr: Using ppmtopgm => 
> /usr/bin/ppmtopgm
> Jun 22 15:44:49.011 [12849] info: FuzzyOcr: Using tesseract => 
> /usr/bin/tesseract
> Jun 22 15:44:49.011 [12849] dbg: FuzzyOcr: Threshold[max_hash] => 5
> Jun 22 15:44:49.011 [12849] dbg: FuzzyOcr: Threshold[c] => 5
> Jun 22 15:44:49.011 [12849] dbg: FuzzyOcr: Threshold[s] => 0.01
> Jun 22 15:44:49.011 [12849] dbg: FuzzyOcr: Threshold[w] => 0.01
> Jun 22 15:44:49.011 [12849] dbg: FuzzyOcr: Threshold[h] => 0.01
> Jun 22 15:44:49.011 [12849] dbg: FuzzyOcr: Threshold[cn] => 0.01
> Jun 22 15:44:49.012 [12849] dbg: FuzzyOcr: focr_add_score => 1
> Jun 22 15:44:49.012 [12849] dbg: FuzzyOcr: focr_autodisable_negative_score => 
> -5
> Jun 22 15:44:49.012 [12849] dbg: FuzzyOcr: focr_autodisable_score => 1000
> Jun 22 15:44:49.012 [12849] dbg: FuzzyOcr: focr_autosort_buffer => 10
> Jun 22 15:44:49.012 [12849] dbg: FuzzyOcr: focr_autosort_scanset => 1
> Jun 22 15:44:49.012 [12849] dbg: FuzzyOcr: focr_base_score => 5
> Jun 22 15:44:49.012 [12849] dbg: FuzzyOcr: focr_corrupt_score => 2.5
> Jun 22 15:44:49.012 [12849] dbg: FuzzyOcr: focr_corrupt_unfixable_score => 5
> Jun 22 15:44:49.012 [12849] dbg: FuzzyOcr: focr_counts_required => 2
> Jun 22 15:44:49.012 [12849] dbg: FuzzyOcr: focr_db_hash => 
> /etc/mail/spamassassin/FuzzyOcr.db
> Jun 22 15:44:49.012 [12849] dbg: FuzzyOcr: focr_db_max_days => 35
> Jun 22 15:44:49.012 [12849] dbg: FuzzyOcr: focr_db_safe => 
> /etc/mail/spamassassin/FuzzyOcr.safe.db
> Jun 22 15:44:49.012 [12849] dbg: FuzzyOcr: focr_digest_db => 
> /etc/mail/spamassassin/FuzzyOcr.hashdb
> Jun 22 15:44:49.012 [12849] dbg: FuzzyOcr: focr_enable_image_hashing => 2
> Jun 22 15:44:49.012 [12849] dbg: FuzzyOcr: focr_global_timeout => 0
> Jun 22 15:44:49.012 [12849] dbg: FuzzyOcr: focr_global_wordlist => 
> /etc/mail/spamassassin/FuzzyOcr.words
> Jun 22 15:44:49.012 [12849] dbg: FuzzyOcr: focr_hashing_learn_scanned => 1
> Jun 22 15:44:49.012 [12849] dbg: FuzzyOcr: focr_keep_bad_images => 0
> Jun 22 15:44:49.013 [12849] dbg: FuzzyOcr: focr_log_pmsinfo => 1
> Jun 22 15:44:49.013 [12849] dbg: FuzzyOcr: focr_log_stderr => 1
> Jun 22 15:44:49.013 [12849] dbg: FuzzyOcr: focr_max_height => 800
> Jun 22 15:44:49.013 [12849] dbg: FuzzyOcr: focr_max_width => 800
> Jun 22 15:44:49.013 [12849] dbg: FuzzyOcr: focr_min_height => 4
> Jun 22 15:44:49.013 [12849] dbg: FuzzyOcr: focr_min_width => 4
> Jun 22 15:44:49.013 [12849] dbg: FuzzyOcr: focr_minimal_scanset => 1
> Jun 22 15:44:49.013 [12849] dbg: FuzzyOcr: focr_mysql_db => FuzzyOcr
> Jun 22 15:44:49.013 [12849] dbg: FuzzyOcr: focr_mysql_hash => Hash
> Jun 22 15:44:49.013 [12849] dbg: FuzzyOcr: focr_mysql_host => localhost
> Jun 22 15:44:49.013 [12849] dbg: FuzzyOcr: focr_mysql_port => 3306
> Jun 22 15:44:49.013 [12849] dbg: FuzzyOcr: focr_mysql_safe => Safe
> Jun 22 15:44:49.013 [12849] dbg: FuzzyOcr: focr_mysql_update_hash => 0
> Jun 22 15:44:49.013 [12849] dbg: FuzzyOcr: focr_mysql_user => fuzzyocr
> Jun 22 15:44:49.013 [12849] dbg: FuzzyOcr: focr_no_homedirs => 0
> Jun 22 15:44:49.013 [12849] dbg: FuzzyOcr: focr_path_bin => 
> /usr/local/netpbm/bin:/usr/local/bin:/usr/bin
> Jun 22 15:44:49.013 [12849] dbg: FuzzyOcr: focr_pdf_maxpages => 1
> Jun 22 15:44:49.013 [12849] dbg: FuzzyOcr: focr_personal_wordlist => 
> __userstate__/FuzzyOcr.words
> Jun 22 15:44:49.013 [12849] dbg: FuzzyOcr: focr_preprocessor_file => 
> /etc/mail/spamassassin/FuzzyOcr.preps
> Jun 22 15:44:49.013 [12849] dbg: FuzzyOcr: focr_scan_pdfs => 0
> Jun 22 15:44:49.013 [12849] dbg: FuzzyOcr: focr_scanset_file => 
> /etc/mail/spamassassin/FuzzyOcr.scansets
> Jun 22 15:44:49.014 [12849] dbg: FuzzyOcr: focr_score_ham => 0
> Jun 22 15:44:49.014 [12849] dbg: FuzzyOcr: focr_skip_bmp => 0
> Jun 22 15:44:49.014 [12849] dbg: FuzzyOcr: focr_skip_gif => 0
> Jun 22 15:44:49.014 [12849] dbg: FuzzyOcr: focr_skip_jpeg => 0
> Jun 22 15:44:49.014 [12849] dbg: FuzzyOcr: focr_skip_png => 0
> Jun 22 15:44:49.014 [12849] dbg: FuzzyOcr: focr_skip_tiff => 0
> Jun 22 15:44:49.014 [12849] dbg: FuzzyOcr: focr_skip_updates => 0
> Jun 22 15:44:49.014 [12849] dbg: FuzzyOcr: focr_strip_numbers => 1
> Jun 22 15:44:49.014 [12849] dbg: FuzzyOcr: focr_threshold => 0.25
> Jun 22 15:44:49.014 [12849] dbg: FuzzyOcr: focr_timeout => 10
> Jun 22 15:44:49.014 [12849] dbg: FuzzyOcr: focr_twopass_scoring_factor => 1.5
> Jun 22 15:44:49.014 [12849] dbg: FuzzyOcr: focr_unique_matches => 0
> Jun 22 15:44:49.014 [12849] dbg: FuzzyOcr: focr_verbose => 3
> Jun 22 15:44:49.014 [12849] dbg: FuzzyOcr: focr_wrongctype_score => 1.5
> Jun 22 15:44:49.014 [12849] dbg: FuzzyOcr: focr_wrongext_score => 1.5
> Jun 22 15:44:49.014 [12849] info: FuzzyOcr: Loaded preprocessor normalize: 
> /usr/bin/pnmnorm
> Jun 22 15:44:49.014 [12849] info: FuzzyOcr: Loaded preprocessor invert: 
> /usr/bin/pnminvert
> Jun 22 15:44:49.014 [12849] info: FuzzyOcr: Loaded preprocessor ppmtopgm: 
> /usr/bin/ppmtopgm
> Jun 22 15:44:49.014 [12849] info: FuzzyOcr: Loaded preprocessor pamtopnm: 
> pamtopnm
> Jun 22 15:44:49.015 [12849] info: FuzzyOcr: Loaded preprocessor pamthreshold: 
> pamthreshold -simple -threshold 0.5
> Jun 22 15:44:49.015 [12849] info: FuzzyOcr: Loaded preprocessor maketiff: 
> pnmtotiff -color -truecolor
> Jun 22 15:44:49.015 [12849] info: FuzzyOcr: Using scan ocrad: /usr/bin/ocrad 
> -s5 $input
> Jun 22 15:44:49.015 [12849] info: FuzzyOcr: Using scan ocrad-invert: 
> /usr/bin/ocrad -s5 -i $input
> Jun 22 15:44:49.015 [12849] info: FuzzyOcr: Using scan 
> ocrad-decolorize-invert: /usr/bin/ocrad -s5 -i $input
> Jun 22 15:44:49.015 [12849] info: FuzzyOcr: Using scan ocrad-decolorize: 
> /usr/bin/ocrad -s5 $input
> Jun 22 15:44:49.015 [12849] info: FuzzyOcr: Using scan gocr: /usr/bin/gocr -i 
> $input
> Jun 22 15:44:49.015 [12849] info: FuzzyOcr: Using scan gocr-180: 
> /usr/bin/gocr -l 180 -d 2 -i $input
> Jun 22 15:44:49.015 [12849] info: FuzzyOcr: Added <43> words from 
> "/etc/mail/spamassassin/FuzzyOcr.words"
> Jun 22 15:44:50.042 [12849] info: rules: meta test ADVANCE_FEE_3_NEW_FORM has 
> dependency 'ADVANCE_FEE_3_NEW' with a zero score
> Jun 22 15:44:50.056 [12849] info: rules: meta test ADVANCE_FEE_3_NEW_MONEY 
> has dependency 'ADVANCE_FEE_3_NEW' with a zero score
> Jun 22 15:44:50.102 [12849] dbg: FuzzyOcr: Starting FuzzyOcr...
> Jun 22 15:44:50.102 [12849] info: FuzzyOcr: Processing Message with ID 
> "<yurtenl-66711-5576735...@alexalvarez.com>" (Clifton Ballard 
> <m7zmendel...@alexalvarez.com> -> naray...@web.de)
> Jun 22 15:44:50.102 [12849] dbg: FuzzyOcr: fname: "sbillet" => "sbillet"
> Jun 22 15:44:50.102 [12849] info: FuzzyOcr: GIF: [327x549] sbillet (7239)
> Jun 22 15:44:50.102 [12849] dbg: FuzzyOcr: Saved: 
> /tmp/.spamassassin12849jvyvJgtmp/sbillet
> Jun 22 15:44:50.102 [12849] dbg: FuzzyOcr: Saved: 
> /tmp/.spamassassin12849jvyvJgtmp/raw.eml
> Jun 22 15:44:50.103 [12849] info: FuzzyOcr: Found: 1 images
> Jun 22 15:44:50.103 [12849] dbg: FuzzyOcr: pfile => 
> /tmp/.spamassassin12849jvyvJgtmp/sbillet.pnm
> Jun 22 15:44:50.103 [12849] dbg: FuzzyOcr: efile => 
> /tmp/.spamassassin12849jvyvJgtmp/sbillet.err
> Jun 22 15:44:50.103 [12849] dbg: FuzzyOcr: Errors to: 
> /tmp/.spamassassin12849jvyvJgtmp/raw.err
> Jun 22 15:44:50.103 [12849] dbg: FuzzyOcr: File has Content-Type "image/jpeg" 
> and no File Extension
> Jun 22 15:44:50.103 [12849] info: FuzzyOcr: Found GIF header name="sbillet"
> Jun 22 15:44:50.103 [12849] info: FuzzyOcr: Image has format "GIF" but 
> content-type is "image/jpeg"
> Jun 22 15:44:50.116 [12850] dbg: FuzzyOcr: Exec : /usr/bin/giftext 
> /tmp/.spamassassin12849jvyvJgtmp/sbillet
> Jun 22 15:44:50.117 [12850] dbg: FuzzyOcr: Stdout: 
> >/tmp/.spamassassin12849jvyvJgtmp/giftext.info
> Jun 22 15:44:50.117 [12850] dbg: FuzzyOcr: Stderr: 
> >>/tmp/.spamassassin12849jvyvJgtmp/giftext.err
> save_execute: Insecure dependency in open while running with -T switch at 
> /etc/mail/spamassassin/FuzzyOcr/Misc.pm line 92.
> save_execute: Insecure dependency in open while running with -T switch at 
> /etc/mail/spamassassin/FuzzyOcr/Misc.pm line 92.
> Jun 22 15:44:50.134 [12849] dbg: FuzzyOcr: Saved pid: 12850
> Jun 22 15:44:50.134 [12849] dbg: FuzzyOcr: Elapsed [12850]: 0.030674 sec. 
> (/usr/bin/giftext: exit 8)
> Jun 22 15:44:50.134 [12849] warn: readline() on closed filehandle INFILE at 
> /etc/mail/spamassassin/FuzzyOcr/Misc.pm line 205.
> Jun 22 15:44:50.135 [12849] info: FuzzyOcr: Image is single non-interlaced...
> Jun 22 15:44:50.142 [12851] dbg: FuzzyOcr: Exec : /usr/bin/giffix 
> /tmp/.spamassassin12849jvyvJgtmp/sbillet
> Jun 22 15:44:50.143 [12851] dbg: FuzzyOcr: Stdout: 
> >/tmp/.spamassassin12849jvyvJgtmp/sbillet-fixed.gif
> Jun 22 15:44:50.143 [12851] dbg: FuzzyOcr: Stderr: 
> >>/tmp/.spamassassin12849jvyvJgtmp/sbillet.err
> save_execute: Insecure dependency in open while running with -T switch at 
> /etc/mail/spamassassin/FuzzyOcr/Misc.pm line 92.
> save_execute: Insecure dependency in open while running with -T switch at 
> /etc/mail/spamassassin/FuzzyOcr/Misc.pm line 92.
> Jun 22 15:44:50.149 [12849] dbg: FuzzyOcr: Saved pid: 12851
> Jun 22 15:44:50.149 [12849] dbg: FuzzyOcr: Elapsed [12851]: 0.014443 sec. 
> (/usr/bin/giffix: exit 8)
> Jun 22 15:44:50.156 [12852] dbg: FuzzyOcr: Exec : /usr/bin/giftopnm 
> /tmp/.spamassassin12849jvyvJgtmp/sbillet-fixed.gif
> Jun 22 15:44:50.157 [12852] dbg: FuzzyOcr: Stdout: 
> >/tmp/.spamassassin12849jvyvJgtmp/sbillet.pnm
> Jun 22 15:44:50.157 [12852] dbg: FuzzyOcr: Stderr: 
> >>/tmp/.spamassassin12849jvyvJgtmp/sbillet.err
> save_execute: Insecure dependency in open while running with -T switch at 
> /etc/mail/spamassassin/FuzzyOcr/Misc.pm line 92.
> save_execute: Insecure dependency in open while running with -T switch at 
> /etc/mail/spamassassin/FuzzyOcr/Misc.pm line 92.
> Jun 22 15:44:50.173 [12849] dbg: FuzzyOcr: Saved pid: 12852
> Jun 22 15:44:50.174 [12849] dbg: FuzzyOcr: Elapsed [12852]: 0.023864 sec. 
> (/usr/bin/giftopnm: exit 8)
> Jun 22 15:44:50.174 [12849] error: FuzzyOcr: /usr/bin/giftopnm: Returned 
> [2048], skipping...
> Jun 22 15:44:50.175 [12849] dbg: FuzzyOcr: Remove DIR: 
> /tmp/.spamassassin12849jvyvJgtmp
> Jun 22 15:44:50.175 [12849] dbg: FuzzyOcr: FuzzyOcr ending successfully...
> Jun 22 15:44:50.175 [12849] dbg: FuzzyOcr: Processed in 0.073397 sec.
>
>
>
> Can anybody tell me what's wrong with my deployment?
>

Ashish,

Have ever found a workaround for this issue?

Thx

Reply via email to