On Tue, Aug 03, 2010 at 12:12:26PM -0700, Dino Vliet wrote:
>  Dear debian people,
> 
>  Can you help me with this task I have? I have a lot of files in a 
> subdirectory
>  containing the following text:

You should use awk.

- cut -

>  I need to parse this file to get in a csv file the following information:
> 
>  Correctly Classified Instances, Kappa statistic, Total Number of Instances,
>  Precision {1}, Recall {1}, F-Measure {1},Precision {2}, Recall {2}, F-Measure
>  {2},Precision {3}, Recall {3}, F-Measure {3},a,b,c,a,b,c,a,b,c
>  56.6808, 0.2443, 5324760, 0.681,0.618,0.648,0.617,0.519,0.564,
>  
> 0.056,0.296,0.094,1784321,684983,416649,787342,1190428,314537,49255,53877,43368
> 
>  Does anyone have an idea how this could be accomplished?
>  I not that great in programming so writing a ruby or shell script do do this
>  would take me weeks:-(

A starting in Awk for processing a single file would be:

BEGIN {
  n_equals = 0;
}

n_equals == 0 && /Correctly Classified/ {
  CCI = $(NF - 2);
}

n_equals == 0 && /Incorrectly Classified/ {
  ICI = $(NF - 2);
}

n_equals == 0 && /Kappa statistic/ {
  KS = $NF
}

…

/ ===/ { n_equals = n_equals + 1 }

n_equals == 1 && /TP Rate/ {
  next;
}

// More complicated processing

END {
  printf "%d,", CCI
  printf "%d,", ICI
  printf "%f", KS
  …
}

You ought to read the Awk manual, and then it would be a mattle of a
couple of hours of thought at most.

HTH.

Kumar
-- 
"Even more amazing was the realization that God has Internet access.  I
wonder if He has a full newsfeed?"
(By Matt Welsh)


-- 
To UNSUBSCRIBE, email to debian-user-requ...@lists.debian.org 
with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org
Archive: 
http://lists.debian.org/20100803194333.ga9...@bluemoon.alumni.iitm.ac.in

Reply via email to