It's driving me bonkers and can't afford any more psychiatic bills. The data
is a saved .txt file when viewing from a website. The vendor will not give
us an actual file even though we payed a montly fee for use of the database.
I have around 5000 records that need to be converted to MARC cataloging
records. I need to either have the data from each heading on 1 line or have
the script extract each heading and all the subsequent lines.
The script is only extracting the first line of the heading.. I can only
have 1 blank line between each record which works in the script. If I right
click then import to excel when viewing the records at the website, each
heading is a continous string, which is what I need. I can then save as a
tab delimited file and the lines for each heading remian continuous, which
works. But we have ceased our subscription and I now only have saved .txt
files of the 5000 records. I can't figure out how and where to modify the
script to work on the files. I suppose I could spend a couple months
manually joining lines, but that really cuts into naptime.
Sample data:
Title 10 fastest growing careers: jobs for the future part four
business
and computer technology (03616)
Physical Color; Sound; 15 minutes
Copyrighted 1990
Producer GUIDANCE ASSOCIATES (GUID)
Dewey 371.425
Synopsis Contents: The business community depends on up-to-the minute
technology - technology that is changing rapidly. As a result, careers
in
technology, especially computers and specialized areas such as
accounting
are much in demand. Takes a look at three business and computer
careers:
software engineering, computer programming and accounting.
Subjects CAREER GUIDANCE; CAREER SERVICES
Holdings
1/2 VHS video: Head Office, 1 copy
Title 10 fastest growing careers: jobs for the future part one legal
and
health (03613)
Physical Color; Sound; 15 minutes
Copyrighted 1990
Producer GUIDANCE ASSOCIATES (GUID)
Dewey 371.425
Synopsis Contents: Takes a look at the fast growing health and legal
fields. Talks to a registered nurse about her changing role in a major
hospital, a physician's assistant who works with two doctors in a busy
family practice, and a paralegal who works with an attorney.
Subjects CAREER GUIDANCE; CAREER SERVICES
Holdings
1/2 VHS video: Head Office, 1 copy
HERE IS THE SCRIPT
open(MYINPUTFILE, "<1000chomp.txt"); # open for input
my(@lines) = <MYINPUTFILE>; # read file into list
my $title;
my $series;
my $subjects;
my $physical;
my $synopsis;
my $producer;
my $copyrighted;
my $dewey;
for my $line (@lines)
{
$line =~ /Title/ and $title = $line;
$line =~ /Title/ and print "=LDR 00000nam 2200000Ia 45e0\n","=245
00\$a",$line;
$line =~ /Dewey/ and $dewey = $line;
$line =~ /Dewey/ and print "=082 \\\\\$a",$line;
$line =~ /Producer/ and $producer = $line;
$line =~ /Producer/ and print "=040 \\\\\$aCaSRRI\n","=260
\\\\\$a",$line;
$line =~ /Copyrighted/ and $copyrighted = $line;
$line =~ /Copyrighted/ and print "=261 \\\\\$c",$line;
$line =~ /Physical/ and $physical = $line;
$line =~ /Physical/ and print "=300 \\\\\$a1 videocassette ( min.)
:\$bsd., col. ;\$c13 mm.",$line;
$line =~ /Series/ and $series = $line;
$line =~ /Series/ and print "=440 0\\\$a",$line;
$line =~ /Synopsis/ and $synopsis = $line;
$line =~ /Synopsis/ and print "=520 \\\\\$a",$line;
$line =~ /Subjects/ and $subjects = $line;
$line =~ /Subjects/ and print "=550 \\\\\$a",$line,"\n";
--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
<http://learn.perl.org/> <http://learn.perl.org/first-response>