On Mon, May 23, 2005 at 08:43:39PM +0300, avraham wrote:
> On Fri, May 20, 2005 at 12:52:05PM +0300, Tzafrir Cohen wrote:
> > On Fri, May 20, 2005 at 11:51:41AM +0300, avraham wrote:
> >>...skip
> > Any chance you could write those commands as a script, for reference?
> > ...skip
> > 
> > =================================================================
> > To unsubscribe, send mail to [EMAIL PROTECTED] with
> > the word "unsubscribe" in the message body, e.g., run the command
> > echo unsubscribe | mail [EMAIL PROTECTED]
> 
> Hi Tzafrir,
> The lull I had at work has finished, so I think it's better to
> sum up whatever I learned till now rather to report after
> finishing. Some may benefit from it and some others are much
> better qualified than me to improve.
> My experience originates in a small number of exercises in which
> I took some existing texts and transformed them into tex
> ->dvi->pdf files. 

BTW: there are also pdf[e[la]]tex to create a PDF file directly from
TeX.

> As exercises in LaTeX that saved me the typing,
> allowed me to concentrate in specific aspects of LaTeX and, by
> yielding a final product of interest, maintained my motivation.
> The first, Brett Hamilton: Installing debian Woody, to practice
> the \verb+  + command and the \verbatim environement.
> Main conclusion: In order to get a legible dvi file from the
> start, double space the original document
> $ cat source_file | sed G >preprocessed_file.

Hmm... Can't you do such double-spacing in LaTeX?

> Then, when applying the correct LaTeX formatting methods, it is
> very easy to get rid of the empty lines.
> Second, Sven Guckes: VIM Editing Intro, where I tried to produce
> all those symbols and commands of VIM with no, or minimal use of
> verbatim.
> Conclusion: To get a smooth translation to LaTeX it is worthwhile
> to apply an additional filter which first replaces the charaters
> &, ^ and $, which have special meanings in regular expressions with 
> placebos, and then replaces \, ~, # ,% _, {, }, |, <, >, ^, $,
> and & with their LaTeX symbols in text context. All this is, of
> course, prefixed by #!sh, written in the file lpreproc and made
> executable. 
> #!sh
> cat $1 | sed 's/\&/ personalET /g' | sed 's/\^/ personalHAT /g' | sed 's/\$/ 
> personalDOLLAR /g' | sed 's/\\/ \\textbackslash /g' | sed 's/~/ 
> \\textasciitilde /g' | sed 's/#/ \\# /g' | sed 's/%/ \\% /g' | sed 's/_/ \\_ 
> /g' | sed 's/{/ \\{ /g' | sed 's/}/ \\} /g' | sed 's/|/ \\textbar \\  /g' | 
> sed 's/</ \\textless /g' | sed 's/>/ \\textgreater /g' | sed 's/personalHAT/ 
> \\textasciicircum /g' | sed 's/personalDOLLAR/ \\$/g' | sed 's/personalET/ 
> \\\& /g' 
> 
> Clearly, if any of the placebos appears by chance in the file to
> be processed, the script has to be modified.
> (I know that some of these symbols have better looking LaTeX
> implementations among the mathematical symbols. I refrained from
> using them because I did not learn yet this chapter.)
> 
> Third, Chekhov: The sea-gull from the Guttenberg project(this is 
> just because I saw on TV the French film "La petite Lili", a modern 
> version of the play, and discovered that I had forgotten most of it).
> Conclusion: LaTeX may misinterpretate text in square brackets (in
> that case stage instructions) as misformed or mistaken command
> arguments. It's worth replacing to avoid trouble.
> Fourth, My wife's cake recipes, to practice multilanguage
> environement and indexing.
> Conversion from DOS hebrew:
> cat source_file | '\200-\232' '\340-\372' (from the Hebrew-HOWTO)

  cat source_file | tr '\200-\232' '\340-\372' 

?

You could probably try:

  iconv -f cp862 -t iso-8859-8

If you want to convert directly to UTF-8, that is probably the approach
you should take.

> When the source file originates from wp 5.2 for DOS (I have no
> experience with other sources) the numbers and parantheses come
> out the wrong way. The simplest way I found to treat the
> multitude of possible cases was to reverse everything left-right
> with:
> sed '/\n/!G;s/\(.\)\(.*\n\)/&\2\1/;//D;s/.//' (collection of sed
> 1liners -sorry did not write down the source)
> and then reverse back the text only, with bidiv 
> 
> In multi-language environement, in the end result of this treatment
> the English lines are reversed. As I had only very few English
> text, I separated them out with grep, before the treatment, and
> recombined back with vimdiff. This is not the proper solution:
> TODO.
> 
> After all these treatments, add a preamble at the beginning and
> an \end{document} at the end, and try to latex (elatex, for text
> containing Hebrew).
> Simple preable for English-only txt:
> \documentclass[12pt]{article}
> \pagestyle{empty}
> \begin{document}
> And for multilanguage environement:
> \documentclass[letterpaper,12pt]{article} 
> \pagestyle{plain}
> \usepackage[english,german,hebrew]{babel}

Why 'german'?

> \usepackage{hebcal}
> \usepackage{hebfont}

Why 'hebcal'? Why 'hebfont'?

> \begin{document}
> This is best done in the editor which you will use to refine the
> LaTeX source and deal with the problems caused by non-printable
> control characters carried over from the source.
> 
> Most if not all the sed and tr commands I mentioned have
> equivalents in the major editors. I wished to steer away from the
> holly war between the adepts of one or another of them. I hope
> that people will not find very hard to use a pipe of filters/or
> to translate them into the form understood by their editor.
> 
> Still other TODOs:
> 1-An efficient way to find non-printable characters in the middle
> of Hebrew text.
> 2-Specific to WP: Dashes between Hebrew text and numbers/english
> text are consistently misplaced.
> 
> Hoping it will be of use to someone, cheers,
> Avraham
> 
> =================================================================
> To unsubscribe, send mail to [EMAIL PROTECTED] with
> the word "unsubscribe" in the message body, e.g., run the command
> echo unsubscribe | mail [EMAIL PROTECTED]
> 

-- 
Tzafrir Cohen         | [EMAIL PROTECTED] | VIM is
http://tzafrir.org.il |                           | a Mutt's  
[EMAIL PROTECTED] |                           |  best
ICQ# 16849755         |                           | friend

=================================================================
To unsubscribe, send mail to [EMAIL PROTECTED] with
the word "unsubscribe" in the message body, e.g., run the command
echo unsubscribe | mail [EMAIL PROTECTED]

Reply via email to