I just discovered the LaTeXML project: http://dlmf.nist.gov/LaTeXML/
which is actively developed and may come very close to what we are aiming at. It is a Perl-based attempt to recreate a subset of TeX with XML output. It is very math-oriented and, at first look, not so bibliography-oriented (although it does parse BibTeX). Did anyone know of it?

I am going to try it on our test document and will report back on its current performance. In the meantime, if you have any reactions to such an approach, do not hesitate to share them.

Stefano

--
__________________________________________________
Stefano Franchi
Associate Research Professor
Department of Hispanic Studies
Ph: +1 (979) 845-2125
Texas A&M University
Fax: +1 (979) 845-6421
College Station, Texas, USA
stef...@tamu.edu
http://stefano.cleinias.org