Re: "nested" languages

Hans Aberg Tue, 28 Jun 2005 05:19:21 -0700

At 11:40 +1000 2005/06/28, Neil Conway wrote:

I'm trying to use Bison to construct a parser for a language A that needs to allow syntax from a second language B to be embedded within it (at well-defined positions within A). For example, suppose a production from the grammar of A is:
        if_stmt: IF expr THEN if_body END IF ;
where "expr" is a production in *another* Bison grammar. Language B is defined by a fairly complex grammar that changes with some regularity (in this case, the embedded language B is SQL, defined by an ~8500 line bison grammar). B's grammar is maintained separately -- merging B's grammar into the grammar for A and maintaining two copies is a headache I'd like to avoid, if possible. Ideally I'd like to have A's parser "call into" B's parser, have it accept as much input as it can, and then return control to A's parser.
Is this possible? One idea would be to use hand-written parsers for both languages, although it would be nice to stick with Bison if possible.

Bison accepts only a single .y file as input, so it must be given one such file. One way to do that might be to use a preprocessor (Perl, or perhaps M4) to merge the files.


The problem might be, though, that the combined grammar isn't LALR(1) anymore.

(At present this is implemented by having the lexer look for a delimiter in the input that we know marks the end of the tokens of the embedded language. We then concatenate these tokens into a string, and eventually pass that string to the yyparse() of the embedded language's parser. So in the above example we would basically consume tokens until we see a "THEN", and assume that anything between the "IF" and the "THEN" is a SQL statement. This is ugly, to say the least.)
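A minimal C sketch of the delimiter-scanning approach described above, working on raw text rather than on lexer tokens for brevity. The names find_keyword() and extract_embedded() are hypothetical and not taken from any real parser; in a real setup the extracted string would then be handed to the embedded parser's yyparse().

```c
#include <assert.h>
#include <ctype.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical helper: characters that may appear inside an identifier. */
static int is_word_char(char c)
{
    return isalnum((unsigned char)c) || c == '_';
}

/* Hypothetical helper: first occurrence of `keyword` in `text` that is
   not part of a longer identifier, or NULL if there is none. */
static const char *find_keyword(const char *text, const char *keyword)
{
    size_t len = strlen(keyword);
    const char *p;

    assert(len > 0);
    for (p = text; (p = strstr(p, keyword)) != NULL; p++) {
        int starts_word = (p == text) || !is_word_char(p[-1]);
        int ends_word = !is_word_char(p[len]);
        if (starts_word && ends_word)
            return p;
    }
    return NULL;
}

/* Copy the text between the first "IF" and the following "THEN" into
   `out`, trimmed of surrounding whitespace. Returns the fragment length,
   or -1 if a keyword is missing or `out` is too small. */
int extract_embedded(const char *stmt, char *out, size_t out_size)
{
    const char *start = find_keyword(stmt, "IF");
    const char *end;
    size_t len;

    if (start == NULL)
        return -1;
    start += 2;                          /* skip past "IF" */
    end = find_keyword(start, "THEN");
    if (end == NULL)
        return -1;
    while (start < end && isspace((unsigned char)*start))
        start++;
    while (end > start && isspace((unsigned char)end[-1]))
        end--;
    len = (size_t)(end - start);
    if (len + 1 > out_size)
        return -1;
    memcpy(out, start, len);
    out[len] = '\0';
    return (int)len;
}
```

For the input "IF x > (SELECT max(y) FROM t) THEN body END IF" this yields "x > (SELECT max(y) FROM t)". It also shows why the approach is fragile: for "IF note = 'NOW THEN LATER' THEN body END IF" the scan stops at the THEN inside the string literal and yields "note = 'NOW".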

There are a few methods in use to combine grammars. One is to merge the grammars together, as you suggested. Another is what you are already using. Sometimes, one uses a hybrid:

If one has a large number of dynamic operator precedences, as in Prolog for example, then it is not really feasible to write them directly into a large Bison grammar. So one would write a small grammar that puts the operators on a stack, and then sort out the precedences in the actions. If the number of precedence levels is small, as in Haskell, which has only ten levels, one could write them directly into a static grammar, though. There, some do it that way, and others use the stack-based method.
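A minimal C sketch of what "sorting out the precedences in the actions" could look like, assuming the grammar has only collected a flat sequence of operands and operators. The table op_table and the functions set_precedence() and resolve_flat() are hypothetical names for illustration; all operators are treated as left-associative, and the sketch evaluates integers instead of building a syntax tree.

```c
#include <assert.h>
#include <stddef.h>

#define MAX_DEPTH 64

/* Hypothetical runtime operator table; a Prolog-like language could
   change these entries while parsing. */
struct op_entry { char symbol; int precedence; };

static struct op_entry op_table[] = { { '+', 1 }, { '-', 1 }, { '*', 2 } };

static int precedence_of(char symbol)
{
    size_t i;
    for (i = 0; i < sizeof op_table / sizeof op_table[0]; i++)
        if (op_table[i].symbol == symbol)
            return op_table[i].precedence;
    return -1;                           /* unknown operator */
}

/* Change an operator's precedence at run time; -1 if it is unknown. */
int set_precedence(char symbol, int precedence)
{
    size_t i;
    for (i = 0; i < sizeof op_table / sizeof op_table[0]; i++)
        if (op_table[i].symbol == symbol) {
            op_table[i].precedence = precedence;
            return 0;
        }
    return -1;
}

static long apply(char op, long lhs, long rhs)
{
    switch (op) {
    case '+': return lhs + rhs;
    case '-': return lhs - rhs;
    default:  return lhs * rhs;          /* '*' is the only other entry */
    }
}

/* Pop one operator and two values, push the result. */
static void reduce(long *values, size_t *nv, const char *ops, size_t *no)
{
    long rhs = values[--*nv];
    long lhs = values[--*nv];
    values[(*nv)++] = apply(ops[--*no], lhs, rhs);
}

/* Evaluate operands[0] operators[0] operands[1] ... using an operator
   stack and the current precedences in op_table. */
long resolve_flat(const long *operands, const char *operators, size_t n_ops)
{
    long values[MAX_DEPTH];
    char ops[MAX_DEPTH];
    size_t nv = 0, no = 0, i;

    assert(n_ops < MAX_DEPTH);
    values[nv++] = operands[0];
    for (i = 0; i < n_ops; i++) {
        assert(precedence_of(operators[i]) >= 0);
        /* Reduce while the stacked operator binds at least as tightly. */
        while (no > 0 && precedence_of(ops[no - 1]) >= precedence_of(operators[i]))
            reduce(values, &nv, ops, &no);
        ops[no++] = operators[i];
        values[nv++] = operands[i + 1];
    }
    while (no > 0)
        reduce(values, &nv, ops, &no);
    return values[0];
}
```

With the default table, the operands 2, 3, 4 and the operators "+*" evaluate to 14. After set_precedence('+', 3) the same input evaluates to 20, which is the kind of change a static grammar cannot express without being regenerated.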

In general, no method is better than the others; in a specific case, one simply chooses the most convenient one.

You might also get more input on this question in the newsgroup comp.compilers.

--
  Hans Aberg


_______________________________________________
Help-bison@gnu.org http://lists.gnu.org/mailman/listinfo/help-bison
