Hi Jacob;

As you have probably figured out yourself by now, coming up with some
syntax for parametrized types (and perhaps even a set of formalized rules)
is the easy part of a complete implementation.

If you have been working on the existing compiler, you must have adjusted
the cmd/compile/internal/syntax package which is quite straight-forward and
easy to play with. The result is a parse tree using syntax.Nodes.
Presumably you have introduced new new types and fields to handle the extra
information you need to represent and adjusted the parser code. (Correct me
if you're doing something else).

In cmd/compile, currently (we're in a transition phase) this syntax tree is
then converted by the gc noder into another syntax tree, the one that
historically was generated by the compiler's parser directly. It's bigger,
has only a handful of distinct nodes, and is generally (much) harder to
deal with. I don't know if you have attempted to generate this one as well.
It's much more difficult to extend this node structure and its
documentation is sparse to non-existent (hence we're hoping to replace it
eventually). In cmd/compile, it's that tree that is type-checked. If you
haven't gotten this far you may want to try to type-check the syntax tree
instead.

Type-checking is a complicated process: For Go it must implement
essentially three basic functions: 1) resolution of identifiers: Which
object (package, constant, type, variable, function, label, etc.) does an
identifier stand for. In general this resolution requires all source code
of a package to be available because only then one can know for sure what
an identifier (which might be defined at the very last line of the last
file) stand for. 2) Constant folding: For all constant expressions, the
constant value is determined. 3) Type inference: For each expression its
type is determined. There's some finer-grained phases to associate methods
with types, to determine initialization order, etc.

In Go, only phase two can be done truly independently. Phases 2 and 3 are
somewhat interdependent because a constant value (say, the length of an
array, or unsafe.Sizeof a variable) may depend on a type. Vice versa, an
array type may depend on the value of a constant which in turn may affect
the constant size of the array type.

The cmd/compile type checker mixes some of these phases. In fact some
identifier resolution (phase 1) is even done at parse time (and sometimes
needs to be corrected afterwards). But the main phases are at least
outlined in gc/main.go:439ff. The type checker is also called whenever the
compiler is not sure it has the type of a node. Each node "knows" if it was
type-checked, and there's a mechanism to stop endless recursion
(Node.Typecheck). It's all pretty complicated unfortunately - which is why
we're trying to clean it up over time.

If your goal is to eventually generate code, you'd have to deal with the
compiler's node structure, adjust the type checker, and probably backend
(and all that depends on that). It's a huge project.

If your goal is to have a proof of concept where you can parse and
type-check source code, it might be a lot easier to have your own
type-checker. You could borrow almost all of go/types
<https://goto.google.com/types> which is a complete type-checker for Go,
and which is hopefully a bit easier to read. It operates on go/AST
<https://goto.google.com/AST> nodes which are a bit different from syntax
nodes, but in fact most of the nodes are the same or very similar (syntax
nodes mostly have a simpler structure for grouped declarations). That might
be a simpler approach but it won't get you to code generation.

You'd still have to deal with import and export somehow which would need to
be extended so it can represent the new data structures.

All that said, if you actually manage to type-check a single package using
parametrized types correctly (in whatever glorious form you have devised),
you've got the first half of an implementation.

Hopefully this is a little bit helpful.
- gri

On Wed, Sep 20, 2017 at 4:46 PM, Jacob McCollum <jacob.r.mccol...@gmail.com>
wrote:

> All,
>
> This is my first time posting to the golang-nuts mailing list, but I've
> hit a wall and I'm not sure where else to reach out for some assistance.
>
> I'm knee-deep in a research implementation of parameterized types for Go.
> It started out as an experiment, but I really enjoyed working on it and it
> has turned into a toy project for me.
>
> To clarify, this is not (necessarily) something I intend to submit as a
> PR, nor do I intend to release it as a fork or in any way use it to
> fracture the community. This is simply a project I undertook to develop a
> better understanding of the technical challenges facing the team when it
> comes to implementation of generic types, and to potentially bring some new
> ideas to that discussion (eventually). I don't have future plans for this
> beyond learning at this point.
>
> That being said, I'm to a point where I've got a formalized set of rules,
> the syntax is parsing correctly this information is stored in the parse
> tree. The next step is type checking & concrete type generation/replacement.
>
> I am considering 2 possible implementations for this phase. One possible
> plan is to augment the existing type checking logic with new functionality.
> Another is to re-implement a lightweight type checker that gathers only the
> information it needs to modify the parse tree, before dropping that down to
> the cmd/compile/internal/gc package's processing logic. Right now, I don't
> feel that I understand the existing type-checking flow well enough to make
> that call.
>
> Would it be a reasonable request to ask for a high-level overview of the
> process that gc uses to get from syntax's parse tree to the node DAG that
> is type-checked? I can fill in a lot of the blanks on my own once I have a
> better high-level understanding, but right now I'm struggling to determine
> the exact relationship between processing the syntax tree with *noder,
> typecheck, the order in which those two processes should occur, etc. I've
> seen the different phases as presented in gc's Main method, but trying to
> understand the high level intent via the code feels like I'm failing to see
> the forest through the trees. A little cognitive glue would go a long way
> in helping me coalesce my understanding.
>
> Thanks,
>
> Jacob R. McCollum
>
>
>
>
>
> --
> You received this message because you are subscribed to the Google Groups
> "golang-nuts" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to golang-nuts+unsubscr...@googlegroups.com.
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups 
"golang-nuts" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to golang-nuts+unsubscr...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to