On Fri, Mar 29, 2019 at 02:39:27PM -0700, Jonathan Tan wrote:

> Teach oid_object_info_extended() to support a new flag that inhibits
> fetching of missing objects. This is equivalent to setting
> fetch_is_missing to 0, calling oid_object_info_extended(), then setting
> fetch_if_missing to whatever it was before. Update unpack-trees.c to use
> this new flag instead of repeatedly setting fetch_if_missing.
> 
> This new flag complicates things slightly in that there are now 2 ways
> to do the same thing. But this eliminates the need to repeatedly set a
> global variable, and more importantly, allows prefetching to be done in
> parallel (in the future); hence, this patch.

Sorry I'm a little late to review this. I don't have any critical
comments, so if this gets ignored, I'll live with it.

> +/*
> + * Do not attempt to fetch the object if missing (even if fetch_is_missing is
> + * nonzero). This is meant for bulk prefetching of missing blobs in a partial
> + * clone. Implies OBJECT_INFO_QUICK.
> + */
> +#define OBJECT_INFO_FOR_PREFETCH (32 + OBJECT_INFO_QUICK)

Mostly I found the name and semantics of this flag to be a little
confusing. Really what we want is to tell oid_object_info() not do any
on-demand fetching for us. That seems like a thing that we might
eventually want for other purposes (e.g., a diff operation that could
produce a real blob diff but would be happy outputting a less-detailed
tree diff).

If it were just OBJECT_INFO_NO_FETCH or similar, that tells more clearly
what it does, and would make sense in more contexts.

I suspect that QUICK would be the norm when used with it, though I
probably would have kept the two orthogonal for the sake of simplicity
and clarity.

> diff --git a/unpack-trees.c b/unpack-trees.c
> index 22c41a3ba8..381b0cd65e 100644
> --- a/unpack-trees.c
> +++ b/unpack-trees.c
> @@ -404,20 +404,21 @@ static int check_updates(struct unpack_trees_options *o)
>                * below.
>                */
>               struct oid_array to_fetch = OID_ARRAY_INIT;
> -             int fetch_if_missing_store = fetch_if_missing;
> -             fetch_if_missing = 0;
>               for (i = 0; i < index->cache_nr; i++) {
>                       struct cache_entry *ce = index->cache[i];
> -                     if ((ce->ce_flags & CE_UPDATE) &&
> -                         !S_ISGITLINK(ce->ce_mode)) {
> -                             if (!has_object_file(&ce->oid))
> -                                     oid_array_append(&to_fetch, &ce->oid);
> -                     }
> +
> +                     if (!(ce->ce_flags & CE_UPDATE) ||
> +                         S_ISGITLINK(ce->ce_mode))
> +                             continue;
> +                     if (!oid_object_info_extended(the_repository, &ce->oid,
> +                                                   NULL,
> +                                                   OBJECT_INFO_FOR_PREFETCH))
> +                             continue;
> +                     oid_array_append(&to_fetch, &ce->oid);

Here we get rid of the global set/restore dance, which is nice. But
there's also a behavior change, as we've picked up QUICK. I think that's
probably the right thing to do, but I was a bit surprised not to see any
discussion in the commit message.

-Peff

Reply via email to