Re: Deprecation notice - change to Treeherder's jobdetail API

Steve Fink Fri, 10 Apr 2020 11:03:47 -0700

On 4/10/20 10:06 AM, Andrew McCreight wrote:

On Thu, Apr 9, 2020 at 6:51 PM Sarah Clements <[email protected]> wrote:

Hello,

As part of maintenance work and performance improvements to Treeherder,
we're making changes to some of the data that's stored in our database. On 
*April
30th*, we will no longer be retrieving uploaded artifacts for jobs and
storing them in the JobDetail table.

Uploaded artifacts can be retrieved via a taskcluster API, with the
appropriate rootUrl, taskId and runId. TaskId and runId (synonymous with
retryId in Treeherder) can be surfaced from any of our /jobs/ API's. See
this
<https://github.com/mnoorenberghe/mozscreenshots/issues/43#issue-592994567>
for implementation details.

As somebody who has dumped files to MOZ_UPLOAD_DIR and used the job detail
tab to download those files (just this week, in fact!), but has only a
vague idea of what the "taskcluster API" even is, let alone a "rootURL" or
the rest of it, is there any introductory documentation about how I might
retrieve files written to MOZ_UPLOAD_DIR? Am I going to have to write some
kind of Python script to retrieve my files in the future? That GitHub issue
seems to require a lot of background information that I do not have.

I'm in a similar boat. I very frequently set additional options on mytry pushes in order to generate nondefault files in MOZ_UPLOAD_DIR. Itis unclear to me from this description whether I will still be able tosee those files in the treeherder UI after this change.

We're also planning to deprecate the parsing and storage of TinderboxPrint
log lines in the JobDetail table. They'll no longer be shown in
Treeherder's job detail panel but you can still read them in the raw logs
via the log-viewer.

If you have any questions or concerns (or need a longer timeline to make
changes that rely on this data), don't hesitate to reach out.

Similarly, I also regularly depend on being able to emit log lines frommy jobs that will be easily visible in treeherder (*without* reading theentire raw log), both for my own use and others'. Is there a replacementfor this functionality? I am not at all sad to see the specificimplementation go ("TinderboxPrint" plus an opaque set of rules for whatcan go after it), and would be happy to convert my jobs over to adifferent system. If that involves something like writing out structuredjson to a separate file, that's still fine with me. (I'm only speakingfor myself; some people may depend on being able to simply drop in`fprintf(stderr, "TinderboxPrint: ...")` into their code and have itshow up.)

Needing to load the raw log files isn't great because (1) it takes anoticeable amount of time for large files (I'm sure this is much worsefor some locations), and (2) when you're doing things like this, you mayvery well need to be loading several of them.

If there *is* an intended replacement, I could suggest other things thatI'd find useful (or would like to preserve with the current setup). Forexample, I have a job that checks for multiple types of errors, andwants to output a link to documentation specific to the class of errorthat was detected. Not critical functionality, but it definitely makesfor a better workflow. I am currently using TinderboxPrint for this, ina way that once worked but has now regressed. Example: see the'documentation' link in Job Details athttps://treeherder.mozilla.org/#/jobs?repo=try&author=sfink%40mozilla.com&fromchange=39a2e0ffb371099b5ffd42cb7932367a183d6128&selectedJob=295880330


_______________________________________________
dev-platform mailing list
[email protected]
https://lists.mozilla.org/listinfo/dev-platform

Re: Deprecation notice - change to Treeherder's jobdetail API

Reply via email to