[oe] [Bitbake-dev] Bitbake task logging

Tue Jan 4 18:02:16 UTC 2011

On Tue, 2011-01-04 at 08:30 -0700, Chris Larson wrote:
> On Sat, Jan 1, 2011 at 11:14 AM, Richard Purdie
> <richard.purdie at linuxfoundation.org> wrote:
> > I've been looking at the changes in bitbake master and those in Poky.
> > Both are trying to improve the logging situation but I think we need to
> > discuss/agree what we're trying to achieve.
> >
> > I'm trying to think about this from the user point of view. What does a
> > user need and expect from bitbake to be as easy to use as possible and
> > figure out whats going on. New users often ask "what is it doing? what
> > functions does it run?" and its hard to work out for someone new to it.
> >
> > Some of the list below works today, some is close, some doesn't but I'm
> > looking to work out what the experience should be, then we can figure
> > out how to get there technically.
> >
> > The things I think we need are:
> >
> > a) A logfile per task.
> 
> Agreed, and I'm pretty sure I merged this commit from poky already, to
> make it task bound rather than function bound.

You did, I mention this mainly for context.

> > b) All output from the task should end up in the logfile, copied or
> > otherwise, be it bitbake LogRecords, output to stdout, stderr or
> > anything else.
> 
> This is reasonable, though it could encourage python code to use
> non-bitbake mechanisms like print rather than the correct bitbake APIs
> for messaging.  The only piece of this which is not happening today is
> stdout/stderr of python functions.  If we do what you describe here,
> we can use the file descriptor redirection context manager in
> https://github.com/kergoth/bitbake/commit/3acd45c to do it in a clean
> way, rather than the old code which was crammed into exec_func.

I like the idea of that done in exec_task so any python output from the
tasks ends up in the right logfile.

Another benefit is that the logfile for a shell function executed by
exec_func from within a python task would then become less of a problem
as I think otherwise it becomes os.devnull?

I understand the concern about not using the appropriate bitbake APIs, I
don't think thats a good enough reason to hide potentially real uncaught
failure information from the user though as we want to make things
easier for the user if possible.

For python executed in server context this is trickier and I'm unsure
about that. I suspect we should probably save cooker output to a
"cooker" logfile rather than os.devnull as despite our best intentions
its possible something happens server side that someone has to debug and
the output ends up on stdout/stderr.

> > c) The logfile should be able to include different output from that
> > shown in the UI console. For example, I think "note" output in the
> > logfiles might be useful all the time, I'm wondering if it ever is
> > useful on the UI console. Turning off "note" for the console but keeping
> > it and maybe using it more in tasks could be useful. For example,
> > exec_func() could note which functions are being executed and I can
> > imagine a user finding that very useful in the log file for debugging
> > (which is when a user would look there). Debug output is probably too
> > verbose even for the log files in general but I'm open to persuasion one
> > way or another on that.
> 
> I think this point is up to the UIs.  They have the log records, and
> can decide what to do with them.

Yes, I should have been clearer here that I'm thinking about knotty in
some of the comments above (and below).

Even if we decide on a given policy for knotty, other UIs are free to
have a different policy, they just need to have enough context to manage
that (e.g. know a "note" is from a task, not the server).

>   I'd suggest that we add more context
> to the log records (which we really need to do anyway, to ensure that
> UIs like goggle can *always* associate messages with a specific task),
> and let the UI decide which messages from where to display.  I agree
> wholeheartedly on debug output being too verbose.  We need to ensure
> that all debug messages are helpful to the user, and developers
> helping them -- I suspect that some of the ones we have today are
> remnants of us debugging bitbake internal behavior as opposed to that
> which is helpful in assisting the user, but that's just a guess.

I think you might be right...

> > d) Handling of the default stdout is a tricky one. I think it makes most
> > sense to redirect this to the logfile always, for both python and shell
> > tasks. For bitbake itself, nothing in bitbake should be putting output
> > there though IMO. This makes sense since we can control stdout from
> > within bitbake itself but from within generic tasks we cannot make
> > guarantees. Python tasks may make os.system calls, or run binaries and
> > it would make sense for stdout to just work for these rather than need
> > special handling or wrapper functions especially for log handling (they
> > could be a good idea for other reasons).
> 
> You've already stated that you want this in b) -- "output to stdout,
> stderr or anything else.".  I'm not sure how covering this twice in
> the same email helps, other than elaborating the aforementioned point.
>
> Even if we switch back to the file descriptor redirections to send
> everything to the log file for all tasks, we still need to do
> something to capture what's going to the log file and get it to the UI
> for the debug case, and blindly using tee the way it was before is not
> the solution, as that was going to the server stdout, not the UI.

Right, that was just broken and we're in agreement.

> If
> we do it the way you describe, the same for all tasks, we'd likely
> have to spawn off a separate thread or process to read the log file
> and send that to the UI.  This is an uglier solution, in my opinion,
> and risks more potential performance impact, though should only be
> necessary for the debug case, so its arguable.

If we do want the debug info in the UI and you make a good case for it
for the goggle UI below, I agree with the approach.

>   I'd question whether
> this is worthwhile, given the additional ugliness it adds to the code
> in multiple ways, simply to support python functions doing something
> they shouldn't be doing anyway -- print/os.system.
> 
> > e) I'm not 100% on this yet but I'm wondering if we should scrap
> > stdout/stderr output to the UI console in debug mode, just make it clear
> > where the output went on the console? If you ever have two tasks running
> > the output is unreadable anyway.
> 
> First, the stdout/stderr output doesn't go to the "UI console",
> they're messages sent to the UI, and the UI decides what to do with
> it.  The messages aren't unreadable for the goggle UI, only for
> knotty.  If you want to silence that for the one UI where it's
> confusing, I'd agree with that.  To see what I mean by this, try doing
> a build with goggle with current master with -D -- though note that
> stdout/stderr output from python tasks is going to the stdout/stderr
> of the server currently.  I'd suggest that whether we capture python
> task stdout/stderr to the UI or not, we need to silence the
> stdout/stderr of the server the way I do in above linked commit on
> github, to ensure its not possible to see this sort of problem with
> other UIs.  To return to the original point, the output is not
> unreadable for all UIs, so I don't see a good argument for removing a
> useful feature across the board.

In the context above I was thinking about knotty again and I should have
made that clear, sorry. I 100% agree with the functionality being there
for goggle and am not suggesting removing it, just tweaking the knotty
UI to show where the logfile is, rather that spew out unreadable output.

It does touch on another concern I have which is about UIs being able to
control what events they receive but this is a orthogonal problem. I
think we both are likely to agree that having event handlers being able
to select what they receive would be a good move and could help
performance too.

Cheers,

Richard