On 12-07-24 03:32 PM, Richard Purdie wrote:
On Tue, 2012-07-24 at 15:25 -0400, Yao Zhao wrote:
On 12-07-24 03:07 PM, Richard Purdie wrote:
On Tue, 2012-07-24 at 13:39 -0500, Mark Hatle wrote:
On 7/24/12 8:57 AM, Burton, Ross wrote:
On 24 July 2012 14:49, Yao Zhao <yao.z...@windriver.com> wrote:
when bzip2-native is installed in parallel to sysroot, it is possible that
some packages are using bzip2 to unpack, there are chances that bzip2 is
installed to sysroot but libbz2.so.0 not installed yet because parallel
installation.
link bzip2 and bzip2recover statically to avoid this problem and don't lose
parallel installation. libbz2.so is still available.
Is it me, or is this officially getting silly?  This probably happens
for *every* binary in the sysroot that links to a library, which is
probably a fair proportion of them.  Statically linking every single
one and then special-casing further problems where a static link isn't
sufficient (see pythonnative) just isn't going to scale.
The problem is that there are a handful of things that are needed, and for some
reason (valid or not) packages are not "requiring", either because they assume
the system is valid or they simply don't know they have the requirement.

Both python and perl may be used by a number of auto* utilities as well as by
packages themselves.  We could attempt to add DEPENDS for all of the cases, but
is it really worth it?  I suspect we'll end up specify the DEPENDS for 90% of
the things in the system, simply due to them incidentally running some system
level python or perl script.  If we make the python/perl into external items
that only get loaded into the environment when we're building python/perl
modules then things become increasingly easy.

As for the bzip2 issue.. This is another of those "we assume it's valid
binaries"..  And the problem appears to be when it gets build, it's not valid
for a small window of time.

   From what Yao has explained to me (offline) the issue is that the
assume_provided for bzip2-native works just fine, we use the system
version...but then python (or other) utilities come in and say they need
"bzip2-full-native".  Which triggers the build.. and opens a small window of
time where bzip2 is being installed, where it isn't functional -- something else
comes in and has a requirement of bzip2-native, which is satisfied by the
assume_provided and we get into the race where it executes a broken binary.

The underlying issue is simply that if we're installing tooling that is either
assume-provided (based on an alternative 'full' version) or incidental usage
like python or perl we need to take precautions to ensure that the build system
never sees the "broken" version during the short install window triggering the
race condition.

I'll let Yao comment further on this particular issue, but there is an overall
class of issues with -native that we need to track to get rid of these subtle 
races.
The set of problems are cases where we have things in ASSUME_PROVIDED.
It is near impossible to have an empty set there and sometimes we need
to build things ourselves which are already listed there. python-native
and perl-native are the pathological cases but as also have issues with
bzip2, tar and likely undiscovered yet, now git.

I'd been searching for bzip2-replacement-native rather than full which
is why I didn't see this earlier. I suspect in the python/bzip2 case,
not installing the bzip2 binary might well solve many of the problems as
python really wants a native libbz2 and the host bzip2 binary is ok.
Hi Richard,

Can I provide a patch to change bzip2-full-native to bzip2-native to
finish this problem?
I have tested that the bzip2-native is not built any more.
No, since I think the libbz2 component that python needs will then not
be present and this causes other problems (see the previous commit or
mailing list archives for what the exact issue was, I don't remember
offhand).
An update on this thread.

I made a mistake on this problem. I thought do_install will install to sysroot directly for native build but actually not. The do_install only installs to image so it doesn't matter whether do_install is installed paralleled. The installation to sysroot happens on populate_sysroot stage and the code do a copytree to copy files,dirs from sysroot-destdir to sysroot dir. I didn't debug this python code, but debugged the code below when writting to the manifest file. bzip2 is copied earlier than libbz2.so... so it will happen if in this small window, unpacking referencing bzip2.

To me this routine should be written to install by dependency order, like packages-split dir. This should fix a lot of native tools problem without changing code in upper level. Of course I am not sure it will fix python problem.

And today we see another problem:
/bin/sh: /builds-2012-07-25-003940/qemux86_std/bitbake_build/tmp/sysroots/x86_64-linux/usr/bin/bzip2: Text file busy
tar: This does not look like a tar archive
tar: Exiting with failure status due to previous errors

So looks like even just a copy it may cause problem too.


Is it possible that let python-native only depends on libbz2 sub package? I looked at python-native package, when bzip2-native is not built, it seems it has problems to find bz2 library although it builds still ok.

thanks,
yao
Cheers,

Richard






_______________________________________________
Openembedded-core mailing list
Openembedded-core@lists.openembedded.org
http://lists.linuxtogo.org/cgi-bin/mailman/listinfo/openembedded-core

Reply via email to