[OE-core] [PATCH] arch-arm: Do not add -march options for arm architecture

Martin Jansa martin.jansa at gmail.com
Sat Jan 26 17:39:42 UTC 2019


Which -mcpu/-march combination is causing issue in gcc9?

I know we had this issue with armv7a tunes before, that's why I've added
separate armv7ve before the switch from -mtune to -mcpu.

I still believe -mcpu (with or without -march) is better option.

Using -mtune without indicating which -mtune was actually used in the
package arch is bad (especially if you have multiple MACHINEs with
different DEFAULTTUNEs e.g. cortexa7, cortexa8 in -mtune, but both ending
in armv7a package arch, unpacking all packages from sstate every time you
switch MACHINE and the package manager on target never knowing which -mtune
was used to build given package upgrade from feed).

Switching from -mcpu to -mtune while keeping the current cpu-specific
package architectures is better, but then the use case, that I build
everything e.g. with cortexa8 in -mtune and install this cortexa8 package
feed on cortexa7 MACHINEs also looks strange, yes it will work in runtime,
but why not use generic armv7a DEFAULTTUNE to make it clear (to package
manager as well as end user) that I'm not really optimizing for given CPU).

>From the naive openssl benchmarks I did few years back, there was very
small difference between mtune/mcpu/just-march, but still for projects
where I have separate package feeds for each device (e.g. LG TVs) I prefer
to use -mcpu (because I really know that such packages won't ever be used
on device with different CPU) and e.g. in LuneOS we're more concerned about
package feed size and build time, so we select more generic DEFAULTTUNE
which is shared across many MACHINEs and that doesn't have any -mcpu (nor
-mtune) options in it, so should be fine as well.

We can easily extend the tune-test script I've used back then when
switching to -mcpu to not only use bitbake -e, but also call gcc-cross to
check for possible warnings from incompatible march/mcpu combinations and
fix those. The old version is here:
http://git.openembedded.org/openembedded-core-contrib/log/?h=jansa/tune-test
it was used in mcpu switch here:
http://git.openembedded.org/openembedded-core-contrib/log/?h=jansa/tune2-test

http://git.openembedded.org/openembedded-core-contrib/log/?h=jansa/tune3

>From the gcc git log I don't see many changes in:
https://github.com/gcc-mirror/gcc/commits/master/gcc/config/arm/driver-arm.c
nor
https://github.com/gcc-mirror/gcc/commits/master/gcc/config/arm/arm-cpus.in
so I'm really interested which combination issues the warning now.

Regards,

On Sat, Jan 26, 2019 at 5:29 PM Khem Raj <raj.khem at gmail.com> wrote:

>
> On Fri, Jan 25, 2019 at 12:32 PM Mark Hatle <mark.hatle at windriver.com>
> wrote:
> >
> > On 1/24/19 9:22 PM, Khem Raj wrote:
> > > On Wed, Jan 23, 2019 at 6:11 PM Andre McCurdy <armccurdy at gmail.com>
> wrote:
> > >>
> > >> On Wed, Jan 23, 2019 at 2:50 PM Khem Raj <raj.khem at gmail.com> wrote:
> > >>>
> > >>> On Wed, Jan 23, 2019 at 5:27 PM Andre McCurdy <armccurdy at gmail.com>
> wrote:
> > >>>>
> > >>>> On Wed, Jan 23, 2019 at 12:05 PM Khem Raj <raj.khem at gmail.com>
> wrote:
> > >>>>>
> > >>>>> tune files which inherit the arch definitions already define
> appropriate
> > >>>>> -mcpu option, which is equivalent of right -march and -mtune
> combination
> > >>>>
> > >>>> And what about machines which inherit an arch definition instead of
> a
> > >>>> cpu definition? Is that no longer supported?
> > >>>
> > >>> I could not find such a machine configuration in several BSP layers
> I looked at.
> > >>> Do you know of machine definitions where these files are included
> > >>> directly in machine configs ?
> > >>
> > >> Using a CPU specific include but then leaving the generic DEFAULTTUNE
> > >> (ie not enabling any CPU specific TUNE_CCARGS) is equivalent, and I
> > >> guess quite common.
> > >
> > > defaulttune does not play much part in appending -mcpu option to
> > > compiler flags, A case
> > > where this could bypass the setting mcpu is when we do not include a
> > > tune-* for arm based machine
> > > and thats the case I was interested to know.
> > >
> > > again if there is such a BSP I would be interested to test it out.
> > >>
> > >>> if so, then we have to either use _remove or introduce a weak
> variable
> > >>> to set march only if
> > >>> mcpu is not set which
> > >>
> > >> I think the solution is to ensure that the CPU specific tuning options
> > >> are always compatible with any architecture specific options which may
> > >> be active.
> > >>
> > >> In general our CPU specific tuning files are a little sloppy (e.g.
> > >> defaulting to combinations of VFP versions and architecture levels
> > >> which never exist in the real world). If newer versions of gcc are
> > >> making those issues more apparent then why not take the opportunity to
> > >> fix them properly?
> > >
> > > the mcpu option is helping with right march/mtune combination. while
> we can
> > > use specific -march=arch(+-)features syntax and combine with right
> > > -mtune to match -mcpu I think we should use
> > > compiler defaults and not try to recreate something that -mcpu would
> > > do it easily for us.
> > >
> > > So either we drop the idea of setting mcpu and use basic -march value
> > > combined with -mtune ( which btw. was default in past)
> > > or if we want to let compiler use ISA for a particular arch
> > > implementation ( cortexXXX ) then we should just use -mcpu and drop
> > > using -mtune and -march.
> > >
> > > I would include towards option 1.
> >
> > I know this is a bit of a 'me too' response, but we have to pick one
> format and
> > stay with it.  Either option, but not both.
>
> I concur.
>
>
> >
> > -IF- we know that the appropriate -mcpu options are available for all of
> the
> > potential combinations out there, then this is by far the preferred
> method in my
> > mind.  Much easier to handle and manage...
>
> -mcpu options are available for known CPU implementations. But there
> always will
> be weird ones, given how people can pick and choose subsystems for SOC on
> this
> architecture, and its highly probable that they wont be in gcc. so real
> question we need to answer for is, should we design for those cases and how
> often does this often.
>
>
> >
> > However, the more complex route came out as various 'odd-ball' (and
> people claim
> > don't exist, but they do) custom ARM cores get released without
> 'mandatory' math
> > units, or some other odd ball variation.  (I've seen this numerous times
> over
> > the last 20 years that something that doesn't exist, does... all because
> it's a
> > custom part.)
>
> Yes, I am dealing with one such design even today
>
>
> >
> > If we have an easy way to define the -mcpu= means some magic combination
> of
> > -march/-mtune within the GCC spec itself, then I'm 100% for moving to
> -mcpu to
> > rid ourselves of this mess.. and then we still have the flexibility to
> easily
> > update/add tunes and GCC configuration for the odd-balls that seem to
> crop up
> > regularly.
>
> So yes we can do nearly same but not exactly same. We have to start adding
> or deleting
> features via -march and -mtune would be same value, but there are cases
> where gcc defines
> internal properties based on mcpu, that is not representable in those
> suffixes we will add. so
> we will come very close but not exact and I think compiler is right in
> this case because we
> are not exactly telling it about a CPU implementation, but we are telling
> the cpu has these properties, so it really may not be one of standard cpu
> implementation.
>
>
> >
> > (I know -I- can always patch GCC for new -mcpu combinations, but I'm not
> > thinking about 'me', I'm thinking about the user of OE who is told 'use
> this
> > CPU, it has this set of functions' and is NOT a compiler expert who can
> wade
> > through GCC spec files and generate patches with any confidence.)
> >
>
> I would think we can solve this for these niche cases, via using
> -march/-mtune and drop
> -mcpu, we can append adding/deleting a given feature e.g. neon or vfp or
> vfpv3 etc.
> but this would mean we have to rename our package feed names as well to
> match this
> new representation, which currently is mapped to mcpu values. This is
> quite a departure
> from package feeds point of view. but its more aligned with how we have
> defined arch and tunes.
> so pick the battles.
>
>
> > --Mark
> >
> > >>
> > >>>>> and is preferred since gcc is getting stricter and stricter with
> option
> > >>>>> check semantics and can now find incompatible -march and -mcpu
> options
> > >>>>> better with every release. It does internal feature consistency
> check
> > >>>>> and if it finds out discrepency between what -mcpu would expand to
> as
> > >>>>> compared to -march it will flag the options to be incompatible, for
> > >>>>> naked eye it sounds wrong but gcc would translate -mcpu to a given
> > >>>>> -march internally and it might not match to what we set in these
> arch
> > >>>>> files.
> > >>>>>
> > >>>>> The effects are quite subtle, where this can result in configure
> test
> > >>>>> failing to compile due to these incompatible options and a feature
> > >>>>> option getting disabled for a recipe for no reason.
> > >>>>>
> > >>>>> e.g. with gcc9 which can now detect that -mcpu=cortex-a5 and
> > >>>>> -march=armv7-a are incompatible, many features in libstdc++ ends up
> > >>>>> disabled due to configure check failures e.g. size_t size,
> ptrdiff_t
> > >>>>> sizes, which inturn results in compiling libstdc++ with unwanted
> > >>>>> disabled features.
> > >>>>
> > >>>> It would be interesting to see more specific details of this.
> > >>>
> > >>> These are configure tests which fail when -Werror is in use and is
> not
> > >>> limited to libstdc++
> > >>> it can happen in any package. Since the diagnostic about incompatible
> > >>> march mcpu pair
> > >>> is a warning in general, it does not matter, but when we have -Werror
> > >>> enabled this turns
> > >>> into error, You can simply try a small helloworld example to see the
> effects.
> > >>>
> > >>>>
> > >>>>> Signed-off-by: Khem Raj <raj.khem at gmail.com>
> > >>>>> ---
> > >>>>>  meta/conf/machine/include/arm/arch-armv4.inc   | 1 -
> > >>>>>  meta/conf/machine/include/arm/arch-armv5.inc   | 1 -
> > >>>>>  meta/conf/machine/include/arm/arch-armv6.inc   | 1 -
> > >>>>>  meta/conf/machine/include/arm/arch-armv7a.inc  | 1 -
> > >>>>>  meta/conf/machine/include/arm/arch-armv7ve.inc | 1 -
> > >>>>>  meta/conf/machine/include/tune-iwmmxt.inc      | 2 +-
> > >>>>>  6 files changed, 1 insertion(+), 6 deletions(-)
> > >>>>>
> > >>>>> diff --git a/meta/conf/machine/include/arm/arch-armv4.inc
> b/meta/conf/machine/include/arm/arch-armv4.inc
> > >>>>> index 47a7ad2830..52d8ab1e8f 100644
> > >>>>> --- a/meta/conf/machine/include/arm/arch-armv4.inc
> > >>>>> +++ b/meta/conf/machine/include/arm/arch-armv4.inc
> > >>>>> @@ -2,7 +2,6 @@ DEFAULTTUNE ?= "armv4"
> > >>>>>
> > >>>>>  TUNEVALID[arm] = "Enable ARM instruction set"
> > >>>>>  TUNEVALID[armv4] = "Enable instructions for ARMv4"
> > >>>>> -TUNE_CCARGS .= "${@bb.utils.contains('TUNE_FEATURES', 'armv4', '
> -march=armv4t', '', d)}"
> > >>>>>  # enable --fix-v4bx when we have armv4 in TUNE_FEATURES, but then
> disable it when we have also armv5 or thumb
> > >>>>>  # maybe we should extend bb.utils.contains to support check for
> any checkvalues in value, now it does
> > >>>>>  # checkvalues.issubset(val) which cannot be used for negative
> test of foo neither bar in value
> > >>>>> diff --git a/meta/conf/machine/include/arm/arch-armv5.inc
> b/meta/conf/machine/include/arm/arch-armv5.inc
> > >>>>> index f9068af9de..1fe1b6b8e4 100644
> > >>>>> --- a/meta/conf/machine/include/arm/arch-armv5.inc
> > >>>>> +++ b/meta/conf/machine/include/arm/arch-armv5.inc
> > >>>>> @@ -2,7 +2,6 @@ DEFAULTTUNE ?= "armv5"
> > >>>>>
> > >>>>>  TUNEVALID[armv5] = "Enable instructions for ARMv5"
> > >>>>>  TUNECONFLICTS[armv5] = "armv4"
> > >>>>> -TUNE_CCARGS .= "${@bb.utils.contains('TUNE_FEATURES', 'armv5', '
> -march=armv5t${ARMPKGSFX_DSP}', '', d)}"
> > >>>>>  MACHINEOVERRIDES =. "${@bb.utils.contains('TUNE_FEATURES',
> 'armv5', 'armv5:', '' ,d)}"
> > >>>>>
> > >>>>>  require conf/machine/include/arm/arch-armv4.inc
> > >>>>> diff --git a/meta/conf/machine/include/arm/arch-armv6.inc
> b/meta/conf/machine/include/arm/arch-armv6.inc
> > >>>>> index 6c838e999c..adb9be8050 100644
> > >>>>> --- a/meta/conf/machine/include/arm/arch-armv6.inc
> > >>>>> +++ b/meta/conf/machine/include/arm/arch-armv6.inc
> > >>>>> @@ -2,7 +2,6 @@ DEFAULTTUNE ?= "armv6hf"
> > >>>>>
> > >>>>>  TUNEVALID[armv6] = "Enable instructions for ARMv6"
> > >>>>>  TUNECONFLICTS[armv6] = "armv4 armv5"
> > >>>>> -TUNE_CCARGS .= "${@bb.utils.contains('TUNE_FEATURES', 'armv6', '
> -march=armv6', '', d)}"
> > >>>>>  MACHINEOVERRIDES =. "${@bb.utils.contains('TUNE_FEATURES',
> 'armv6', 'armv6:', '' ,d)}"
> > >>>>>
> > >>>>>  require conf/machine/include/arm/arch-armv5-dsp.inc
> > >>>>> diff --git a/meta/conf/machine/include/arm/arch-armv7a.inc
> b/meta/conf/machine/include/arm/arch-armv7a.inc
> > >>>>> index a2663d8008..09d2c03a5d 100644
> > >>>>> --- a/meta/conf/machine/include/arm/arch-armv7a.inc
> > >>>>> +++ b/meta/conf/machine/include/arm/arch-armv7a.inc
> > >>>>> @@ -3,7 +3,6 @@ ARM_INSTRUCTION_SET ?= "thumb"
> > >>>>>
> > >>>>>  TUNEVALID[armv7a] = "Enable instructions for ARMv7-a"
> > >>>>>  TUNECONFLICTS[armv7a] = "armv4 armv5 armv6 armv7"
> > >>>>> -TUNE_CCARGS .= "${@bb.utils.contains('TUNE_FEATURES', 'armv7a', '
> -march=armv7-a', '', d)}"
> > >>>>>  MACHINEOVERRIDES =. "${@bb.utils.contains('TUNE_FEATURES',
> 'armv7a', 'armv7a:', '' ,d)}"
> > >>>>>
> > >>>>>  require conf/machine/include/arm/arch-armv6.inc
> > >>>>> diff --git a/meta/conf/machine/include/arm/arch-armv7ve.inc
> b/meta/conf/machine/include/arm/arch-armv7ve.inc
> > >>>>> index 4d9260fecb..31e334f645 100644
> > >>>>> --- a/meta/conf/machine/include/arm/arch-armv7ve.inc
> > >>>>> +++ b/meta/conf/machine/include/arm/arch-armv7ve.inc
> > >>>>> @@ -2,7 +2,6 @@ DEFAULTTUNE ?= "armv7vethf"
> > >>>>>
> > >>>>>  TUNEVALID[armv7ve] = "Enable instructions for ARMv7ve"
> > >>>>>  TUNECONFLICTS[armv7ve] = "armv4 armv5 armv6 armv7 armv7a"
> > >>>>> -TUNE_CCARGS .= "${@bb.utils.contains('TUNE_FEATURES', 'armv7ve',
> ' -march=armv7ve', '', d)}"
> > >>>>>  MACHINEOVERRIDES =. "${@bb.utils.contains('TUNE_FEATURES',
> 'armv7ve', 'armv7ve:', '' ,d)}"
> > >>>>>
> > >>>>>  require conf/machine/include/arm/arch-armv7a.inc
> > >>>>> diff --git a/meta/conf/machine/include/tune-iwmmxt.inc
> b/meta/conf/machine/include/tune-iwmmxt.inc
> > >>>>> index f27423cb2e..6e577697cc 100644
> > >>>>> --- a/meta/conf/machine/include/tune-iwmmxt.inc
> > >>>>> +++ b/meta/conf/machine/include/tune-iwmmxt.inc
> > >>>>> @@ -6,7 +6,7 @@ DEFAULTTUNE ?= "iwmmxt"
> > >>>>>  require conf/machine/include/arm/arch-armv5-dsp.inc
> > >>>>>
> > >>>>>  TUNEVALID[iwmmxt] = "Enable Intel PXA27x specific processor
> optimizations"
> > >>>>> -TUNE_CCARGS .= "${@bb.utils.contains('TUNE_FEATURES', 'iwmmxt', '
> -march=iwmmxt -mcpu=iwmmxt', '', d)}"
> > >>>>> +TUNE_CCARGS .= "${@bb.utils.contains('TUNE_FEATURES', 'iwmmxt', '
> -mcpu=iwmmxt', '', d)}"
> > >>>>>
> > >>>>>  AVAILTUNES += "iwmmxt"
> > >>>>>  ARMPKGARCH_tune-iwmmxt = "iwmmxt"
> > >>>>> --
> > >>>>> 2.20.1
> > >>>>>
> > >>>>> --
> > >>>>> _______________________________________________
> > >>>>> Openembedded-core mailing list
> > >>>>> Openembedded-core at lists.openembedded.org
> > >>>>> http://lists.openembedded.org/mailman/listinfo/openembedded-core
> >
> --
> _______________________________________________
> Openembedded-core mailing list
> Openembedded-core at lists.openembedded.org
> http://lists.openembedded.org/mailman/listinfo/openembedded-core
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openembedded.org/pipermail/openembedded-core/attachments/20190126/e0fc5b0e/attachment-0001.html>


More information about the Openembedded-core mailing list