[OE-core] [PATCHv2] openssl: switch ARM builds from linux-elf-arm to linux-armv4 config
Koen Kooi
koen at dominion.thruhere.net
Thu Oct 17 05:54:36 UTC 2013
Op 16 okt. 2013, om 11:45 heeft Koen Kooi <koen at dominion.thruhere.net> het volgende geschreven:
>
> Op 16 okt. 2013, om 11:20 heeft Phil Blundell <pb at pbcl.net> het volgende geschreven:
>
>> On Wed, 2013-10-16 at 09:25 +0200, Koen Kooi wrote:
>>> From: Koen Kooi <koen.kooi at linaro.org>
>>>
>>> This enables aes and sha1 assembly at buildtime. Openssl does a
>>> runtime check to see which portion gets enabled.
>>
>> [...]
>>
>>> Algo blocksize ops/s after
>>> ops/s before difference
>>> -------------------------------------------
>>> MD5 16 308,766 264,664 -14.28%
>>> 64 277,090 263,340 -4.96%
>>> 256 212,652 197,043 -7.34%
>>> 1024 103,604 100,157 -3.33%
>>> 8192 17,936 17,796 -0.78%
>>
>> Do you know why it's causing MD5 to get slower? I guess md5 with
>> blocksize=16 is not a very common case, but still.
>
> I really don't know and it is the only algo that gets a lot slower. This patch is a preparation for (more) NEON optimizations which seem to fix the regression, see
>
> https://docs.google.com/spreadsheet/ccc?key=0AhgZ33Tf6eBldHVONjRXRnItWld4eFlRWTJ3RzVIdGc&usp=sharing
>
> and
>
> http://dominion.thruhere.net/koen/angstrom/0002-openssl-1.0.1e-add-ARMv7-AES-optimizations.patch
>
> I need to test that patch on A9 and A15 cores as well since it doesn't seem to do a lot on A8 cores :(
And on A9 cores linux-armv4 is 20% *faster* on MD5 with blocksize=16, see spreadsheet above. So it probably is a scheduling issue that favours A9 cores.
regards,
Koen.
>
>> Also, it seems generally a bit unwholesome for openssl to be picking its
>> own CFLAGS at all. Would it be better to just make it use the same
>> CFLAGS as everything else?
>
> openssl.inc already pokes at CLAG(S):
>
> CFLAG = "${@base_conditional('SITEINFO_ENDIANNESS', 'le', '-DL_ENDIAN', '-DB_ENDIAN', d)} \
> -DTERMIO ${CFLAGS} -Wall -Wa,--noexecstack"
>
> The complete command looks like this:
>
> arm-angstrom-linux-gnueabi-gcc -march=armv7-a -mthumb-interwork -mfloat-abi=hard -mfpu=neon -mtune=cortex-a8 --sysroot=/build/v2013.06/build/tmp-angstrom_v2013_06-eglibc/sysroots/beaglebone -I. -I.. -I../include -fPIC -DOPENSSL_PIC -DOPENSSL_THREADS -D_REENTRANT -DDSO_DLFCN -DHAVE_DLFCN_H -DL_ENDIAN -DTERMIO -O2 -pipe -g -feliminate-unused-debug-types -Wall -Wa,--noexecstack -DHAVE_CRYPTODEV -DUSE_CRYPTODEV_DIGESTS -DOPENSSL_BN_ASM_MONT -DOPENSSL_BN_ASM_GF2m -DSHA1_ASM -DSHA256_ASM -DSHA512_ASM -DAES_ASM -DBSAES_ASM -DGHASH_ASM -c -c -o armv4cpuid.o armv4cpuid.S
>
> So the '$cflags = -DTERMIO -O3 -Wall' in linux-armv4 gets overridden by OE, just like we want :)
>
> regards,
>
> Koen
More information about the Openembedded-core
mailing list