This is the mail archive of the
libc-alpha@sourceware.org
mailing list for the glibc project.
Re: [PATCH] Rename __memcmp_sse4_2 to __memcmp_sse4_1.
- From: "H.J. Lu" <hjl dot tools at gmail dot com>
- To: Andreas Jaeger <aj at suse dot com>
- Cc: Matt Turner <mattst88 at gmail dot com>, GNU C Library <libc-alpha at sourceware dot org>, Liubov Dmitrieva <liubov dot dmitrieva at gmail dot com>
- Date: Wed, 10 Jul 2013 08:30:01 -0700
- Subject: Re: [PATCH] Rename __memcmp_sse4_2 to __memcmp_sse4_1.
- References: <CAMe9rOreowCOEH+6zRaRNk_p9sYe3T2bhwPRbKpybW9cO0BhJA at mail dot gmail dot com> <1373419029-19125-1-git-send-email-mattst88 at gmail dot com> <51DCE51F dot 7000001 at suse dot com>
On Tue, Jul 9, 2013 at 9:37 PM, Andreas Jaeger <aj@suse.com> wrote:
> On 07/10/2013 03:17 AM, Matt Turner wrote:
>> It uses SSE 4.1 instructions (ptest) but no SSE 4.2 instructions.
>
> There are two parts to this: It should only run on cpus with those
> instructions but we also need to ensure that it gives a better
> performance on such cpus. HJ, Matt, please do run performance tests on a
> variety of affected cpus to show that this change really helps in all cases,
>
> Andreas
Only Penryn has SSE4.1 without SSE4.2. Liubov, can
you compare performance of memcmp-sse4.S vs
memcmp-ssse3.S on Penryn?
Thanks.
--
H.J.