This is the mail archive of the libc-alpha@sourceware.org mailing list for the glibc project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: [PATCH RFC] Imporve 64bit memcpy performance for Haswell CPU with AVX instruction

From: "H.J. Lu" <hjl dot tools at gmail dot com>
To: Ling Ma <ling dot ma dot program at gmail dot com>
Cc: OndÅej BÃlka <neleai at seznam dot cz>, GNU C Library <libc-alpha at sourceware dot org>, Liubov Dmitrieva <liubov dot dmitrieva at gmail dot com>, yumkam at gmail dot com, Ling Ma <ling dot ml at alibaba-inc dot com>
Date: Wed, 18 Jun 2014 09:46:29 -0700
Subject: Re: [PATCH RFC] Imporve 64bit memcpy performance for Haswell CPU with AVX instruction
Authentication-results: sourceware.org; auth=none
References: <1398055946-4493-1-git-send-email-ling dot ma at alipay dot com> <CAOGi=dOQEbbkkzQGz-ZtQ0-WEHj2=hjmbstZXvZyLqycVy18Kg at mail dot gmail dot com> <20140515202213 dot GA20667 at domone dot podge> <CAOGi=dNbyxj+7gjwcpAVBxYB-MH9E7s=xi2nKwYXkDViasOZrA at mail dot gmail dot com> <CAMe9rOpC5-p7DV=xBfhUknkruz2-Ek+Bpzm+ycZiKdXtSyXxiA at mail dot gmail dot com> <CAOGi=dNHHvNriOMWmj2K3Ym7n6G83mGOyUzMtNY91nFr8=7G9w at mail dot gmail dot com> <CAOGi=dOJX3saKoa5YiDdveOqAb_=Sev4cBKyh7_gkXBU8_4=+g at mail dot gmail dot com>

On Tue, Jun 10, 2014 at 6:54 AM, Ling Ma <ling.ma.program@gmail.com> wrote:
> In this patch as gziped attachment, we take advantage of HSW memory
> bandwidth, manage to reduce miss branch prediction by avoiding using
> branch instructions and
> force destination to be aligned with avx instruction.
>
> The CPU2006 403.gcc benchmark indicates this patch improves performance
> from 2% to 8% compared with pending sse2 version.
>
> Appreciate your comment

Any feedback?  I'd like to check it in before 2.20 code freeze?

Thanks.


-- 
H.J.

Follow-Ups:
- Re: [PATCH RFC] Imporve 64bit memcpy performance for Haswell CPU with AVX instruction
  - From: Ling Ma

References:
- Re: [PATCH RFC] Imporve 64bit memcpy performance for Haswell CPU with AVX instruction
  - From: H.J. Lu
- Re: [PATCH RFC] Imporve 64bit memcpy performance for Haswell CPU with AVX instruction
  - From: Ling Ma
- Re: [PATCH RFC] Imporve 64bit memcpy performance for Haswell CPU with AVX instruction
  - From: Ling Ma

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]