Re: RFR: 8261542: X86 slice and unslice intrinsics for 256-bit byte/short vectors [v4]

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

Re: RFR: 8261542: X86 slice and unslice intrinsics for 256-bit byte/short vectors [v4]

Vladimir Kozlov-2
On Fri, 19 Feb 2021 03:20:59 GMT, Sandhya Viswanathan <[hidden email]> wrote:

>> The slice and unslice intrinsics for 256-bit byte/short vectors can be implemented for x86 platforms supporting AVX2 using a sequence of instructions.
>>
>> JBS: https://bugs.openjdk.java.net/browse/JDK-8261542
>>
>> The PerfSliceOrigin.java jmh test attached to the JBS shows the following performance on AVX2 platform.
>>
>> Before:
>> Benchmark                                 (size)   Mode  Cnt   Score   Error   Units
>> PerfSliceOrigin.vectorSliceOrigin           1024  thrpt    5  18.887 ± 1.128  ops/ms
>> PerfSliceOrigin.vectorSliceUnsliceOrigin    1024  thrpt    5   9.374 ± 0.370  ops/ms
>>
>> After:
>> Benchmark                                 (size)   Mode  Cnt      Score     Error   Units
>> PerfSliceOrigin.vectorSliceOrigin           1024  thrpt    5  13861.420 ±  19.071  ops/ms
>> PerfSliceOrigin.vectorSliceUnsliceOrigin    1024  thrpt    5   7895.199 ± 142.580  ops/ms
>
> Sandhya Viswanathan has updated the pull request incrementally with one additional commit since the last revision:
>
>   add assert on else path

Marked as reviewed by kvn (Reviewer).

-------------

PR: https://git.openjdk.java.net/jdk/pull/2520