157 Commits

Author SHA1 Message Date
steveatgh
f8a3e4e428 8328998: Encoding support for Intel APX extended general-purpose registers
Reviewed-by: kvn, sviswanathan, jbhateja
2024-05-23 22:54:24 +00:00
Martin Balao
8a8d928898 8330611: AES-CTR vector intrinsic may read out of bounds (x86_64, AVX-512)
Co-authored-by: Francisco Ferrari Bihurriet <fferrari@openjdk.org>
Co-authored-by: Martin Balao <mbalao@openjdk.org>
Reviewed-by: aph, sviswanathan
2024-04-24 20:23:25 +00:00
Scott Gibbons
7a895552c8 8330844: Add aliases for conditional jumps and additional instruction forms for x86
Reviewed-by: kvn, epeter, sviswanathan
2024-04-23 23:35:59 +00:00
Scott Gibbons
bd67ac69a2 8329331: Intrinsify Unsafe::setMemory
Reviewed-by: sviswanathan, jbhateja, kvn
2024-04-21 23:24:46 +00:00
Jatin Bhateja
185e711bfe 8318650: Optimized subword gather for x86 targets.
Reviewed-by: sviswanathan, epeter, psandoz
2024-04-21 23:21:17 +00:00
Andrew Haley
f11a496de6 8180450: secondary_super_cache does not scale well
Co-authored-by: Vladimir Ivanov <vlivanov@openjdk.org>
Reviewed-by: kvn, vlivanov, dlong
2024-04-16 14:21:48 +00:00
Cesar Soares Lucas
31ee5108e0 8241503: C2: Share MacroAssembler between mach nodes during code emission
Reviewed-by: kvn, mdoerr, amitkumar, lucy
2024-04-11 15:44:49 +00:00
vamsi-parasa
7e5ef79f95 8323116: [REDO] Computational test more than 2x slower when AVX instructions are used
Reviewed-by: sviswanathan, kvn
2024-04-08 18:41:32 +00:00
vamsi-parasa
18de9321ce 8325991: Accelerate Poly1305 on x86_64 using AVX2 instructions
Reviewed-by: sviswanathan, jbhateja
2024-03-11 16:42:13 +00:00
Jatin Bhateja
2d4c757e2e 8327041: Incorrect lane size references in avx512 instructions.
Reviewed-by: sviswanathan
2024-03-09 07:12:34 +00:00
vamsi-parasa
7c5e6e74c8 8327147: Improve performance of Math ceil, floor, and rint for x86
Reviewed-by: jbhateja, sviswanathan, dlong
2024-03-07 21:44:46 +00:00
Roman Kennke
f0024f585d 8324734: Relax too-strict assert(VM_Version::supports_evex()) in Assembler::locate_operand()
Co-authored-by: Vladimir Kozlov <kvn@openjdk.org>
Reviewed-by: kvn, shade
2024-01-30 13:26:10 +00:00
Kim Barrett
a678416994 8322805: Eliminate -Wparentheses warnings in x86 code
Reviewed-by: dholmes, kvn
2024-01-02 22:47:18 +00:00
Jatin Bhateja
76637c53c5 8321648: Integral gather optimized mask computation.
Reviewed-by: thartmann, sviswanathan
2023-12-19 07:51:52 +00:00
Jorn Vernee
ce4b257fa5 8320886: Unsafe_SetMemory0 is not guarded
Reviewed-by: dholmes, fparain
2023-12-11 19:05:40 +00:00
steveatgh
82967f45db 8310159: Bulk copy with Unsafe::arrayCopy is slower compared to memcpy
Co-authored-by: Maurizio Cimadamore <mcimadamore@openjdk.org>
Reviewed-by: thartmann, jbhateja, sviswanathan
2023-11-27 17:35:39 +00:00
Aleksey Shipilev
b120a05b22 8319406: x86: Shorter movptr(reg, imm) for 32-bit immediates
Reviewed-by: qamai, kvn
2023-11-14 08:59:15 +00:00
Smita Kamath
1740950036 8314901: AES-GCM interleaved implementation using AVX2 instructions
Reviewed-by: sviswanathan, djelinski
2023-10-19 18:27:51 +00:00
Scott Gibbons
ce2a7ea40a 8314056: Remove runtime platform check from frem/drem
Reviewed-by: sviswanathan, jbhateja
2023-08-30 01:28:27 +00:00
Kim Barrett
c077be4768 8314694: Separate checked_cast from globalDefinitions.hpp
Reviewed-by: stuefe, coleenp, sspitsyn
2023-08-23 22:32:37 +00:00
Christian Hagedorn
49ddb19972 8313760: [REDO] Enhance AES performance
Co-authored-by: Andrew Haley <aph@openjdk.org>
Reviewed-by: adinn, aph, sviswanathan, rhalade, kvn, dlong
2023-08-16 07:21:04 +00:00
Christian Hagedorn
207bd00c51 8313756: [BACKOUT] 8308682: Enhance AES performance
Reviewed-by: thartmann
2023-08-14 12:08:16 +00:00
Coleen Phillimore
7dd47998f0 8311847: Fix -Wconversion for assembler.hpp emit_int8,16 callers
Reviewed-by: dlong, aph
2023-07-24 12:09:27 +00:00
Christian Hagedorn
5c4623b360 8308682: Enhance AES performance
Reviewed-by: rhalade, dlong, kvn
2023-07-18 21:17:09 +00:00
Coleen Phillimore
230bcb769a 8309685: Fix -Wconversion warnings in assembler and register code
Reviewed-by: aph, fparain
2023-06-22 14:22:02 +00:00
Quan Anh Mai
947f14977a 8308444: LoadStoreNode::result_not_used() is too conservative
Reviewed-by: kvn, thartmann
2023-06-15 16:00:18 +00:00
Scott Gibbons
5d5ae35288 8308966: Add intrinsic for float/double modulo for x86 AVX2 and AVX512
Co-authored-by: Marius Cornea <marius.cornea@intel.com>
Reviewed-by: jbhateja, sviswanathan
2023-06-12 15:06:42 +00:00
Jasmine Karthikeyan
fb0b1f0c23 8051725: Improve expansion of Conv2B nodes in the middle-end
Reviewed-by: thartmann, qamai, sviswanathan
2023-05-30 14:11:12 +00:00
Kim Barrett
3599448ad8 8307147: [x86] Dangling pointer warning for Assembler::_attributes
Reviewed-by: dholmes, aph
2023-05-04 02:08:07 +00:00
Guoxiong Li
49726ee3a9 8305690: [X86] Do not emit two REX prefixes in Assembler::prefix
Reviewed-by: kvn, thartmann
2023-04-18 08:01:32 +00:00
Justin King
20830883df 8304828: Lots of constant static data not declared static const in cpu/x86
Reviewed-by: dholmes, kvn
2023-03-27 14:03:04 +00:00
Quan Anh Mai
38e17148fa 8304258: x86: Improve the code generation of VectorRearrange with int and float
Reviewed-by: kvn, jbhateja, sviswanathan
2023-03-25 05:30:16 +00:00
Johan Sjölen
4154a980ca 8301498: Replace NULL with nullptr in cpu/x86
Reviewed-by: dholmes, kvn
2023-03-22 14:18:40 +00:00
Jatin Bhateja
6475501a01 8300208: Optimize Adler32 stub for AVX-512 targets.
Reviewed-by: kvn, sviswanathan
2023-01-29 01:55:46 +00:00
Erik Österlund
b7fb8ef89e 8299323: Allow extended registers for cmpw
Reviewed-by: sviswanathan, kvn
2023-01-17 12:16:05 +00:00
Erik Österlund
a503ec2cc7 8299608: Add Register + imm32 orq to x86_64 assembler
Reviewed-by: shade, sviswanathan, kvn
2023-01-09 10:01:26 +00:00
Kim Barrett
bfa921ae6c 8160404: RelocationHolder constructors have bugs
Reviewed-by: kvn, jrose, jvernee
2022-12-16 20:47:40 +00:00
Quan Anh Mai
3dfadeebd0 8292289: [vectorapi] Improve the implementation of VectorTestNode
Reviewed-by: xgong, kvn
2022-12-08 20:25:20 +00:00
Smita Kamath
073897c88b 8294588: Auto vectorize half precision floating point conversion APIs
Reviewed-by: sviswanathan, kvn, jbhateja, fgao, xgong
2022-12-08 09:46:59 +00:00
Jamil Nimeh
cd6bebbf34 8247645: ChaCha20 intrinsics
Reviewed-by: sviswanathan, ngasson, vlivanov, ascarpino
2022-11-29 14:40:20 +00:00
Volodymyr Paprotski
f12710e938 8288047: Accelerate Poly1305 on x86_64 using AVX512 instructions
Reviewed-by: sviswanathan, vlivanov
2022-11-21 21:01:25 +00:00
Quan Anh Mai
404e8de155 8294865: x86: Improve the code generation of MulVB and MulVL
Reviewed-by: kvn, vlivanov
2022-10-15 11:28:02 +00:00
Jatin Bhateja
2ceb80c60f 8288043: Optimize FP to word/sub-word integral type conversion on X86 AVX2 platforms
Reviewed-by: kvn, sviswanathan
2022-10-12 01:05:44 +00:00
Smita Kamath
07946aa49c 8289552: Make intrinsic conversions between bit representations of half precision values and floats
Reviewed-by: kvn, sviswanathan, jbhateja
2022-10-11 17:06:19 +00:00
Quan Anh Mai
e2f8251490 8293618: x86: Wrong code generation in class Assembler
Reviewed-by: kvn, thartmann
2022-09-23 17:12:48 +00:00
Vladimir Ivanov
6e248279cf 8292878: x86: Make scratch register usage explicit in assembler code
Reviewed-by: kvn, shade
2022-08-30 18:45:24 +00:00
Vladimir Ivanov
f3be6731d3 8292638: x86: Improve scratch register handling in VM stubs
Co-authored-by: Aleksey Shipilev <shade@openjdk.org>
Reviewed-by: kvn, shade
2022-08-23 19:32:51 +00:00
Jatin Bhateja
38a81913d3 8290322: Optimize Vector.rearrange over byte vectors for AVX512BW targets.
Reviewed-by: kvn, sviswanathan
2022-08-22 23:59:58 +00:00
Quan Anh Mai
92d2982f3f 8283232: x86: Improve vector broadcast operations
Reviewed-by: kvn, jbhateja
2022-08-04 16:27:45 +00:00
Andrew Haley
966ab219b4 8291895: Remove PRAGMA_NONNULL_IGNORED from x86 and AArch64
Reviewed-by: shade
2022-08-04 16:11:01 +00:00