llvm-project

History

[VectorCombine] Generalize foldBitOpOfBitcasts to support more cast operations (#148350 )

This patch generalizes the existing foldBitOpOfBitcasts optimization in the VectorCombine pass to handle additional cast operations beyond just bitcast.

  Fixes: [#146037](https://github.com/llvm/llvm-project/issues/146037)

  Summary

The optimization now supports folding bitwise operations (AND/OR/XOR)
with the following cast operations:
  - bitcast (original functionality)
  - trunc (truncate)
  - sext (sign extend)
  - zext (zero extend)

  The transformation pattern is:
  bitop(castop(x), castop(y)) -> castop(bitop(x, y))

This reduces the number of cast instructions from 2 to 1, improving
performance on targets where cast operations
are expensive or where performing bitwise operations on narrower types
is beneficial.
  
  Implementation Details

- Renamed foldBitOpOfBitcasts to foldBitOpOfCastops to reflect broader
functionality
  - Extended pattern matching to handle any CastInst operation
- Added validation for each cast type's constraints (e.g., trunc
requires source > dest)
  - Updated cost model to use the actual cast opcode
  - Preserves IR flags from original instructions
  - Handles multi-use scenarios appropriately

  Testing

- Added comprehensive tests in
test/Transforms/VectorCombine/bitop-of-castops.ll
  - Tests cover all supported cast types with all bitwise operations
  - Includes negative tests for unsupported patterns
  - All existing VectorCombine tests pass

2025-07-21 17:14:56 +01:00

AArch64

[AArch64,TTI] Remove RealUse check for vector insert/extract costs. (#146526 )

2025-07-15 15:19:27 +01:00

AMDGPU

[AMDGPU] Remove explicit datalayout from tests where not needed

2025-04-30 10:58:17 -07:00

ARM

[VectorCombine] Fold binary op of reductions. (#121567 )

2025-02-22 06:11:33 -05:00

Hexagon

…