llvm-project

Author	SHA1	Message	Date
David CARLIER	c9f037622f	[compiler-rt][memprof] adding free_sized/free_aligned_sized intercept… (#154011 ) …ions.	2025-08-20 00:27:08 +01:00
Sergei Barannikov	803edce6f7	[TableGen][DecoderEmitter] Analyze encodings once (#154309 ) Follow-up to #154288. With HwModes involved, we used to analyze the same encoding multiple times (unless `-suppress-per-hwmode-duplicates=O2` is specified). This affected the build time and made the statistics inaccurate. From the point of view of the generated code, this is an NFC.	2025-08-19 23:17:12 +00:00
Jonas Devlieghere	7cd61793ed	[lldb] Improve error handling in ObjectFileWasm (#154433 ) Improve error handling in ObjectFileWasm by using helpers that wrap their result in an llvm::Expected. The helper to read a Wasm string now return an Expected<std::string> and I created a helper to parse 32-bit ULEBs that returns an Expected<uint32_t>.	2025-08-19 16:00:31 -07:00
Valentin Clement (バレンタインクレメン)	f64b5c9c6f	[flang][cuda] Fix hasDataAttr signature in header file (#154435 ) Fix for #154422	2025-08-19 22:49:57 +00:00
Andres-Salamanca	577ee97d4e	[CIR] Fix access to bitfields inside a union (#154398 ) This PR fixes the access to bitfields inside a union. Previously, we were using a `getMemberOp` to access the field, but because it is a union, `getMemberOp` would always use index `0`. For example, given: ```c typedef union { int x; int y : 4; int z : 8; } demo; ``` ```mlir !rec_demo = !cir.record<union "demo" {!s32i, !u8i, !u8i}> ``` In the case of: ```c d.y = 2; ``` It would generate: ```mlir cir.get_member %0[0] {name = "y"} : !cir.ptr<!rec_demo> -> !cir.ptr<!s32i> ``` with a return type of `!s32i`, when it should be `!u8i`. the get_member verifier would detect that the return type does not match the `y` member. To fix this, we now use `bitcast` to get the start of the union.	2025-08-19 17:43:13 -05:00
Sirraide	7f20c6c29e	[Clang] [Sema] Always rebuild `this` if captured by value in a lambda with a dependent explicit object parameter (#154276 ) We have a flag that tracks whether a `CXXThisExpr` refers to a `*this` capture in a lambda with a dependent explicit object parameter; this is to mark it and member accesses involving it as dependent because there is no other way to track that (DREs have a similar flag); when instantiating the lambda, we need to always rebuild the `CXXThisExpr` to potentially clear that flag if the explicit object parameter is no longer dependent. Fixes #154054.	2025-08-20 00:37:48 +02:00
Craig Topper	58c41b7491	[ExpandVectorPredication] Use IRBuilder::CreateNUWMul instead of passing flags to CreateMul. NFC	2025-08-19 15:31:15 -07:00
Sergei Barannikov	07a6323c32	[TableGen][DecoderEmitter] Turn EncodingAndInst into a class (NFC) (#154230 ) The class will get more methods in follow-up patches.	2025-08-20 01:29:26 +03:00
Matheus Izvekov	227e88b943	[clang-tidy] fix misc-unconventional-assign-operator entity match (#154430 ) Makes sure UnconventionalAssignOperatorCheck checks if the types reference the same entity, not the exact declaration. This adds a new matcher to support this check. This fixes a regression introduced by #147835. Since this regression was never released, there are no release notes. Fixes #153770	2025-08-19 19:28:41 -03:00
Sterling-Augustine	317920063b	Add vector-based strlen implementation for x86_64 and aarch64 (#152389 ) These replace the default LIBC_CONF_STRING_UNSAFE_WIDE_READ implementation on x86_64 and aarch64. These are substantially faster than both the character-by-character implementation and the original unsafe_wide_read implementation. Some below I have been unable to performance-test the aarch64 version, but I suspect speedups similar to avx2. ``` Function: strlen Variant: char wide ull sse2 avx2 avx512 ============================================================================================================================================================= length=1, alignment=1: 13.18 20.47 (-55.24%) 20.21 (-53.27%) 32.50 (-146.54%) 26.05 (-97.61%) 18.03 (-36.74%) length=1, alignment=0: 12.80 34.92 (-172.89%) 20.01 (-56.39%) 17.52 (-36.86%) 17.78 (-38.92%) 18.04 (-40.94%) length=2, alignment=2: 9.91 19.02 (-91.95%) 12.64 (-27.52%) 11.06 (-11.59%) 9.48 ( 4.38%) 9.48 ( 4.34%) length=2, alignment=0: 9.56 26.88 (-181.24%) 12.64 (-32.31%) 11.06 (-15.73%) 11.06 (-15.72%) 11.83 (-23.80%) length=3, alignment=3: 8.31 10.45 (-25.84%) 8.28 ( 0.32%) 8.28 ( 0.36%) 6.21 ( 25.28%) 6.21 ( 25.24%) length=3, alignment=0: 8.39 14.53 (-73.20%) 8.28 ( 1.33%) 7.24 ( 13.69%) 7.56 ( 9.94%) 7.25 ( 13.65%) length=4, alignment=4: 9.84 21.76 (-121.24%) 15.55 (-58.11%) 6.57 ( 33.18%) 5.02 ( 48.98%) 6.00 ( 39.00%) length=4, alignment=0: 8.64 13.70 (-58.51%) 7.28 ( 15.73%) 6.37 ( 26.31%) 6.36 ( 26.36%) 6.36 ( 26.36%) length=5, alignment=5: 11.85 23.81 (-100.97%) 12.17 ( -2.67%) 5.68 ( 52.09%) 4.87 ( 58.94%) 6.48 ( 45.33%) length=5, alignment=0: 11.82 13.64 (-15.42%) 7.27 ( 38.45%) 6.36 ( 46.15%) 6.37 ( 46.11%) 6.36 ( 46.14%) length=6, alignment=6: 10.50 19.37 (-84.56%) 13.64 (-29.93%) 6.54 ( 37.71%) 6.89 ( 34.35%) 9.45 ( 10.01%) length=6, alignment=0: 14.96 14.05 ( 6.04%) 6.49 ( 56.62%) 5.68 ( 62.04%) 5.68 ( 62.04%) 13.15 ( 12.05%) length=7, alignment=7: 10.97 18.02 (-64.35%) 14.59 (-33.06%) 6.36 ( 41.96%) 5.46 ( 50.25%) 5.46 ( 50.25%) length=7, alignment=0: 10.96 15.76 (-43.77%) 15.37 (-40.15%) 6.96 ( 36.51%) 5.68 ( 48.22%) 7.04 ( 35.83%) length=4, alignment=0: 8.66 13.69 (-58.02%) 7.28 ( 16.00%) 6.37 ( 26.44%) 6.37 ( 26.52%) 6.61 ( 23.74%) length=4, alignment=7: 8.87 17.35 (-95.73%) 12.18 (-37.39%) 5.68 ( 35.94%) 4.87 ( 45.11%) 6.00 ( 32.36%) length=4, alignment=2: 8.67 10.05 (-15.91%) 7.28 ( 16.01%) 7.37 ( 15.02%) 5.46 ( 37.02%) 5.47 ( 36.89%) length=2, alignment=2: 5.64 10.01 (-77.64%) 7.29 (-29.34%) 6.37 (-13.04%) 5.46 ( 3.19%) 5.46 ( 3.19%) length=8, alignment=0: 12.78 16.52 (-29.33%) 18.27 (-43.00%) 11.82 ( 7.47%) 9.83 ( 23.03%) 11.46 ( 10.27%) length=8, alignment=7: 14.24 17.30 (-21.49%) 12.16 ( 14.59%) 5.68 ( 60.14%) 4.87 ( 65.83%) 6.23 ( 56.28%) length=8, alignment=3: 12.34 26.15 (-111.98%) 12.20 ( 1.14%) 6.50 ( 47.34%) 4.87 ( 60.54%) 6.18 ( 49.94%) length=5, alignment=3: 10.95 19.74 (-80.30%) 12.17 (-11.11%) 5.68 ( 48.16%) 4.87 ( 55.56%) 5.96 ( 45.55%) length=16, alignment=0: 20.33 29.29 (-44.08%) 36.18 (-77.97%) 5.68 ( 72.06%) 5.68 ( 72.08%) 10.60 ( 47.86%) length=16, alignment=7: 19.29 17.52 ( 9.16%) 12.98 ( 32.73%) 7.05 ( 63.47%) 4.87 ( 74.75%) 6.23 ( 67.71%) length=16, alignment=4: 20.54 25.18 (-22.56%) 15.42 ( 24.92%) 7.31 ( 64.43%) 4.87 ( 76.29%) 5.98 ( 70.88%) length=10, alignment=4: 14.59 21.26 (-45.71%) 12.17 ( 16.58%) 5.68 ( 61.07%) 4.87 ( 66.65%) 6.00 ( 58.91%) length=32, alignment=0: 35.46 22.00 ( 37.95%) 16.22 ( 54.26%) 7.32 ( 79.35%) 5.68 ( 83.98%) 7.01 ( 80.22%) length=32, alignment=7: 35.23 24.14 ( 31.48%) 16.22 ( 53.96%) 7.30 ( 79.28%) 8.76 ( 75.12%) 6.14 ( 82.58%) length=32, alignment=5: 35.16 28.56 ( 18.76%) 16.22 ( 53.87%) 7.30 ( 79.23%) 6.77 ( 80.75%) 9.82 ( 72.07%) length=21, alignment=5: 26.47 27.66 ( -4.49%) 15.04 ( 43.17%) 6.90 ( 73.95%) 4.87 ( 81.60%) 6.04 ( 77.18%) length=64, alignment=0: 66.45 25.16 ( 62.14%) 22.70 ( 65.83%) 12.99 ( 80.44%) 7.47 ( 88.77%) 8.70 ( 86.90%) length=64, alignment=7: 64.75 27.78 ( 57.10%) 22.72 ( 64.91%) 10.85 ( 83.25%) 7.46 ( 88.48%) 8.68 ( 86.60%) length=64, alignment=6: 67.26 28.58 ( 57.51%) 22.70 ( 66.24%) 11.26 ( 83.25%) 9.46 ( 85.94%) 13.90 ( 79.33%) length=42, alignment=6: 73.42 27.97 ( 61.91%) 19.46 ( 73.49%) 8.92 ( 87.84%) 6.49 ( 91.16%) 6.00 ( 91.83%) length=128, alignment=0: 172.07 39.18 ( 77.23%) 35.68 ( 79.26%) 13.02 ( 92.43%) 12.98 ( 92.46%) 9.76 ( 94.33%) length=128, alignment=7: 163.98 43.79 ( 73.30%) 36.03 ( 78.03%) 15.68 ( 90.44%) 11.35 ( 93.08%) 10.51 ( 93.59%) length=128, alignment=7: 185.86 40.27 ( 78.33%) 36.04 ( 80.61%) 13.78 ( 92.58%) 11.35 ( 93.89%) 10.49 ( 94.36%) length=85, alignment=7: 121.61 55.66 ( 54.23%) 32.34 ( 73.40%) 13.88 ( 88.59%) 7.30 ( 94.00%) 8.72 ( 92.83%) length=256, alignment=0: 295.54 66.48 ( 77.50%) 61.63 ( 79.15%) 19.54 ( 93.39%) 12.97 ( 95.61%) 12.45 ( 95.79%) length=256, alignment=7: 308.06 78.92 ( 74.38%) 61.63 ( 80.00%) 22.90 ( 92.57%) 12.97 ( 95.79%) 13.23 ( 95.71%) length=256, alignment=8: 295.32 65.83 ( 77.71%) 61.62 ( 79.13%) 23.19 ( 92.15%) 12.97 ( 95.61%) 13.50 ( 95.43%) length=170, alignment=8: 234.39 48.79 ( 79.18%) 43.79 ( 81.32%) 16.22 ( 93.08%) 13.97 ( 94.04%) 10.48 ( 95.53%) length=512, alignment=0: 563.75 116.89 ( 79.27%) 114.99 ( 79.60%) 62.71 ( 88.88%) 19.58 ( 96.53%) 17.76 ( 96.85%) length=512, alignment=7: 580.53 120.91 ( 79.17%) 114.47 ( 80.28%) 37.75 ( 93.50%) 19.55 ( 96.63%) 18.68 ( 96.78%) length=512, alignment=9: 584.05 128.35 ( 78.02%) 114.74 ( 80.35%) 39.09 ( 93.31%) 19.76 ( 96.62%) 18.71 ( 96.80%) length=341, alignment=9: 405.84 90.87 ( 77.61%) 78.79 ( 80.59%) 28.77 ( 92.91%) 14.60 ( 96.40%) 14.15 ( 96.51%) length=1024, alignment=0: 1143.61 247.03 ( 78.40%) 243.70 ( 78.69%) 75.59 ( 93.39%) 67.02 ( 94.14%) 28.99 ( 97.46%) length=1024, alignment=7: 1124.55 267.87 ( 76.18%) 259.16 ( 76.95%) 64.96 ( 94.22%) 33.05 ( 97.06%) 30.91 ( 97.25%) length=1024, alignment=10: 1459.58 257.79 ( 82.34%) 239.91 ( 83.56%) 65.00 ( 95.55%) 33.10 ( 97.73%) 30.33 ( 97.92%) length=682, alignment=10: 732.89 163.67 ( 77.67%) 170.54 ( 76.73%) 46.48 ( 93.66%) 24.32 ( 96.68%) 21.44 ( 97.07%) length=2048, alignment=0: 2141.96 451.61 ( 78.92%) 448.00 ( 79.08%) 133.24 ( 93.78%) 61.22 ( 97.14%) 80.08 ( 96.26%) length=2048, alignment=7: 2145.05 458.26 ( 78.64%) 449.99 ( 79.02%) 140.19 ( 93.46%) 60.26 ( 97.19%) 51.71 ( 97.59%) length=2048, alignment=11: 2162.61 463.37 ( 78.57%) 448.07 ( 79.28%) 140.29 ( 93.51%) 59.51 ( 97.25%) 51.59 ( 97.61%) length=1365, alignment=11: 1439.74 322.86 ( 77.58%) 310.84 ( 78.41%) 116.08 ( 91.94%) 42.43 ( 97.05%) 36.15 ( 97.49%) length=4096, alignment=0: 4278.68 871.60 ( 79.63%) 865.25 ( 79.78%) 252.50 ( 94.10%) 161.17 ( 96.23%) 94.97 ( 97.78%) length=4096, alignment=7: 4253.01 871.62 ( 79.51%) 864.21 ( 79.68%) 243.90 ( 94.27%) 171.17 ( 95.98%) 95.14 ( 97.76%) length=4096, alignment=12: 4252.18 879.66 ( 79.31%) 863.68 ( 79.69%) 244.26 ( 94.26%) 185.36 ( 95.64%) 93.61 ( 97.80%) length=2730, alignment=12: 2868.22 597.65 ( 79.16%) 586.22 ( 79.56%) 175.09 ( 93.90%) 120.35 ( 95.80%) 101.35 ( 96.47%) length=0, alignment=0: 4.87 8.11 (-66.73%) 6.49 (-33.34%) 5.80 (-19.26%) 5.68 (-16.67%) 6.86 (-40.91%) length=32, alignment=0: 33.82 22.36 ( 33.89%) 17.03 ( 49.66%) 7.30 ( 78.42%) 5.68 ( 83.22%) 7.50 ( 77.83%) length=64, alignment=0: 66.20 26.76 ( 59.58%) 23.22 ( 64.93%) 12.99 ( 80.37%) 7.34 ( 88.92%) 8.44 ( 87.25%) length=96, alignment=0: 130.26 31.62 ( 75.72%) 30.00 ( 76.97%) 11.39 ( 91.26%) 10.54 ( 91.91%) 8.68 ( 93.34%) length=128, alignment=0: 164.66 39.05 ( 76.29%) 35.68 ( 78.33%) 13.07 ( 92.07%) 12.97 ( 92.12%) 9.59 ( 94.18%) length=160, alignment=0: 196.63 45.18 ( 77.02%) 42.16 ( 78.56%) 14.65 ( 92.55%) 10.87 ( 94.47%) 9.31 ( 95.27%) length=192, alignment=0: 225.50 52.71 ( 76.63%) 49.61 ( 78.00%) 16.22 ( 92.81%) 11.36 ( 94.96%) 11.08 ( 95.09%) length=224, alignment=0: 261.08 57.57 ( 77.95%) 55.82 ( 78.62%) 17.84 ( 93.17%) 12.16 ( 95.34%) 11.51 ( 95.59%) length=256, alignment=0: 295.13 65.56 ( 77.79%) 62.59 ( 78.79%) 19.46 ( 93.41%) 13.12 ( 95.56%) 12.33 ( 95.82%) length=288, alignment=0: 325.69 72.16 ( 77.84%) 69.20 ( 78.75%) 21.08 ( 93.53%) 13.94 ( 95.72%) 12.32 ( 96.22%) length=320, alignment=0: 364.18 78.78 ( 78.37%) 75.69 ( 79.21%) 22.71 ( 93.77%) 14.70 ( 95.96%) 14.46 ( 96.03%) length=352, alignment=0: 391.40 84.87 ( 78.32%) 82.15 ( 79.01%) 24.50 ( 93.74%) 15.62 ( 96.01%) 14.27 ( 96.35%) length=384, alignment=0: 428.50 91.43 ( 78.66%) 88.70 ( 79.30%) 26.16 ( 93.90%) 17.29 ( 95.97%) 15.04 ( 96.49%) length=416, alignment=0: 457.30 98.23 ( 78.52%) 95.02 ( 79.22%) 27.81 ( 93.92%) 17.22 ( 96.23%) 15.05 ( 96.71%) length=448, alignment=0: 488.38 104.52 ( 78.60%) 101.87 ( 79.14%) 31.22 ( 93.61%) 18.07 ( 96.30%) 16.89 ( 96.54%) length=480, alignment=0: 526.44 109.61 ( 79.18%) 108.11 ( 79.46%) 31.11 ( 94.09%) 18.88 ( 96.41%) 17.10 ( 96.75%) length=512, alignment=0: 556.50 117.29 ( 78.92%) 113.78 ( 79.56%) 62.57 ( 88.76%) 19.88 ( 96.43%) 17.80 ( 96.80%) length=576, alignment=0: 622.17 152.93 ( 75.42%) 127.58 ( 79.49%) 39.34 ( 93.68%) 21.31 ( 96.58%) 19.99 ( 96.79%) length=640, alignment=0: 691.01 142.56 ( 79.37%) 161.78 ( 76.59%) 39.20 ( 94.33%) 22.98 ( 96.67%) 20.13 ( 97.09%) length=704, alignment=0: 756.90 156.31 ( 79.35%) 176.19 ( 76.72%) 45.03 ( 94.05%) 24.82 ( 96.72%) 22.33 ( 97.05%) length=768, alignment=0: 826.23 193.17 ( 76.62%) 188.41 ( 77.20%) 50.81 ( 93.85%) 27.46 ( 96.68%) 23.25 ( 97.19%) length=832, alignment=0: 890.17 204.81 ( 76.99%) 201.61 ( 77.35%) 53.77 ( 93.96%) 27.73 ( 96.88%) 25.06 ( 97.18%) length=896, alignment=0: 959.52 217.89 ( 77.29%) 213.86 ( 77.71%) 57.99 ( 93.96%) 29.53 ( 96.92%) 26.29 ( 97.26%) length=960, alignment=0: 1024.52 231.06 ( 77.45%) 227.05 ( 77.84%) 60.36 ( 94.11%) 32.29 ( 96.85%) 27.94 ( 97.27%) length=1024, alignment=0: 1086.71 244.17 ( 77.53%) 239.87 ( 77.93%) 64.72 ( 94.04%) 72.38 ( 93.34%) 28.72 ( 97.36%) length=1152, alignment=0: 1231.48 270.22 ( 78.06%) 266.47 ( 78.36%) 73.38 ( 94.04%) 40.24 ( 96.73%) 32.42 ( 97.37%) length=1280, alignment=0: 1349.29 295.45 ( 78.10%) 292.69 ( 78.31%) 111.80 ( 91.71%) 42.44 ( 96.85%) 34.59 ( 97.44%) length=1408, alignment=0: 1487.13 322.57 ( 78.31%) 318.18 ( 78.60%) 84.47 ( 94.32%) 44.35 ( 97.02%) 37.31 ( 97.49%) length=1536, alignment=0: 1623.52 347.98 ( 78.57%) 344.24 ( 78.80%) 108.31 ( 93.33%) 49.82 ( 96.93%) 39.94 ( 97.54%) length=1664, alignment=0: 1748.88 373.80 ( 78.63%) 370.03 ( 78.84%) 118.76 ( 93.21%) 52.89 ( 96.98%) 42.93 ( 97.55%) length=1792, alignment=0: 1886.22 399.59 ( 78.82%) 397.39 ( 78.93%) 127.32 ( 93.25%) 53.64 ( 97.16%) 45.39 ( 97.59%) length=1920, alignment=0: 2018.37 425.98 ( 78.89%) 422.31 ( 79.08%) 126.70 ( 93.72%) 57.08 ( 97.17%) 48.12 ( 97.62%) length=2048, alignment=0: 2167.09 451.70 ( 79.16%) 447.70 ( 79.34%) 141.68 ( 93.46%) 61.63 ( 97.16%) 79.06 ( 96.35%) length=2304, alignment=0: 2422.03 503.63 ( 79.21%) 502.23 ( 79.26%) 149.62 ( 93.82%) 73.10 ( 96.98%) 56.97 ( 97.65%) length=2560, alignment=0: 2678.68 556.84 ( 79.21%) 553.24 ( 79.35%) 161.06 ( 93.99%) 127.74 ( 95.23%) 58.81 ( 97.80%) length=2816, alignment=0: 2941.95 608.70 ( 79.31%) 604.03 ( 79.47%) 171.85 ( 94.16%) 87.11 ( 97.04%) 67.08 ( 97.72%) length=3072, alignment=0: 3229.89 660.14 ( 79.56%) 659.19 ( 79.59%) 183.85 ( 94.31%) 140.25 ( 95.66%) 73.01 ( 97.74%) length=3328, alignment=0: 3496.08 713.05 ( 79.60%) 710.00 ( 79.69%) 209.72 ( 94.00%) 138.78 ( 96.03%) 77.81 ( 97.77%) length=3584, alignment=0: 3756.52 766.19 ( 79.60%) 763.94 ( 79.66%) 214.16 ( 94.30%) 146.36 ( 96.10%) 83.43 ( 97.78%) length=3840, alignment=0: 4017.15 817.43 ( 79.65%) 819.77 ( 79.59%) 242.07 ( 93.97%) 164.56 ( 95.90%) 89.72 ( 97.77%) length=4096, alignment=0: 4281.59 867.87 ( 79.73%) 864.71 ( 79.80%) 243.33 ( 94.32%) 173.11 ( 95.96%) 95.65 ( 97.77%) length=4608, alignment=0: 4810.30 977.80 ( 79.67%) 985.03 ( 79.52%) 271.13 ( 94.36%) 190.62 ( 96.04%) 107.82 ( 97.76%) length=5120, alignment=0: 5380.16 1075.77 ( 80.00%) 1071.80 ( 80.08%) 294.27 ( 94.53%) 206.04 ( 96.17%) 141.90 ( 97.36%) length=5632, alignment=0: 5925.70 1195.61 ( 79.82%) 1193.68 ( 79.86%) 323.42 ( 94.54%) 223.55 ( 96.23%) 125.28 ( 97.89%) length=6144, alignment=0: 6402.20 1285.52 ( 79.92%) 1281.04 ( 79.99%) 342.68 ( 94.65%) 234.84 ( 96.33%) 167.01 ( 97.39%) length=6656, alignment=0: 6997.01 1387.32 ( 80.17%) 1384.21 ( 80.22%) 365.93 ( 94.77%) 269.89 ( 96.14%) 176.40 ( 97.48%) length=7168, alignment=0: 7454.76 1492.10 ( 79.98%) 1488.45 ( 80.03%) 391.92 ( 94.74%) 280.81 ( 96.23%) 187.73 ( 97.48%) length=7680, alignment=0: 8163.34 1608.43 ( 80.30%) 1615.98 ( 80.20%) 460.03 ( 94.36%) 299.86 ( 96.33%) 201.40 ( 97.53%) ```	2025-08-19 15:18:04 -07:00
Andre Kuhlenschmidt	d4673febb4	[flang][openacc] fix unguarded dereference of type pointer (#153606 ) The added test used to cause a segfault, now it doesn't.	2025-08-19 15:10:09 -07:00
Maksim Panchenko	0d9b9d1eef	[BOLT] Keep X86 HLT instruction as a terminator in user mode (#154402 ) This is a follow-up to #150963. X86 HLT instruction may appear in the user-level code, in which case we should treat it as a terminator. Handle it as a non-terminator in the Linux kernel mode.	2025-08-19 14:41:13 -07:00
Matheus Izvekov	ec6389d0c0	[clang] fix runtime check for NNS transform (#154418 )	2025-08-19 18:32:17 -03:00
Valentin Clement (バレンタインクレメン)	af8a149546	[flang][cuda] Add utility function cuf::hasDataAttr (#154422 )	2025-08-19 14:25:32 -07:00
barsolo2000	402109e1c4	[LLDB] added getName method in SBModule (#150331 ) added getName method in SBModule.h and .cpp in order to get the name of the module from m_object_name. --------- Co-authored-by: Bar Soloveychik <barsolo@fb.com>	2025-08-19 14:24:10 -07:00
Harshil Solanki	6127e46ff8	[clang-tidy][docs] Add documentation for Clang-Tidy Automation (#153166 ) Resolves #90772 --------- Co-authored-by: Baranov Victor <bar.victor.2002@gmail.com>	2025-08-19 23:28:26 +03:00
Shilei Tian	b170f17861	[AMDGPU] Add support for safe bfloat16 fdiv on targets with bf16 trans instructions (#154373 ) Recent changes introduced custom lowering for bf16 fdiv on targets that support bf16 trans instructions, but only covered the unsafe version. This PR extends that support to the safe variant. For the safe version, the op is lowered by converting to float, performing the div in float, and converting the result back to bf16. This matches the behavior on targets that don't support bf16 trans instructions. Fixes SWDEV-550381.	2025-08-19 16:03:45 -04:00
Arthur Eubanks	b35b6297fd	[gn build] Build win asan libraries against MutliThreadedDLL win asan only supports /MD. Without this, the asan runtime cannot intercept functions.	2025-08-19 19:48:32 +00:00
Ian Wood	961b052e98	[mlir][tensor][NFC] Refactor common methods for bubbling extract_slice op (#153675 ) Exposes the `tensor.extract_slice` reshaping logic in `BubbleUpExpandShapeThroughExtractSlice` and `BubbleUpCollapseShapeThroughExtractSlice` through two corresponding utility functions. These compute the offsets/sizes/strides of an extract slice after either collapsing or expanding. This should also make it easier to implement the two other bubbling cases: (1) the `collapse_shape` is a consumer or (2) the `expand_shape` is a consumer. --------- Signed-off-by: Ian Wood <ianwood@u.northwestern.edu>	2025-08-19 19:31:30 +00:00
Yifei Xu	6609d5fb0c	Update BUILD.bazel	2025-08-19 14:29:58 -05:00
Alex MacLean	d494eb0fa3	[NVPTX] Skip numbering unreferenced virtual registers (readability) (#154391 ) When assigning numbers to registers, skip any with neither uses nor defs. This is will not have any impact at all on the final SASS but it makes for slightly more readable PTX. This change should also ensure that future minor changes are less likely to cause noisy diffs in register numbering.	2025-08-19 12:27:46 -07:00
Kazu Hirata	13faa15ee1	[lldb] Replace SmallSet with SmallPtrSet (NFC) (#154366 ) This patch replaces SmallSet<T , N> with SmallPtrSet<T , N>. Note that SmallSet.h "redirects" SmallSet to SmallPtrSet for pointer element types: template <typename PointeeType, unsigned N> class SmallSet<PointeeType, N> : public SmallPtrSet<PointeeType, N> {}; We only have 10 instances that rely on this "redirection". Since the redirection doesn't improve readability, this patch replaces SmallSet with SmallPtrSet for pointer element types. I'm planning to remove the redirection eventually.	2025-08-19 12:20:43 -07:00
Andy Kaylor	6747139bc2	[CIR] Use zero-initializer for partial array fills (#154161 ) If an array initializer list leaves eight or more elements that require zero fill, we had been generating an individual zero element for every one of them. This change instead follows the behavior of classic codegen, which creates a constant structure with the specified elements followed by a zero-initializer for the trailing zeros.	2025-08-19 12:14:05 -07:00
David Tellenbach	0542355147	[AArch64] Fix zero-register copying with zero-cycle moves (#154362 ) Fix incorrect super-register lookup when copying from $wzr on subtargets that lack zero-cycle zeroing but support 64-bit zero-cycle moves. When copying from $wzr, we used the wrong register class to lookup the super-register, causing $w0 = COPY $wzr to get expanded as $x0 = ORRXrr $xzr, undef $noreg, implicit $wzr, rather than the correct $x0 = ORRXrr $xzr, undef $xzr, implicit $wzr.	2025-08-19 21:07:16 +02:00
Florian Hahn	23ea79de61	[LV] Add more tests for costs of predicated udivs and calls. Adds missing test coverage for the cost model. Also reduce the size of check lines a bit, by using a common prefix and filtering out after scalar.ph.	2025-08-19 20:04:31 +01:00
Baranov Victor	ed26993976	[clang-tidy] Improve "-quiet" option by suppressing "xxx warnings generated" (#154012 ) Before this change, `-quiet` mode in clang-tidy generated meaningless messages `xxx warnings generated` in output: ```cpp // main.cpp #include <iostream> int main() { std::cout << 42; } ``` ```console > clang-tidy -checks='-,readability-magic-numbers' -quiet main.cpp 82 warnings generated. main.cpp:4:16: warning: 42 is a magic number; consider replacing it with a named constant [readability-magic-numbers] 4 \| std::cout << 42; \| ^ ``` As you can see, `82 warnings generated.` does not say much `quiet` mode, this patch removes this message completely: ```console > ./build/bin/clang-tidy -p build -checks='-,readability-magic-numbers' -quiet main.cpp main.cpp:4:16: warning: 42 is a magic number; consider replacing it with a named constant [readability-magic-numbers] 4 \| std::cout << 42; \| ^ ``` In contrast, when running without `quiet`, It gives some meaningful information because we know how many messages were suppressed thus calculating total messages count: ```console > clang-tidy -checks='-,readability-magic-numbers' main.cpp 82 warnings generated. main.cpp:4:16: warning: 42 is a magic number; consider replacing it with a named constant [readability-magic-numbers] 4 \| std::cout << 42; \| ^ Suppressed 81 warnings (81 in non-user code). Use -header-filter=. to display errors from all non-system headers. Use -system-headers to display errors from system headers as well. ``` Fixes #47042	2025-08-19 21:47:51 +03:00
Tomer Shafir	ffddf33beb	[AArch64] Remove wrong processor feature (#151289 ) `fmov dX, dY` is not a preferred instruction. Previously introduced by: https://github.com/llvm/llvm-project/pull/144152	2025-08-19 21:20:09 +03:00
Jonas Devlieghere	8bd9897815	[lldb] Create sections for Wasm segments (#153634 ) This is a continuation of #153494. In a WebAssembly file, the "name" section contains names for the segments in the data section (WASM_NAMES_DATA_SEGMENT). We already parse these as symbols, and with this PR, we now also create sub-sections for each of the segments.	2025-08-19 11:07:49 -07:00
Andy Kaylor	0f34360915	[CIR] Fix var_arg.c test (again) (#154393 ) We had competing fixes for this that weren't compatible. This should pass now.	2025-08-19 11:04:56 -07:00
Thurston Dang	4e9dbb519b	[compiler-rt] Fix-forward "[compiler-rt] Fix frame numbering for unparsable frames. #148278 " (#154397 ) Reason: buildbot failure (https://lab.llvm.org/buildbot/#/builders/51/builds/21874/steps/9/logs/stdio) Fix by restricting test to x86-64	2025-08-19 10:50:11 -07:00
Jonas Devlieghere	4cecbeed4f	[lldb] Support error variations in TestProcessCrashInfo.py (#154202 ) The error message emitted by libmalloc changed in macOS 26. Update the test to support both.	2025-08-19 10:49:23 -07:00
Matheus Izvekov	fdcc1b3588	[clang] NFC: fix and add some comments missing from #147835 (#154395 )	2025-08-19 14:47:11 -03:00
Benjamin Maxwell	86e23af6be	[AArch64][SME] Temporarily mark test as XFAIL (#154384 ) Marking this as XFAIL until https://github.com/llvm/llvm-project/pull/154325 lands	2025-08-19 17:33:08 +00:00
Matheus Izvekov	5485c7021a	[clang] fix redecl chain assumption when checking linkage consistency (#153996 ) In C++, it can be assumed the same linkage will be computed for all redeclarations of an entity, and we have assertions to check this. However, the linkage for a declaration can be requested in the middle of deserealization, and at this point the redecl chain is not well formed, as computation of the most recent declaration is deferred. This patch makes that assertion work even in such conditions. This fixes a regression introduced in https://github.com/llvm/llvm-project/pull/147835, which was never released, so there are no release notes for this. Fixes #153933	2025-08-19 14:32:08 -03:00
Oleksandr T.	29fa5b72e9	Revert "[Clang] improve -Wstring-concatenation to warn on every missing comma in initializer lists" (#154369 ) Revert llvm/llvm-project#154018 changes due to excessive _false positives_. The warning caused multiple benign reports in large codebases (e.g. _Linux kernel_, _Fuchsia_, _tcpdump_). Since many of these concatenations are intentional and follow project style rules, the diagnostic introduced more false positives than value. This will be revisited as a potential `clang-tidy` check instead.	2025-08-19 20:30:16 +03:00
Michael Jones	d2b2d6ff10	[libc] Fix missing close at the end of file test (#154392 ) The test added by #150802 was missing a close at the end.	2025-08-19 10:26:05 -07:00
Michael Buch	3f3bc4853e	[clang][test][DebugInfo] Move debug-info tests from CodeGen to DebugInfo directory (#154311 ) This patch works towards consolidating all Clang debug-info into the `clang/test/DebugInfo` directory (https://discourse.llvm.org/t/clang-test-location-of-clang-debug-info-tests/87958). Here we move only the `clang/test/CodeGen` tests. The list of files i came up with is: 1. searched for anything with `debug-info` in the filename 2. searched for occurrences of `debug-info-kind` in the tests I created a couple of subdirectories in `clang/test/DebugInfo` where I thought it made sense (mostly when the tests were target-specific). There's a couple of tests in `clang/test/CodeGen` that still set `-debug-info-kind`. They probably don't need to do that, but I'm not changing that as part of this PR.	2025-08-19 18:25:13 +01:00
Jesse Schwartzentruber	b1306da6ac	[compiler-rt] Fix frame numbering for unparsable frames. (#148278 ) This can happen when JIT code is run, and we can't symbolize those frames, but they should remain numbered in the stack. An example spidermonkey trace: ``` #0 0x564ac90fb80f (/builds/worker/dist/bin/js+0x240e80f) (BuildId: 5d053c76aad4cfbd08259f8832e7ac78bbeeab58) #1 0x564ac9223a64 (/builds/worker/dist/bin/js+0x2536a64) (BuildId: 5d053c76aad4cfbd08259f8832e7ac78bbeeab58) #2 0x564ac922316f (/builds/worker/dist/bin/js+0x253616f) (BuildId: 5d053c76aad4cfbd08259f8832e7ac78bbeeab58) #3 0x564ac9eac032 (/builds/worker/dist/bin/js+0x31bf032) (BuildId: 5d053c76aad4cfbd08259f8832e7ac78bbeeab58) #4 0x0dec477ca22e (<unknown module>) ``` Without this change, the following symbolization is output: ``` #0 0x55a6d72f980f in MOZ_CrashSequence /builds/worker/workspace/obj-build/dist/include/mozilla/Assertions.h:248:3 #1 0x55a6d72f980f in Crash(JSContext, unsigned int, JS::Value) /builds/worker/checkouts/gecko/js/src/shell/js.cpp:4223:5 #2 0x55a6d7421a64 in CallJSNative(JSContext, bool ()(JSContext, unsigned int, JS::Value), js::CallReason, JS::CallArgs const&) /builds/worker/checkouts/gecko/js/src/vm/Interpreter.cpp:501:13 #3 0x55a6d742116f in js::InternalCallOrConstruct(JSContext, JS::CallArgs const&, js::MaybeConstruct, js::CallReason) /builds/worker/checkouts/gecko/js/src/vm/Interpreter.cpp:597:12 #4 0x55a6d80aa032 in js::jit::DoCallFallback(JSContext, js::jit::BaselineFrame, js::jit::ICFallbackStub, unsigned int, JS::Value, JS::MutableHandle<JS::Value>) /builds/worker/checkouts/gecko/js/src/jit/BaselineIC.cpp:1705:10 #4 0x2c803bd8f22e (<unknown module>) ``` The last frame has a duplicate number. With this change the numbering is correct: ``` #0 0x5620c58ec80f in MOZ_CrashSequence /builds/worker/workspace/obj-build/dist/include/mozilla/Assertions.h:248:3 #1 0x5620c58ec80f in Crash(JSContext, unsigned int, JS::Value) /builds/worker/checkouts/gecko/js/src/shell/js.cpp:4223:5 #2 0x5620c5a14a64 in CallJSNative(JSContext, bool ()(JSContext, unsigned int, JS::Value), js::CallReason, JS::CallArgs const&) /builds/worker/checkouts/gecko/js/src/vm/Interpreter.cpp:501:13 #3 0x5620c5a1416f in js::InternalCallOrConstruct(JSContext, JS::CallArgs const&, js::MaybeConstruct, js::CallReason) /builds/worker/checkouts/gecko/js/src/vm/Interpreter.cpp:597:12 #4 0x5620c669d032 in js::jit::DoCallFallback(JSContext, js::jit::BaselineFrame, js::jit::ICFallbackStub, unsigned int, JS::Value, JS::MutableHandle<JS::Value>) /builds/worker/checkouts/gecko/js/src/jit/BaselineIC.cpp:1705:10 #5 0x349f24c7022e (<unknown module>) ```	2025-08-19 10:13:41 -07:00
codefaber	fd7f69bfe7	[libc] Fix copy/paste error in file.cpp (#150802 ) Fix using wrong variable due to copy/paste error. --------- Co-authored-by: codefaber <codefaber>	2025-08-19 10:05:38 -07:00
Younan Zhang	0732693d81	Revert "[Clang] Only remove lambda scope after computing evaluation context" (#154382 ) Revert due to breakage as reported in https://github.com/llvm/llvm-project/pull/154106#discussion_r2285824084 Reverts llvm/llvm-project#154106	2025-08-20 00:56:56 +08:00
Aaron Ballman	ae434cdd0b	[C] Fix issue with -Wimplicit-void-ptr-cast (#154351 ) The changes from https://github.com/llvm/llvm-project/pull/136855 missed a change with atomic assignment constraints. This fixes a bug where we'd accidentally drop a non-atomic-to-atomic conversion step. Fixes #154157 co-authored-by: @ahatanak	2025-08-19 12:55:03 -04:00
Renato Golin	5cc8c92268	[NFC][MLIR] Document better linalg morphism (#154313 )	2025-08-19 17:51:03 +01:00
Krishna Pandey	550dbec03a	[libc][math][c++23] Add {,u}fromfp{,x}bf16 math functions (#153992 ) This PR adds the following basic math functions for BFloat16 type along with the tests: - fromfpbf16 - fromfpxbf16 - ufromfpbf16 - ufromfpxbf16 --------- Signed-off-by: Krishna Pandey <kpandey81930@gmail.com>	2025-08-19 22:19:03 +05:30
Sam Clegg	28f2fb24e1	Revert "[lld][WebAssembly] Do not relocate ABSOLUTE symbols" (#154371 ) Reverts llvm/llvm-project#153763 This caused a bunch of failures on the emscripten waterfall, specifically most of the lsan tests started failing.	2025-08-19 09:46:10 -07:00
Min-Yih Hsu	f82054eaa2	[RISCV] Handle more cases when combining (vfmv.s.f (extract_subvector X, 0)) (#154175 ) Previously, we fold `(vfmv.s.f (extract_subvector X, 0))` into X when X's type is the same as `vfmv.s.f`'s result type. This patch generalizes it by folding it into insert_subvector when X is narrower and extract_subvector when X is wider. Co-authored-by: Craig Topper <craig.topper@sifive.com>	2025-08-19 09:43:55 -07:00
Frederik Harwath	f1e56ac032	Revert "[Clang] Take libstdc++ into account during GCC detection" (#154368 ) Reverts llvm/llvm-project#145056	2025-08-19 16:35:58 +00:00
Helena Kotas	e04fedadba	[HLSL] Tests for local resource arrays (#153257 ) Add tests for local arrays of resources. Closes #145425	2025-08-19 09:22:27 -07:00
Joseph Huber	e2777af84b	[LLVM] Add missing dependency for offload-wrapper tool	2025-08-19 11:19:35 -05:00
Drew Kersnar	069ad2353c	[NVPTXLowerArgs] Add align attribute to return value of addrspace.wrap intrinsic (#153889 ) If alignment inference happens after NVPTXLowerArgs these addrspace wrap intrinsics can prevent computeKnownBits from deriving alignment of loads/stores from parameters. To solve this, we can insert an alignment annotation on the generated intrinsic so that computeKnownBits does not need to traverse through it to find the alignment.	2025-08-19 11:13:57 -05:00
Kazu Hirata	67ca5dad16	[polly] Replace SmallSet with SmallPtrSet (NFC) (#154367 ) This patch replaces SmallSet<T , N> with SmallPtrSet<T , N>. Note that SmallSet.h "redirects" SmallSet to SmallPtrSet for pointer element types: template <typename PointeeType, unsigned N> class SmallSet<PointeeType, N> : public SmallPtrSet<PointeeType, N> {}; We only have 10 instances that rely on this "redirection". Since the redirection doesn't improve readability, this patch replaces SmallSet with SmallPtrSet for pointer element types. I'm planning to remove the redirection eventually.	2025-08-19 09:09:11 -07:00

1 2 3 4 5 ...

549210 Commits