llvm-project

Author	SHA1	Message	Date
Peter Klausler	fc97d2e68b	[flang] Add UNSIGNED (#113504 ) Implement the UNSIGNED extension type and operations under control of a language feature flag (-funsigned). This is nearly identical to the UNSIGNED feature that has been available in Sun Fortran for years, and now implemented in GNU Fortran for gfortran 15, and proposed for ISO standardization in J3/24-116.txt. See the new documentation for details; but in short, this is C's unsigned type, with guaranteed modular arithmetic for +, -, and *, and the related transformational intrinsic functions SUM & al.	2024-12-18 07:02:37 -08:00
Peter Klausler	3f594741cf	[flang] Fix implementation of Kahan summation (#116897 ) In the runtime's implementation of floating-point SUM, the implementation of Kahan's algorithm for increased precision is incorrect. The running correction factor should be subtracted from each new data item, not added to it. This fix ensures that the sum of 100M random default real values between 0. and 1. is close to 5.E7. See https://en.wikipedia.org/wiki/Kahan_summation_algorithm.	2024-11-21 10:47:21 -08:00
Slava Zakharin	fc51c7f0cc	[flang][runtime] Disable LDBL_MANT_DIG == 113 for the offload builds. (#109339 ) When compiling on aarch64 some `LDBL_MANT_DIG == 113` entries end up trying to use `complex<long double>` for which there are no certain specializations in `libcudacxx`. This change-set includes a clean-up for `LDBL_MANT_DIG == 113` usage, which is replaced with `HAS_LDBL128` that is set in `float128.h`.	2024-09-19 15:45:45 -07:00
Slava Zakharin	104f3c1806	Reland "[flang][runtime] Use cuda::std::complex in F18 runtime CUDA build. (#109078 )" (#109207 ) `std::complex` operators do not work for the CUDA device compilation of F18 runtime. This change makes use of `cuda::std::complex` from `libcudacxx`. `cuda::std::complex` does not have specializations for `long double`, so the change is accompanied with a clean-up for `long double` usage. Additional change on top of #109078 is to use `cuda::std::complex` only for the device compilation, otherwise the host compilation fails because `libcudacxx` may not support `long double` specialization at all (depending on the compiler).	2024-09-18 17:41:33 -07:00
Slava Zakharin	36192fdfb9	Revert "[flang][runtime] Use cuda::std::complex in F18 runtime CUDA build." (#109173 ) Reverts llvm/llvm-project#109078	2024-09-18 11:22:31 -07:00
Slava Zakharin	be187a6812	[flang][runtime] Use cuda::std::complex in F18 runtime CUDA build. (#109078 ) `std::complex` operators do not work for the CUDA device compilation of F18 runtime. This change makes use of `cuda::std::complex` from `libcudacxx`. `cuda::std::complex` does not have specializations for `long double`, so the change is accompanied with a clean-up for `long double` usage.	2024-09-18 10:59:05 -07:00
Peter Klausler	ce7700e29d	[flang][runtime] Address PRODUCT numeric discrepancy, folding vs runtime (#90125 ) Ensure that the runtime implementations of floating-point reductions use intermediate results of the same precision as the operands, so that results match those from constant folding. (SUM reduction uses Kahan summation in both cases.)	2024-05-02 05:43:54 -07:00
Slava Zakharin	d699d9d609	[flang][runtime] Support SUM/PRODUCT/DOT_PRODUCT reductions for REAL(16). (#83169 ) The reductions implementations rely on trivial operations that are supported by the build compiler runtime, so they can be enabled whenever the build compiler provides 128-bit float support. std::conj used by DOT_PRODUCT is a template implementation in most environments, so it should not introduce a dependency on any 128-bit float support library. I am not goind to test it in all the build environments before merging. If it fails for someone, I will deal with it.	2024-02-27 15:59:25 -08:00
Slava Zakharin	76facde32c	[flang][runtime] Enable more APIs in the offload build. (#76486 )	2023-12-28 13:50:43 -08:00
Slava Zakharin	b4b23ff7f8	[flang][runtime] Enable more APIs in the offload build. (#75996 ) This patch enables more numeric (mod, sum, matmul, etc.) APIs, and some others. I added new macros to disable warnings about using C++ STD methods like operators of std::complex, which do not have __device__ attribute. This may probably result in unresolved references, if the header files implementation relies on libstdc++. I will need to follow up on this.	2023-12-20 11:52:51 -08:00
David Spickett	ffc67bb360	Revert "[Flang] [FlangRT] Introduce FlangRT project as solution to Flang's runtime LLVM integration" This reverts commit 6403287eff71a3d6f6c862346d6ed3f0f000eb70. This is failing on all but 1 of Linaro's flang builders. CMake Error at /home/tcwg-buildbot/worker/clang-aarch64-full-2stage/llvm/flang-rt/unittests/CMakeLists.txt:37 (message): Target llvm_gtest not found.	2023-10-02 09:02:05 +00:00
Paul Scoropan	6403287eff	[Flang] [FlangRT] Introduce FlangRT project as solution to Flang's runtime LLVM integration See discourse thread https://discourse.llvm.org/t/rfc-support-cmake-option-to-control-link-type-built-for-flang-runtime-libraries/71602/18 for full details. Flang-rt is the new library target for the flang runtime libraries. It builds the Flang-rt library (which contains the sources of FortranRuntime and FortranDecimal) and the Fortran_main library. See documentation in this patch for detailed description (flang-rt/docs/GettingStarted.md). This patch aims to: - integrate Flang's runtime into existing llvm infrasturcture so that Flang's runtime can be built similarly to other runtimes via the runtimes target or via the llvm target as an enabled runtime - decouple the FortranDecimal library sources that were used by both compiler and runtime so that different build configurations can be applied for compiler vs runtime - add support for running flang-rt testsuites, which were created by migrating relevant tests from `flang/test` and `flang/unittest` to `flang-rt/test` and `flang-rt/unittest`, using a new `check-flang-rt` target. - provide documentation on how to build and use the new FlangRT runtime Reviewed By: DanielCChen Differential Revision: https://reviews.llvm.org/D154869	2023-09-30 12:35:33 -04:00
Peter Steinfeld	478e0b5860	[flang] Quadmath 128 bit floating point intrinsics This update allows constant folding for many 128 bit floating point intrinsics through the library quadmath, which is only available on some platforms. Differential Revision: https://reviews.llvm.org/D156435	2023-07-31 11:12:29 -07:00
Peter Klausler	4daa33f6d1	[flang][runtime] Use __float128 where possible & needed in runtime On targets with __float128 available and distinct from long double, use it to support more kind=16 entry points. This affects mostly x86-64 targets. This means that more runtime entry points are defined for lowering to call. Delete Common/long-double.h and its LONG_DOUBLE macro in favor of testing the standard macro LDBL_MANT_DIG. Differential Revision: https://reviews.llvm.org/D127025	2022-06-05 09:16:57 -07:00
Peter Klausler	830c0b9023	[flang] Move runtime API headers to flang/include/flang/Runtime Move the closure of the subset of flang/runtime/*.h header files that are referenced by source files outside flang/runtime (apart from unit tests) into a new directory (flang/include/flang/Runtime) so that relative include paths into ../runtime need not be used. flang/runtime/pgmath.h.inc is moved to flang/include/flang/Evaluate; it's not used by the runtime. Differential Revision: https://reviews.llvm.org/D109107	2021-09-03 11:08:34 -07:00
peter klausler	c375ec8613	[flang] Use a "double-double" accumulator in SUM Use a "double-double" accumulator, a/k/a Kahan summation, in the SUM intrinsic in the runtime for real & complex. This seems to be the best-recommended technique for reducing error, as opposed to the initial implementation of SUM's distinct accumulators for positive and negative items. Differential Revision: https://reviews.llvm.org/D104338	2021-06-16 14:29:39 -07:00
peter klausler	beb5ac8b25	[flang] (NFC) Break up flang/runtime/reduction.cpp The single source file reduction.cpp is a little large in terms of both source lines and generated text bytes, so split SUM, PRODUCT, FINDLOC, and MAXLOC/MAXVAL/MINLOC/MINVAL off into their own C++ source files that share a set of implementation function templates now in a common header. Differential Revision: https://reviews.llvm.org/D101111	2021-04-22 15:24:10 -07:00

17 Commits