llvm-project

Author	SHA1	Message	Date
Guillaume Chatelet	5e32765c15	[libc] Improve memcmp latency and codegen This is based on ideas from @nafi to: - use a branchless version of 'cmp' for 'uint32_t', - completely resolve the lexicographic comparison through vector operations when wide types are available. We also get rid of byte reloads and serializing '__builtin_ctzll'. I did not include the suggestion to replace comparisons of 'uint16_t' with two 'uint8_t' as it did not seem to help the codegen. This can be revisited in sub-sequent patches. The code been rewritten to reduce nested function calls, making the job of the inliner easier and preventing harmful code duplication. Reviewed By: nafi3000 Differential Revision: https://reviews.llvm.org/D148717	2023-06-12 13:47:16 +00:00
Tue Ly	a982431295	[libc] Add platform independent floating point rounding mode checks. Many math functions need to check for floating point rounding modes to return correct values. Currently most of them use the internal implementation of `fegetround`, which is platform-dependent and blocking math functions to be enabled on platforms with unimplemented `fegetround`. In this change, we add platform independent rounding mode checks and switching math functions to use them instead. https://github.com/llvm/llvm-project/issues/63016 Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D152280	2023-06-12 09:36:41 -04:00
Guillaume Chatelet	1ec995cc1c	Revert D148717 "[libc] Improve memcmp latency and codegen" This broke aarch64 debug buildbot https://lab.llvm.org/buildbot/#/builders/223/builds/21703 This reverts commit bd4f978754758d5ef29d1f10370f45362da3de37.	2023-06-12 08:32:00 +00:00
Guillaume Chatelet	bd4f978754	[libc] Improve memcmp latency and codegen This is based on ideas from @nafi to: - use a branchless version of 'cmp' for 'uint32_t', - completely resolve the lexicographic comparison through vector operations when wide types are available. We also get rid of byte reloads and serializing '__builtin_ctzll'. I did not include the suggestion to replace comparisons of 'uint16_t' with two 'uint8_t' as it did not seem to help the codegen. This can be revisited in sub-sequent patches. The code been rewritten to reduce nested function calls, making the job of the inliner easier and preventing harmful code duplication. Reviewed By: nafi3000 Differential Revision: https://reviews.llvm.org/D148717	2023-06-12 07:56:23 +00:00
Guillaume Chatelet	8e44b849da	[libc][NFC] Introduce a Location object for consistent failure logging This is just an implementation detail. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D152532	2023-06-10 06:58:15 +00:00
Guillaume Chatelet	0fd0b74289	[libc][NFC] Clean up matchers namespace This is a follow up to https://reviews.llvm.org/D152503 Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D152533	2023-06-10 06:55:16 +00:00
Tue Ly	37458f6693	[libc][math] Move str method from FPBits class to testing utils. str method of FPBits class is only used for pretty printing its objects in tests. It brings cpp::string dependency to FPBits class, which is not ideal for embedded use case. We move str method to a free function in test utils and remove this dependency of FPBits class. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D152607	2023-06-10 02:50:58 -04:00
Joseph Huber	168fa31816	[libc] Fix some tests on NVPTX due to insufficient stack size A few of these tests were disabled due to failing on NVPTX. After looking into it the vast majority of these cases were due to insufficient stack memory. This can be worked around by increasing the stack size in the loader or by reducing the memory usage in the case of large string constants. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D152583	2023-06-09 16:42:14 -05:00
Guillaume Chatelet	fd2c74c8ed	[libc][NFC] Simplify LibcTest and trim down string allocations This is a bit of cleanup before working on logging via stream operator (i.e., `EXPECT_XXX() << ...`). Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D152503	2023-06-09 09:36:18 +00:00
Michael Jones	4b8543f33f	[libc][NFC] fix constants not marked constexpr The constants in the ryu constant tables were not marked constexpr, only static const. This caused problems when compiling with GCC. Differential Revision: https://reviews.llvm.org/D152487	2023-06-08 16:43:01 -07:00
Michael Jones	47fd67ec34	[libc][NFC] land long double table for printf The Mega Table that printf uses for long doubles with some flags is too large for the linters, and so has been split out from the main patch. The main patch: https://reviews.llvm.org/D150399 Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D152470	2023-06-08 16:14:56 -07:00
Joseph Huber	63710fd529	[libc] Disable uint test on NVPTX GPUs This test started failing on Nvidia, we need to disable it to keep the bot green until we can investigate the root cause. Differential Revision: https://reviews.llvm.org/D152481	2023-06-08 17:54:27 -05:00
Michael Jones	5df182f121	[libc] disable printf Lf tests for float128 plats The results for the %Lf tests were calculated on 80 bit long double systems, meaning the results are not necessarily accurate for float128 systems. In future these will be fixed, but for the moment I'm just turning them off. Differential Revision: https://reviews.llvm.org/D152471	2023-06-08 14:35:51 -07:00
Michael Jones	688b9730d1	[libc] add options to printf decimal floats This patch adds three options for printf decimal long doubles, and these can also apply to doubles. 1. Use a giant table which is fast and accurate, but takes up ~5MB). 2. Use dyadic floats for approximations, which only gives ~50 digits of accuracy but is very fast. 3. Use large integers for approximations, which is accurate but very slow. Reviewed By: sivachandra, lntue Differential Revision: https://reviews.llvm.org/D150399	2023-06-08 14:23:15 -07:00
Tue Ly	4ade88a453	[libc] Update FMA detection macro for x86-64 targets. To generate fma instructions for x86-64 targets, we need both -mavx2 and -mfma. Reviewed By: brooksmoses Differential Revision: https://reviews.llvm.org/D152410	2023-06-07 19:55:18 -04:00
Leonard Chan	8f360c3560	[libc] Temporarily suppress -fsanitize=function for qsort comparator Recent upstream changes to -fsanitize=function find more instances of function type mismatches. One case is with the comparator passed to this class. Libraries like boringssl will tend to pass comparators that take pointers to varying types while this comparator expects to accept const void pointers. Ideally those tools would pass a function that strictly accepts const void*s to avoid UB, or we'd have something like qsort_r/s. Differential Revision: https://reviews.llvm.org/D152322	2023-06-07 18:46:15 +00:00
Joseph Huber	b05c63fba6	[libc] Silence warning about returning from noreturn function The `exit` entrypoint calls into `quick_exit` which is marked noreturn in some cases. This will cause errors because we then have control flow externally. This warning can be silenced by using a `__builtin_unreachable` instruction accordingly. Reviewed By: sivachandra, lntue Differential Revision: https://reviews.llvm.org/D152323	2023-06-07 06:23:00 -05:00
Tue Ly	c0a751ae3d	[libc] Fix undefined behavior of left shifting signed integer in exp2f.cpp. Fix undefined behavior of left shifting signed integer in exp2f.cpp. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D152336	2023-06-07 01:15:18 -04:00
Joseph Huber	8aad5012cc	[libc][Docs] Add support for the printing functions	2023-06-06 14:33:08 -05:00
Joseph Huber	27a80fc946	[libc] Replace use of `asm` in the GPU code with LIBC_INLINE_ASM We should more consistently use inline assembly using the LIBC wrappers. It's much safer to mark all of these volatile as well. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D152294	2023-06-06 14:24:52 -05:00
Tue Ly	b95ed8b6d9	[libc] Remove operator T from cpp::expected. The libc's equivalent of std::expected has a non-standard and non-explicit operator T - https://github.com/llvm/llvm-project/issues/62738 Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D152270	2023-06-06 13:57:44 -04:00
Aiden Grossman	14a06b806e	[CMake][libc] Don't put archive in build/lib/<target triple> by default ea8f4b98419750c8cc7c60ea43b570adf47b3f78 broke some build configurations because it was enabled by default and some people are using a just built libc/clang/LLVM to work on other projects where having a just built LLVM libc in one of Clang's default include directories can make things unusable. Differential Revision: https://reviews.llvm.org/D152190	2023-06-06 00:43:11 +00:00
Joseph Huber	e6a350df10	[libc] Replace the `PRINT_TO_STDERR` opcode for RPC printing. A previous patch added general support for printing via the RPC interface. we should consolidate this functionality and get rid of the old opcode that was used for simple testing. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D152211	2023-06-05 19:28:30 -05:00
Joseph Huber	a59e1712fa	[libc][obvious] Fix conditional when CUDA is not found If CUDA is not found this string will expand into nothing. We need to surround it with a string otherwise it will cause build failures. Differential Revision: https://reviews.llvm.org/D152209	2023-06-05 18:51:23 -05:00
Joseph Huber	e6c401b5e8	[libc] Add initial support for 'puts' and 'fputs' to the GPU This patch adds the initial support required to support basic priting in `stdio.h` via `puts` and `fputs`. This is done using the existing LLVM C library `File` API. In this sense we can think of the RPC interface as our system call to dump the character string to the file. We carry a `uintptr_t` reference as our native "file descriptor" as it will be used as an opaque reference to the host's version once functions like `fopen` are supported. For some unknown reason the declaration of the `StdIn` variable causes both the AMDGPU and NVPTX backends to crash if I use the `READ` flag. This is not used currently as we only support output now, but it needs to be fixed Reviewed By: sivachandra, lntue Differential Revision: https://reviews.llvm.org/D151282	2023-06-05 17:56:55 -05:00
Joseph Huber	a621308881	[libc] Implement basic `malloc` and `free` support on the GPU This patch adds support for the `malloc` and `free` functions. These currently aren't implemented in-tree so we first add the interface filies. This patch provides the most basic support for a true `malloc` and `free` by using the RPC interface. This is functional, but in the future we will want to implement a more intelligent system and primarily use the RPC interface more as a `brk()` or `sbrk()` interface only called when absolutely necessary. We will need to design an intelligent allocator in the future. The semantics of these memory allocations will need to be checked. I am somewhat iffy on the details. I've heard that HSA can allocate asynchronously which seems to work with my tests at least. CUDA uses an implicit synchronization scheme so we need to use an explicitly separate stream from the one launching the kernel or the default stream. I will need to test the NVPTX case. I would appreciate if anyone more experienced with the implementation details here could chime in for the HSA and CUDA cases. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D151735	2023-06-05 17:56:53 -05:00
Guillaume Chatelet	e49a608511	Revert D148717 "[libc] Improve memcmp latency and codegen" This reverts commit 9ec6ebd3ceabb29482aa18a64b943788b65223dc. The patch broke RISCV and aarch64 builtbots.	2023-06-05 09:50:30 +00:00
Guillaume Chatelet	9ec6ebd3ce	[libc] Improve memcmp latency and codegen This is based on ideas from @nafi to: - use a branchless version of 'cmp' for 'uint32_t', - completely resolve the lexicographic comparison through vector operations when wide types are available. We also get rid of byte reloads and serializing '__builtin_ctzll'. I did not include the suggestion to replace comparisons of 'uint16_t' with two 'uint8_t' as it did not seem to help the codegen. This can be revisited in sub-sequent patches. The code been rewritten to reduce nested function calls, making the job of the inliner easier and preventing harmful code duplication. Reviewed By: nafi3000 Differential Revision: https://reviews.llvm.org/D148717	2023-06-05 09:46:05 +00:00
Aiden Grossman	ea8f4b9841	[libc][CMake] Place archives in build/lib/<target-triple> This patch moves the location of libllvmlibc.a within the build tree to within ./lib/<target triple>. This more closely matches the behavior of other runtime builds and allows for clang in the same build tree to automatically be able to link against llvmlibc since this path is by default included by the driver. Also removes the LIBC_BINARY_DIR CMake flag since it isn't used anywhere in the tree (based on a quick grep). Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D151624	2023-06-03 22:40:03 +00:00
Tue Ly	5a4e344bd9	[libc][NFC] Add LIBC_INLINE and attribute.h header includes to targets' FMA.h. Targets' FMA.h headers are missing LIBC_INLINE and attributes.h header. Reviewed By: brooksmoses Differential Revision: https://reviews.llvm.org/D152024	2023-06-02 21:15:58 -04:00
Joseph Huber	48bb7bb868	[libc] Disable the string_to_float test on NVPTX This test began failing after recent changes. Disable it for now. Differential Revision: https://reviews.llvm.org/D152032	2023-06-02 15:56:39 -05:00
Joseph Huber	cfde5f2d89	[libc] Implement 'errno' on the GPU as a global integer internally The C standard asserts that the `errno` value is an l-value thread local integer. We cannot provide a generic thread local integer on the GPU currently without some workarounds. Previously, we worked around this by implementing the `errno` value as a special consumer class that made all the writes disappear. However, this is problematic for internal tests. Currently there are build failures because of this handling and it's only likely to cause more problems the more we do this. This patch instead makes the internal target used for testing export the `errno` value as a simple global integer. This allows us to use and test the `errno` interface correctly assuming we run with a single thread. Because this is only used for the non-exported target we still do not provide this feature in the version that users will use so we do not need to worrk about it being incorrect in general. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D152015	2023-06-02 14:16:24 -05:00
Tue Ly	f9753ef189	[libc][Obvious] Fix a typo in setting FMA control option for RISCV64.	2023-06-02 11:15:29 -04:00
Michael Jones	722832e6d7	[libc] Add strtoint32 and strtoint64 tests There were regressions in the testing framework due to none of the functioning buildbots having a 32 bit long. This allowed the 32 bit version of the strtointeger function to go untested. This patch adds tests for strtoint32 and strtoint64, which are internal testing functions that use constant integer sizes. It also fixes the tests to properly handle these situations. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D151935	2023-06-01 15:06:46 -07:00
Guillaume Chatelet	2697ffd039	[libc] Reduce math tests runtime further Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D151875	2023-06-01 12:43:18 +00:00
Guillaume Chatelet	ae5c472410	[libc] Reduce math tests runtime Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D151798	2023-06-01 05:01:56 +00:00
Tue Ly	cfc5c6cb8d	[libc][docs] Update implementation status table for Date and Time Functions. Update implementation status table for Date and Time Functions to include different targets. Reviewed By: jeffbailey Differential Revision: https://reviews.llvm.org/D151809	2023-05-31 15:09:06 -04:00
Guillaume Chatelet	c76a3e795e	[libc][NFC] Fixing various typos	2023-05-31 12:11:09 +00:00
Tue Ly	e557b8a142	[libc][RISCV] Add log, log2, log1p, log10 for RISC-V64 entrypoints. Add log, log2, log1p, log10 RISCV64 entrypoints. Reviewed By: michaelrj, sivachandra Differential Revision: https://reviews.llvm.org/D151674	2023-05-30 14:18:19 -04:00
Joseph Huber	1ef0bafc4f	[libc][NFC] Move the Linux file implementation to a subdirectory This patch simply moves the special handling for `linux` files to a subdirectory. This is done to make it easier in the future to extend this support to targets (like the GPU) that will have different dependencies. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D151231	2023-05-30 06:49:21 -05:00
Mark de Wever	cbaa3597aa	Reland "[CMake] Bumps minimum version to 3.20.0. This reverts commit d763c6e5e2d0a6b34097aa7dabca31e9aff9b0b6. Adds the patch by @hans from https://github.com/llvm/llvm-project/issues/62719 This patch fixes the Windows build. d763c6e5e2d0a6b34097aa7dabca31e9aff9b0b6 reverted the reviews D144509 [CMake] Bumps minimum version to 3.20.0. This partly undoes D137724. This change has been discussed on discourse https://discourse.llvm.org/t/rfc-upgrading-llvms-minimum-required-cmake-version/66193 Note this does not remove work-arounds for older CMake versions, that will be done in followup patches. D150532 [OpenMP] Compile assembly files as ASM, not C Since CMake 3.20, CMake explicitly passes "-x c" (or equivalent) when compiling a file which has been set as having the language C. This behaviour change only takes place if "cmake_minimum_required" is set to 3.20 or newer, or if the policy CMP0119 is set to new. Attempting to compile assembly files with "-x c" fails, however this is workarounded in many cases, as OpenMP overrides this with "-x assembler-with-cpp", however this is only added for non-Windows targets. Thus, after increasing cmake_minimum_required to 3.20, this breaks compiling the GNU assembly for Windows targets; the GNU assembly is used for ARM and AArch64 Windows targets when building with Clang. This patch unbreaks that. D150688 [cmake] Set CMP0091 to fix Windows builds after the cmake_minimum_required bump The build uses other mechanism to select the runtime. Fixes #62719 Reviewed By: #libc, Mordante Differential Revision: https://reviews.llvm.org/D151344	2023-05-27 12:51:21 +02:00
Krasimir Georgiev	07f49bf475	[libc] Adapt includes after 25174976e19b2ef916bb94f4613662646c95cd46	2023-05-26 14:25:50 +00:00
Siva Chandra Reddy	4f1fe19df3	[libc] Make ErrnoSetterMatcher handle logging floating point values. Along the way, couple of additional things have been done: 1. Move `ErrnoSetterMatcher.h` to `test/UnitTest` as all other matchers live there now. 2. `ErrnoSetterMatcher` ignores matching `errno` on GPUs. Reviewed By: jhuber6 Differential Revision: https://reviews.llvm.org/D151129	2023-05-26 06:24:51 +00:00
Joseph Huber	4311246a3a	Revert "[libc] Enable hermetic floating point tests" This passed locally but unfortauntely it seems some tests are not ready to be made hermetic. Revert for now until we can investigate specifically which tests are failing and mark those as `UNIT_TEST_ONLY`. This reverts commit 417ea79e792a87d53f5ac4f5388af4b25aa04d7d.	2023-05-25 19:15:02 -05:00
Joseph Huber	417ea79e79	[libc] Enable hermetic floating point tests This patch enables us to run the floating point tests as hermetic. Importantly we now use the internal versions of the `fesetround` and `fegetround` functions. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D151123	2023-05-25 19:08:44 -05:00
Tue Ly	0aa7ea4e22	[libc][darwin] Add OSUtil for darwin arm64 target so that unit tests can be run. Currently unit tests cannot be run on macOS due to missing OSUtil. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D151377	2023-05-25 19:25:24 -04:00
Tue Ly	0bda541829	[libc][doc] Update math function status page to show more targets. Show availability of math functions on each target. Reviewed By: jeffbailey Differential Revision: https://reviews.llvm.org/D151489	2023-05-25 19:24:33 -04:00
Roland McGrath	65c78933ae	[libc] Support LIBC_COPT_USE_C_ASSERT build flag In this mode, LIBC_ASSERT is just standard C assert. Reviewed By: abrachet Differential Revision: https://reviews.llvm.org/D151498	2023-05-25 14:10:33 -07:00
Roland McGrath	1d4e8f0ea6	[libc] Fix compilation issues in memory_check_utils.h Strict warnings require explicit static_cast to counteract default widening of types narrower than int. Functions in header files should have vague linkage (inline keyword), not internal linkage (static) or external linkage (no inline keyword) even for template functions. Note these don't use the LIBC_INLINE macro since this is only for test code. Reviewed By: abrachet Differential Revision: https://reviews.llvm.org/D151494	2023-05-25 14:09:14 -07:00
Siva Chandra Reddy	daeee56798	[libc] Add macro LIBC_THREAD_LOCAL. It resolves to thread_local on all platform except for the GPUs on which it resolves to nothing. The use of thread_local in the source code has been replaced with the new macro. Reviewed By: jhuber6 Differential Revision: https://reviews.llvm.org/D151486	2023-05-25 19:53:52 +00:00

1 2 3 4 5 ...

1902 Commits