llvm-project

Author	SHA1	Message	Date
lntue	da28593d71	[libc][math] Implement double precision expm1 function correctly rounded for all rounding modes. (#67048 ) Implementing expm1 function for double precision based on exp function algorithm: - Reduced x = log2(e) * (hi + mid1 + mid2) + lo, where: * hi is an integer * mid1 * 2^-6 is an integer * mid2 * 2^-12 is an integer * \|lo\| < 2^-13 + 2^-30 - Then exp(x) - 1 = 2^hi * 2^mid1 * 2^mid2 * exp(lo) - 1 ~ 2^hi * (2^mid1 * 2^mid2 * (1 + lo * P(lo)) - 2^(-hi) ) - We evaluate fast pass with P(lo) is a degree-3 Taylor polynomial of (e^lo - 1) / lo in double precision - If the Ziv accuracy test fails, we use degree-6 Taylor polynomial of (e^lo - 1) / lo in double double precision - If the Ziv accuracy test still fails, we re-evaluate everything in 128-bit precision.	2023-09-28 16:43:15 -04:00
Joseph Huber	1a5d3b6cda	[libc] Scan the ports more fairly in the RPC server (#66680 ) Summary: Currently, we use the RPC server to respond to different ports which each contain a request from some client thread wishing to do work on the server. This scan starts at zero and continues until its checked all ports at which point it resets. If we find an active port, we service it and then restart the search. This is bad for two reasons. First, it means that we will always bias the lower ports. If a thread grabs a high port it will be stuck for a very long time until all the other work is done. Second, it means that the `handle_server` function can technically run indefinitely as long as the client is always pushing new work. Because the OpenMP implementation uses the user thread to service the kernel, this means that it could be stalled with another asyncrhonous device's kernels. This patch addresses this by making the server restart at the next port over. This means we will always do a full scan of the ports before quitting.	2023-09-26 16:09:48 -05:00
Joseph Huber	2b7227db1e	[libc] Fix RPC server global after mass replace of __llvm_libc Summary: This variable needs a reserved name starting with `__`. It was mistakenly changed with a mass replace. It happened to work because the tests still picked up the associated symbol, but it just became a bad name because it's not reserved anymore.	2023-09-26 14:28:48 -05:00
Siva Chandra	f2c9fe452f	[libc][NFC] Fix delete operator linkage names after switch to LIBC_NAMESPACE. (#67475 ) The name __llvm_libc was mass-replaced with LIBC_NAMESPACE which ended up changing the "__llvm_libc" prefix of the delete operator linkage names to "LIBC_NAMESPACE". This change corrects it by changing the namespace prefix to "__llvm_libc_<version info>".	2023-09-26 11:53:14 -07:00
Siva Chandra	3bfd6a7521	[libc][NFC] Add compile options only to the header libraries which use them. (#67447 ) Other libraries dependent on these libraries will automatically inherit those compile options. This change in particular affects the compile option "-DLIBC_COPT_STDIO_USE_SYSTEM_FILE".	2023-09-26 09:20:00 -07:00
Mikhail R. Gadelha	e3087c4b8c	[libc] Start to refactor riscv platform abstraction to support both 32 and 64 bits versions This patch enables the compilation of libc for rv32 by unifying the current rv64 and rv32 implementation into a single rv implementation. We updated the cmake file to match the new riscv32 arch and force LIBC_TARGET_ARCHITECTURE to be "riscv" whenever we find "riscv32" or "riscv64". This is required as LIBC_TARGET_ARCHITECTURE is used in the path for several platform specific implementations. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D148797	2023-09-26 12:32:25 -03:00
Siva Chandra	599eadec28	[libc] Propagate printf config options from a single config header library. (#66979 ) printf_core.parser is not yet updated to use the printf config options. It does not use them currently anyway and the corresponding parser_test should be updated to respect the config options.	2023-09-26 08:16:31 -07:00
Joseph Huber	1b8c8155cc	[libc][Obvious] Fix incorrect filepath for ftell.h header Summary: The previous patch moved the location of this CMake line but didn't update the header. Fix it.	2023-09-26 10:02:20 -05:00
Joseph Huber	7ac8e26fc7	[libc] Implement `fseek`, `fflush`, and `ftell` on the GPU (#67160 ) Summary: This patch adds the necessary entrypoints to handle the `fseek`, `fflush`, and `ftell` functions. These are all very straightfoward, we simply make RPC calls to the associated function on the other end. Implementing it this way allows us to more or less borrow the state of the stream from the server as we intentionally maintain no internal state on the GPU device. However, this does not implement the `errno` functinality so that must be ignored.	2023-09-26 09:46:46 -05:00
Guillaume Chatelet	b6bc9d72f6	[libc] Mass replace enclosing namespace (#67032 ) This is step 4 of https://discourse.llvm.org/t/rfc-customizable-namespace-to-allow-testing-the-libc-when-the-system-libc-is-also-llvms-libc/73079	2023-09-26 11:45:04 +02:00
michaelrj-google	23552fe220	[libc] Acquire the lock for scanf files (#67357 ) When creating the new scanf reader design, I forgot to add back the calls to flockfile and funlockfile in vfscanf_internal. This patch fixes that, and also changes the system file version to use the normal variants since ungetc_unlocked isn't always available.	2023-09-25 15:00:03 -07:00
Joseph Huber	791b279924	[libc] Change the `puts` implementation on the GPU (#67189 ) Summary: Normally, the implementation of `puts` simply writes a second newline charcter after printing the first string. However, because the GPU does everything in batches of the SIMT group size, this will end up with very poor output where you get the strings printed and then 1-64 newline characters all in a row. Optimizations like to turn `printf` calls into `puts` so it's a good idea to make this produce the expected output. The least invasive way I could do this was to add a new opcode. It's a little bloated, but it avoids an unneccessary and slow send operation to configure this.	2023-09-25 11:17:22 -05:00
michaelrj-google	a5a008ff4f	[libc] Refactor scanf reader to match printf (#66023 ) In a previous patch, the printf writer was rewritten to use a single writer class with a buffer and a callback hook. This patch refactors scanf's reader to match conceptually.	2023-09-22 12:50:02 -07:00
Joseph Huber	e0be78be42	[libc] Template the printf / scanf parser class (#66277 ) Summary: The parser class for stdio currently accepts different argument providers. In-tree this is only used for a fuzzer test, however, the proposed implementation of the GPU handling of printf / scanf will require custom argument handlers. This makes the current approach of using a preprocessor macro messier. This path proposed folding this logic into a template instantiation. The downside to this is that because the implementation of the parser class is placed into an implementation file we need to manually instantiate the needed templates which will slightly bloat binary size. Alternatively we could remove the implementation file, or key off of the `libc` external packaging macro so it is not present in the installed version.	2023-09-21 17:02:26 -05:00
Joseph Huber	f548d19fc8	[libc] Fix and simplify the implementation of 'fread' on the GPU (#66948 ) Summary: Previously, the `fread` operation was wrong in cases when we read less data than was requested. That is, if we tried to read N bytes while the file was in EOF, it would still copy N bytes of garbage. This is fixed by only copying over the sizes we got from locally opening it rather than just using the provided size. Additionally, this patch simplifies the interface. The output functions have special variants for writing to stdout / stderr. This is primarily an optimization for these common cases so we can avoid sending the stream as an argument which has a high delay. Because for input, we already need to start with a `send` to tell the server how much data to read, it costs us nothing to send the file along with it so this is redundant. Re-use the file encoding scheme from the other implementations, the one that stores the stream type in the LSBs of the FILE pointer.	2023-09-21 14:28:06 -05:00
michaelrj-google	5bd34e0a55	[libc] Fix Off By One Errors In Printf Long Double (#66957 ) Two major off-by-one errors are fixed in this patch. The first is in float_to_string.h with length_for_num, which wasn't accounting for the implicit leading bit when calculating the length of a number, causing a missing digit on 80 bit float max. The other off-by-one is the ryu_long_double_constants.h (a.k.a the Mega Table) not having any entries for the last POW10_OFFSET in POW10_SPLIT. This was also found on 80 bit float max. Finally, the integer calculation mode was using a slightly too short integer, again on 80 bit float max, not accounting for the mantissa width. All of these are fixed in this patch.	2023-09-21 11:43:29 -07:00
Joseph Huber	59896c168a	[libc] Remove the 'rpc_reset' routine from the RPC implementation (#66700 ) Summary: This patch removes the `rpc_reset` function. This was previously used to initialize the RPC client on the device by setting up the pointers to communicate with the server. The purpose of this was to make it easier to initialize the device for testing. However, this prevented us from enforcing an invariant that the buffers are all read-only from the client side. The expected way to initialize the server is now to copy it from the host runtime. This will allow us to maintain that the RPC client is in the constant address space on the GPU, potentially through inference, and improving caching behaviour.	2023-09-21 11:07:09 -05:00
Guillaume Chatelet	270547f3bf	[libc][clang-tidy] Add llvm-header-guard to get consistant naming and prevent file copy/paste issues. (#66477 )	2023-09-21 11:14:47 +02:00
Joseph Huber	3641d18557	[libc][Obvious] Fix incorrect RPC opcode for `clearerr` Summary: This was mistakenly using the opcode for `ferror` which wasn't noticed because tests using this weren't yet activated. This patch fixes this mistake.	2023-09-20 11:54:35 -05:00
michaelrj-google	d37496e75a	[libc] Fix printf config not working (#66834 ) The list of printf copts available in config.json wasn't working because the printf_core subdirectory was included before the printf_copts variable was defined, making it effectively nothing for the printf internals. Additionally, the tests weren't respecting the flags so they would cause the tests to fail. This patch reorders the cmake in src and adds flag handling in test.	2023-09-19 15:36:14 -07:00
Tue Ly	84c899b235	[libc][math] Extract non-MPFR math tests into libc-math-smoke-tests. Extract non-MPFR math tests into libc-math-smoke-tests. Reviewed By: sivachandra, jhuber6 Differential Revision: https://reviews.llvm.org/D159477	2023-09-19 12:10:21 -04:00
Guillaume Chatelet	2dbdc9fc85	[libc] Add invoke / invoke_result type traits (#65750 )	2023-09-15 11:15:41 +02:00
Joseph Huber	bbe7eb92b4	[libc][Obvious] Fix missing entrypoints after moving to generic Summary: The previous patch moved the implementations of these to generic/ and accidentally did not add the unlocked variants. This patch fixes that	2023-09-14 15:59:08 -05:00
Joseph Huber	a1be5d69df	[libc] Implement more input functions on the GPU (#66288 ) Summary: This patch implements the `fgets`, `getc`, `fgetc`, and `getchar` functions on the GPU. Their implementations are straightforward enough. One thing worth noting is that the implementation of `fgets` will be extremely slow due to the high latency to read a single char. A faster solution would be to make a new RPC call to call `fgets` (due to the special rule that newline or null breaks the stream). But this is left out because performance isn't the primary concern here.	2023-09-14 15:39:29 -05:00
Alex Brachet	2ad7a06cb1	[libc] Fix some warnings (#66366 ) Some compilers will warn about dangling else and missleading lack of parentheses.	2023-09-14 08:47:21 -04:00
Guillaume Chatelet	aee8f8784a	[libc][utils] cpp::always_false to enable static_assert(false) (#66209 )	2023-09-14 10:28:43 +02:00
Siva Chandra	17114f8b19	[libc] Remove common_libc_tuners.cmake and move options into config.json. (#66226 ) The name has been changed to adhere to the config option naming format. The necessary build changes to use the new option have also been made.	2023-09-13 22:17:00 -07:00
Michael Jones	3fb63c2921	[libc] simplify printf float writing The two decimal float printing styles are similar, but different in how they end. For simplicity of writing I initially gave them different "write_last_block" functions. This patch unifies them into one function. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D158036	2023-09-13 13:53:29 -07:00
michaelrj-google	380eb46b13	[libc] Move long double table option to new config (#66151 ) This patch adds the long double table option for printf into the new configuration scheme. This allows it to be set for most targets but unset for baremetal.	2023-09-13 10:43:05 -07:00
Mikhail R. Gadelha	75398f28eb	[libc] Make time_t 64 bits long on all platforms but arm32 This patch changes the size of time_t to be an int64_t. This still follows the POSIX standard which only requires time_t to be an integer. Making time_t a 64-bit integer also fixes two cases in 32 bits platforms that use SYS_clock_nanosleep_time64 and SYS_clock_gettime64, as the name of these calls implies, they require a 64-bit time_t. For instance, in rv32, the 32-bit version of these syscalls is not available. We also follow glibc here, where time_t is still a 32-bit integer in arm32. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D159125	2023-09-13 10:49:39 -03:00
Joseph Huber	ef169f5707	[libc] Improve the implementation of the rand() function (#66131 ) Summary: This patch improves the implementation of the standard `rand()` function by implementing it in terms of the xorshift64star pRNG as described in https://en.wikipedia.org/wiki/Xorshift#xorshift*. This is a good, general purpose random number generator that is sufficient for most applications that do not require an extremely long period. This patch also correctly initializes the seed to be `1` as described by the standard. We also increase the `RAND_MAX` value to be `INT_MAX` as the standard only specifies that it can be larger than 32768.	2023-09-12 16:52:20 -05:00
Joseph Huber	688019851e	[libc][NFC] Factor GPU exiting into a common function (#66093 ) Summary: We currently call the GPU routine to terminate the current thread in three separate locations .This should be wrapped into a helper function to simplify the implementation.	2023-09-12 14:59:02 -05:00
Siva Chandra	c5ad6c7781	[libc] Fix a typo in a CMakeLists.txt - replace DEPS with DEPENDS. (#66130 )	2023-09-12 12:24:27 -07:00
Guillaume Chatelet	7329816285	[libc] Add is_object (#65749 ) Add the is_object type traits. Implementation comes from https://en.cppreference.com/w/cpp/types/is_object	2023-09-12 10:35:22 +02:00
Guillaume Chatelet	d557e2b076	[libc][NFC] Fix missing header in CMakelists.txt (#65960 )	2023-09-11 14:12:58 +00:00
Guillaume Chatelet	88348252a6	[libc] Add missing add_lvalue_reference_t (#65940 )	2023-09-11 11:31:37 +02:00
Joseph Huber	60c0d303d6	[libc] Implement stdio writing functions for the GPU port (#65809 ) Summary: This patch implements fwrite, putc, putchar, and fputc on the GPU. These are very straightforward, the main difference for the GPU implementation is that we are currently ignoring `errno`. This patch also introduces a minimal smoke test for `putc` that is an exact copy of the `puts` test except we print the string char by char. This also modifies the `fopen` test to use `fwrite` to mirror its use of `fread` so that it is tested as well.	2023-09-09 13:27:07 -05:00
Joseph Huber	31d4f0692f	[libc][NFC] Cleanup the GPU file I/O utility header (#65680 ) Summary: The GPU uses separate implementations to perform file IO. This is all done through the RPC interface and we kept it minimal such that we could treat a `stdin`, `stdout`, or `stderr` handle from the CPU correctly on the GPU. The RPC implementation uses different opcodes for whether or not we are using one of the standard streams. This is so we do not need to initialize anything to access the CPU's standard stream, because the server knows that it should print to `stdout` if it gets the `STDOUT` variant of the opcode. It also saves us an RPC call, which are expensive relatively speaking. This patch simply cleans up this interface to make them all use a common function. This is done in preparation to implement some more file IO functions like getc or putc.	2023-09-08 14:15:53 -05:00
Mikhail R. Gadelha	123bf08402	[libc] Unify gettime implementations (#65383 ) Similar to D159208, this patch unifies the calls to a syscall, in this patch it is the syscall SYS_clock_gettime/SYS_clock_gettime64. This patch also fixes calls to SYS_clock_gettime64 by creating a timespec64 object, passing it to the syscall and rewriting the timespec given by the caller with timespec64 object's contents. This fixes cases where timespec has a 4 bytes long time_t member, but SYS_clock_gettime is not available (e.g., rv32).	2023-09-08 12:41:29 -04:00
Guillaume Chatelet	74971db140	[libc] Add is_scalar (#65740 ) Adds the is_scalar traits based on implementation in https://en.cppreference.com/w/cpp/types/is_scalar	2023-09-08 12:45:17 +00:00
Guillaume Chatelet	eebf8faf3e	[libc] Add is_member_pointer_v (#65631 ) Implementation from https://en.cppreference.com/w/cpp/types/is_member_pointer	2023-09-08 11:36:19 +02:00
Michael Jones	dd51ae81d8	[libc] Fix printf %p format The %p format wasn't correctly passing along flags and modifiers to the integer conversion behind the scenes. This patch fixes that behavior, as well as changing the nullptr behavior to be a string conversion behind the scenes. Reviewed By: lntue, jhuber6 Differential Revision: https://reviews.llvm.org/D159458	2023-09-07 14:13:35 -07:00
Joseph Huber	d6cc3410ab	[libc] Fix missing GPU math implementations (#65616 ) These functions were implemented by simply calling their `__builtin_*` equivalents. The builtins were resolving to the libc functions back again. This patch adds explicit vendor versions for these functions to avoid the recursion.	2023-09-07 11:48:44 -05:00
Guillaume Chatelet	260036ab1e	[libc] move in_place_t in utility (#65623 ) This is needed because `cpp::in_place_t` is also used by `cpp::expected` https://en.cppreference.com/w/cpp/utility/in_place	2023-09-07 18:12:50 +02:00
Guillaume Chatelet	a279bf0d78	[libc] Add is_null_pointer_v (#65627 )	2023-09-07 18:07:24 +02:00
Guillaume Chatelet	f72d41b5b1	[libc] Add missing include in type_traits/remove_all_extents.h (#65626 )	2023-09-07 17:57:47 +02:00
Guillaume Chatelet	26ddf2c935	[libc] fix missing default template parameter value in enable_if (#65622 )	2023-09-07 17:42:00 +02:00
Tue Ly	f0d05bb699	[libc][math] Fix signed zeros for acosf, acoshf, and atanf in FE_DOWNWARD mode. Fix signed zeros for acosf, acoshf, and atanf in FE_DOWNWARD mode. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D159476	2023-09-07 15:21:33 +00:00
Guillaume Chatelet	9c4e005678	[libc] customizable namespace 2/4 (#65471 ) This implements the second step of https://discourse.llvm.org/t/rfc-customizable-namespace-to-allow-testing-the-libc-when-the-system-libc-is-also-llvms-libc/73079 Namely "Add a guard in `src/__support/common.h`"	2023-09-07 17:19:47 +02:00
Mikhail R. Gadelha	80225af4c1	[libc] Fix overflow check for 32 bit long time_t (#65394 ) This patch fixes the overflow check in update_from_seconds, used by gmtime, gmtime_r and mktime. In update_from_seconds, total_seconds is a int64_t and the previous overflow check for when sizeof(time_t) == 4 would check if it was < 0x80000000 and > 0x7FFFFFFF, however, this check would cause the following issues: 1. Valid negative numbers would be discarded, e.g., -1 is 0xffffffffffffffff as a int64_t, outside the range of the overflow check. 2. Some valid positive numbers would be discarded because the hex constants were being implicitly converted to int64_t, e.g., 0x80000000 would be implicitly converted to 2147483648, instead of -2147483648. The fix for both cases was to static_cast total_seconds and the constants to time_t if sizeof(time_t) == 4. The behaviour is not changed in systems with sizeof(time_t) == 8. --------- Signed-off-by: Mikhail R. Gadelha <mikhail@igalia.com>	2023-09-07 09:18:23 -04:00

1 2 3 4 5 ...

1256 Commits