llvm-project

Author	SHA1	Message	Date
Mircea Trofin	795910c2d9	Fix windows bot breakages due to D143110	2023-02-01 16:27:44 -08:00
Mircea Trofin	83051c5a5f	[mlgo] Make InteractiveModelRunner actually work with named pipes Turns out raw_fd_stream doesn't work with named pipes, so we just need to lower the abstraction. Updated the unittest accordingly. Because mkfifo's path argument requires a certain naming pattern on Windows (IIUC), restricted the test to Linux only. Differential Revision: https://reviews.llvm.org/D143110	2023-02-01 15:24:44 -08:00
Mircea Trofin	35aa73746c	[mlgo] Allow logging the spec for the "advice", if needed This is for the interactive model runner, so it can confirm the tensor spec of the advice with its host.	2023-02-01 10:24:38 -08:00
Mircea Trofin	5b8dc7c8a5	[mlgo] Introduce an "InteractiveModelRunner" This is a model runner for ML researchers using environments like CompilerGym. In such environments, researchers host the compiler and want to be able to observe the problem space (features) at each decision step of some optimization pass, at which point the compiler is stopped, waiting for the host to make a decision and provide advice back to the compiler, which then continues its normal operation, and so on. The InteractiveModelRunner supports this scenario for the feature set exposed by the compiler at a given time. It uses 2 files - ideally FIFO pipes - one to pass data to the host, the other to get advice back from the host. This means this scenario is supported with no special dependencies. The file creation and deletion is the responsibility of the host. Hooking up this model evaluator to an MLGO-ed pass is the responsibility of the pass author, and subsequent patches will do so for the current set of mlgo passes, and offer an API to easily "just opt in" by default when mlgo-ing a new pass. The data protocol is that of the training logger: the host sees a training log doled out observation by observation by reading from one of the files, and passes back its advice as a serialized tensor (i.e. tensor value memory dump) via the other file. There are some differences wrt the log seen during training: the interactive model doesn't currently include the outcome (because it should be identical to the decision, and it's also not present in the "release" mode); and partial rewards aren't currently communicated back. The assumption - just like with the training logger - is that the host is co-located, thus avoiding any endianness concerns. In a distributed environment, it is up to the hosting infrastructure to intermediate that. Differential Revision: https://reviews.llvm.org/D142642	2023-01-27 17:03:28 -08:00
Simon Pilgrim	345ed58ed5	Fix implicit double -> float truncation warnings. NFCI.	2022-05-13 19:07:00 +01:00
Mircea Trofin	c35ad9ee4f	[mlgo] Support exposing more features than those supported by models This allows the compiler to support more features than those supported by a model. The only requirement (development mode only) is that the new features must be appended at the end of the list of features requested from the model. The support is transparent to compiler code: for unsupported features, we provide a valid buffer to copy their values; it's just that this buffer is disconnected from the model, so insofar as the model is concerned (AOT or development mode), these features don't exist. The buffers are allocated at setup - meaning, at steady state, there is no extra allocation (maintaining the current invariant). These buffers have 2 roles: first, keep the compiler code simple; second, allow logging their values in development mode. The latter allows retraining a model supporting the larger feature set starting from traces produced with the old model. For release mode (AOT-ed models), this decouples compiler evolution from model evolution, which we want in scenarios where the toolchain is frequently rebuilt and redeployed: we can first deploy the new features, and continue working with the older model, until a new model is made available, which can then be picked up the next time the compiler is built. Differential Revision: https://reviews.llvm.org/D124565	2022-05-09 18:01:21 -07:00
Mircea Trofin	059e03476c	[NFC][mlgo] Generalize model runner interface This prepares it for the regalloc work. Part of it is making model evaluation across 'development' and 'release' scenarios more reusable. This patch: - extends support to tensors of any shape (not just scalars, like we had in the inliner -Oz case). While the tensor shape can be anything, we assume row-major layout and expose the tensor as a buffer. - exposes the NoInferenceModelRunner, which we use in the 'development' mode to keep the evaluation code path consistent and simplify logging, as we'll want to reuse it in the regalloc case. Differential Revision: https://reviews.llvm.org/D115306	2021-12-08 20:10:58 -08:00

7 Commits
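
Illustration for commit 5b8dc7c8a5 (not part of the commit itself): the message says the host passes its advice back "as a serialized tensor (i.e. tensor value memory dump)" and assumes a co-located host, so native byte order. A minimal host-side sketch of that one step might look like the following. The helper names write_advice and read_advice, the int64 element type, and the element count are assumptions made for the example; in practice the element type and shape would come from the advice tensor spec that the compiler reports. An in-memory stream stands in for the FIFO so the sketch runs anywhere.

```python
import io
import struct


def write_advice(stream, values, elem_fmt="q"):
    """Write `values` as a raw native-endian memory dump (no header, no padding).

    `elem_fmt` is a struct format character; "q" (int64) is an assumption here,
    the real element type would be taken from the advice tensor spec.
    """
    payload = struct.pack(f"={len(values)}{elem_fmt}", *values)
    stream.write(payload)
    stream.flush()
    return len(payload)


def read_advice(stream, count, elem_fmt="q"):
    """Read back `count` elements written by write_advice (compiler side, for testing)."""
    fmt = f"={count}{elem_fmt}"
    size = struct.calcsize(fmt)
    buf = stream.read(size)
    if len(buf) != size:
        raise EOFError(f"expected {size} bytes of advice, got {len(buf)}")
    return list(struct.unpack(fmt, buf))


if __name__ == "__main__":
    # io.BytesIO stands in for the host-to-compiler FIFO in this sketch.
    channel = io.BytesIO()
    write_advice(channel, [1])
    channel.seek(0)
    print(read_advice(channel, 1))
```

With a real FIFO, the host would open the file it created in binary mode and pass that file object as `stream`; the observation side of the protocol (the training-log format) is deliberately left out here because its layout is not spelled out in the commit message.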