39 Commits

Author SHA1 Message Date
Mircea Trofin
ab2e7666c2 [mlgo][inl] Interactive mode: optionally tell the default decision
This helps training algorithms that may want to sometimes replicate the
default decision. The default decision is presented as an extra feature
called `inlining_default`. It is not normally exported, to save
computation time.

This is only available in interactive mode.

Differential Revision: https://reviews.llvm.org/D147794
2023-04-10 12:20:09 -07:00
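A minimal sketch of how such an optional extra feature might be wired up. The function and flag names here are hypothetical; only `TensorSpec::createSpec` and the feature names mirror the in-tree API:

```cpp
#include "llvm/Analysis/TensorSpec.h"
#include <vector>

using namespace llvm;

std::vector<TensorSpec> buildFeatureSpecs(bool InteractiveMode,
                                          bool TellDefaultDecision) {
  std::vector<TensorSpec> Specs = {
      TensorSpec::createSpec<int64_t>("callee_basic_block_count", {1}),
      TensorSpec::createSpec<int64_t>("callsite_height", {1}),
  };
  // Only interactive mode may ask for the default decision; computing it
  // costs time, so it stays off unless explicitly requested.
  if (InteractiveMode && TellDefaultDecision)
    Specs.push_back(TensorSpec::createSpec<int64_t>("inlining_default", {1}));
  return Specs;
}
```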
Mircea Trofin
5fd51fcba6 Reland "[mlgo] Hook up the interactive runner to the mlgo-ed passes"
This reverts commit a772f0bb920a4957fb94dd8dbe45943809fd0ec3.

The main problem was related to how we handled `dbgs()` from the hosted
compiler. Using explicit `subprocess.communicate`, and not relying on
`dbgs()` being flushed until the end, appears to address the problem.

Also includes some fixes because some bots run older Pythons, so we
can't have nice things like `int | float` and such.
2023-02-03 17:54:42 -08:00
Mircea Trofin
a772f0bb92 Revert "[mlgo] Hook up the interactive runner to the mlgo-ed passes"
This reverts commit a7354899d1a235a796b3a2ccb45f6596983c8672.

The way stdout/stderr get routed seems to work differently locally and
on the bots. Investigating.
2023-02-03 16:34:31 -08:00
Mircea Trofin
a7354899d1 [mlgo] Hook up the interactive runner to the mlgo-ed passes
This hooks up the interactive model runner to the passes that support
ml-based decisions. Because this runner's interface is identical to the
one used during inference, we just reuse the setup we already have for
"release mode". This makes "release mode" a misnomer -
and that's something we needed to resolve sooner or later (e.g.
supporting more than one embedded model for the same problem was another
reason to drop that nomenclature). That will happen in a subsequent
change.

To use this evaluator, just enable the pass in (currently) "release"
mode, and also pass the base name for the two channel files via the
pass-specific flag.

The two files are the responsibility of the hosting process. The added
tests use a minimal toy host that illustrates setup and communication.

Differential Revision: https://reviews.llvm.org/D143218
2023-02-03 16:22:57 -08:00
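A rough sketch of the two-channel exchange described above, assuming raw binary pipes. This is illustrative plumbing only, not the actual InteractiveModelRunner code or wire format:

```cpp
#include <cstdint>
#include <fstream>
#include <string>
#include <vector>

// The hosting process creates <base>.out and <base>.in (e.g. named pipes)
// before the compiler starts; the compiler keeps both open for its lifetime.
struct ChannelPair {
  std::ofstream Out; // compiler -> host: feature values
  std::ifstream In;  // host -> compiler: the advice

  explicit ChannelPair(const std::string &Base)
      : Out(Base + ".out", std::ios::binary),
        In(Base + ".in", std::ios::binary) {}

  // Send one observation's features, then block until the host replies.
  int64_t ask(const std::vector<int64_t> &Features) {
    Out.write(reinterpret_cast<const char *>(Features.data()),
              static_cast<std::streamsize>(Features.size() * sizeof(int64_t)));
    Out.flush();
    int64_t Advice = 0;
    In.read(reinterpret_cast<char *>(&Advice), sizeof(Advice));
    return Advice;
  }
};
```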
Mircea Trofin
6d11baf02b [mlgo] Stream the training data
This leverages the new logging format: since we no longer need to
buffer the training data, we can just write it out as it is produced.

Differential Revision: https://reviews.llvm.org/D142168
2023-01-20 07:01:08 -08:00
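A hypothetical before/after contrast of the buffering change; the logger names and the line-oriented format are made up for illustration:

```cpp
#include <cstdint>
#include <ostream>
#include <vector>

// Before: buffer everything, serialize at the end of the module.
struct BufferingLogger {
  std::vector<std::vector<int64_t>> Observations;
  void log(std::vector<int64_t> Obs) { Observations.push_back(std::move(Obs)); }
  void flush(std::ostream &OS) {
    for (const auto &Obs : Observations) {
      for (int64_t V : Obs)
        OS << V << ' ';
      OS << '\n';
    }
    Observations.clear();
  }
};

// After: an append-friendly format lets each observation be written as
// soon as it is produced, with no retained state.
struct StreamingLogger {
  std::ostream &OS;
  void log(const std::vector<int64_t> &Obs) {
    for (int64_t V : Obs)
      OS << V << ' ';
    OS << '\n';
  }
};
```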
Mircea Trofin
5898be19e6 [mlgo] Remove the protobuf dependency
The dependency was due to the log format. This change switches to the
previously-introduced (D139370) "dependency-free" logger instead of the
protobuf-based one.

A subsequent change will clean out the unnecessary abstraction left
behind.

This change drops the logger unittest: we have sufficient test coverage
via lit tests, and a unit test would require unnecessarily adding a log
reader (the reader is expected to be Python, for the ML side, and there
is a reader for that under Analysis/models, used for tests).

Differential Revision: https://reviews.llvm.org/D141720
2023-01-17 13:12:27 -08:00
Fangrui Song
d4b6fcb32e [Analysis] llvm::Optional => std::optional 2022-12-14 07:32:24 +00:00
Kazu Hirata
edc83a15b4 [mlgo] Use LLVM_HAVE_TFLITE instead of LLVM_HAVE_TF_API in C++ code (NFC)
We use LLVM_HAVE_TFLITE as the key to enable the mlgo work these days,
and LLVM_HAVE_TF_API is defined whenever LLVM_HAVE_TFLITE is defined.

I'm posting this patch separately because it's purely mechanical.

I'll post a follow-up patch to remove LLVM_HAVE_TF_API in non-C++
files, and that will not be as mechanical as this one.

Differential Revision: https://reviews.llvm.org/D139863
2022-12-12 11:28:40 -08:00
Kazu Hirata
9c444f7021 [llvm] Use std::nullopt instead of None (NFC)
This is part of an effort to migrate from llvm::Optional to
std::optional:

https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716
2022-12-09 18:32:32 -08:00
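An illustrative before/after for the migration these two commits drive (simplified):

```cpp
#include <optional>

// Before:
//   llvm::Optional<int> getCachedCost(bool Hit) {
//     return Hit ? llvm::Optional<int>(42) : llvm::None;
//   }

// After: the std equivalents are drop-in replacements here.
std::optional<int> getCachedCost(bool Hit) {
  return Hit ? std::optional<int>(42) : std::nullopt;
}
```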
Mircea Trofin
1ee3bb17c3 [mlgo][nfc] Make LoggedFeatureSpec an implementation detail
It's an artifact very specific to using TFAgents during training, so it
belongs with ModelUnderTrainingRunner.

Differential Revision: https://reviews.llvm.org/D139031
2022-11-30 15:57:58 -08:00
Mircea Trofin
0cb9746a7d [nfc][mlgo] Separate logger and training-mode model evaluator
This just shuffles implementations and declarations around. Now the
logger and the TF C API-based model evaluator are separate.

Differential Revision: https://reviews.llvm.org/D131116
2022-08-03 16:20:28 -07:00
Mircea Trofin
c35ad9ee4f [mlgo] Support exposing more features than those supported by models
This allows the compiler to support more features than those supported by a
model. The only requirement (development mode only) is that the new
features must be appended at the end of the list of features requested
from the model. The support is transparent to compiler code: for
unsupported features, we provide a valid buffer into which to copy their
values; it's just that this buffer is disconnected from the model, so as
far as the model is concerned (AOT or development mode), these features
don't exist. The buffers are allocated at setup, meaning there is no
extra allocation at steady state (maintaining the current invariant).
These buffers have two roles: first, keep the compiler code simple;
second, allow logging their values in development mode. The latter
allows retraining a model that supports the larger feature set starting
from traces produced with the old model.

For release mode (AOT-ed models), this decouples compiler evolution from
model evolution, which we want in scenarios where the toolchain is
frequently rebuilt and redeployed: we can first deploy the new features,
and continue working with the older model, until a new model is made
available, which can then be picked up the next time the compiler is built.

Differential Revision: https://reviews.llvm.org/D124565
2022-05-09 18:01:21 -07:00
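A simplified sketch of the buffer arrangement described above; the class and its layout are hypothetical stand-ins, not the actual MLModelRunner code:

```cpp
#include <cstddef>
#include <memory>
#include <vector>

// ModelInputs holds the buffers the model actually exposes; Sizes covers the
// full (possibly larger) feature list, with extra features appended at the
// end. Everything is allocated once, at setup.
struct FeatureBuffers {
  std::vector<void *> Ptrs;                          // one per feature
  std::vector<std::unique_ptr<char[]>> OwnedScratch; // extra features only

  FeatureBuffers(const std::vector<void *> &ModelInputs,
                 const std::vector<std::size_t> &Sizes) {
    for (std::size_t I = 0; I < Sizes.size(); ++I) {
      if (I < ModelInputs.size()) {
        Ptrs.push_back(ModelInputs[I]); // backed by the model
      } else {
        // Valid memory the model never reads: it keeps compiler code
        // uniform and lets development mode log these values.
        OwnedScratch.push_back(std::make_unique<char[]>(Sizes[I]));
        Ptrs.push_back(OwnedScratch.back().get());
      }
    }
  }
  void *getBuffer(std::size_t I) { return Ptrs[I]; }
};
```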
serge-sans-paille
ed98c1b376 Cleanup includes: DebugInfo & CodeGen
Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup
Differential Revision: https://reviews.llvm.org/D121332
2022-03-12 17:26:40 +01:00
Jan Svoboda
5f4ae56457 [llvm] Remove uses of std::vector<bool>
LLVM Programmer’s Manual strongly discourages the use of `std::vector<bool>` and suggests `llvm::BitVector` as a possible replacement.

This patch does just that for llvm.

Reviewed By: dexonsmith

Differential Revision: https://reviews.llvm.org/D117121
2022-01-18 18:20:45 +01:00
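An illustrative before/after of the replacement this commit applies across llvm:

```cpp
#include "llvm/ADT/BitVector.h"

// Before: std::vector<bool> Visited(8, false);
// After: llvm::BitVector offers the same bit-set operations without
// std::vector<bool>'s proxy-reference quirks.
void example() {
  llvm::BitVector Visited(8); // 8 bits, all cleared
  Visited.set(3);             // mark element 3
  if (Visited.test(3)) {
    // ... element 3 was visited ...
  }
}
```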
Mircea Trofin
248d55af3e [NFC][MLGO] Use LazyCallGraph::Node to track functions.
This avoids the InlineAdvisor carrying the responsibility of deleting
Function objects. We use LazyCallGraph::Node objects instead, which are
stable in memory for the duration of the module-wide run of CGSCC
passes started under the same ModuleToPostOrderCGSCCPassAdaptor (which
is the case here).

Differential Revision: https://reviews.llvm.org/D116964
2022-01-11 19:23:47 -08:00
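A hedged sketch of the ownership shift: `LazyCallGraph::lookup` and `Node::getFunction` are the real API, while the surrounding code is illustrative:

```cpp
#include "llvm/ADT/DenseSet.h"
#include "llvm/Analysis/LazyCallGraph.h"

using namespace llvm;

// Instead of DenseSet<Function *> (risky once functions get deleted),
// track the graph nodes, which stay stable across the CGSCC walk.
// Assumes F is already part of the graph.
void noteFunction(DenseSet<LazyCallGraph::Node *> &Live, LazyCallGraph &CG,
                  Function &F) {
  Live.insert(CG.lookup(F));
}

// Fetch the Function back on demand.
Function &materialize(LazyCallGraph::Node *N) { return N->getFunction(); }
```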
Mircea Trofin
a120fdd337 [NFC][MLGO]Add RTTI support for MLModelRunner and simplify runner setup 2022-01-04 19:46:14 -08:00
Mircea Trofin
04f2712ef4 [NFC][MLGO] Factor ModelUnderTrainingRunner for reuse
This is so we may reuse it. It was already largely inliner-agnostic.

Differential Revision: https://reviews.llvm.org/D115465
2021-12-10 11:24:15 -08:00
Mircea Trofin
059e03476c [NFC][mlgo] Generalize model runner interface
This prepares it for the regalloc work. Part of that is making model
evaluation across 'development' and 'release' scenarios more reusable.
This patch:
- extends support to tensors of any shape (not just scalars, like we had
in the inliner -Oz case). While the tensor shape can be anything, we
assume row-major layout and expose the tensor as a buffer.
- exposes the NoInferenceModelRunner, which we use in the 'development'
mode to keep the evaluation code path consistent and simplify logging,
as we'll want to reuse it in the regalloc case.

Differential Revision: https://reviews.llvm.org/D115306
2021-12-08 20:10:58 -08:00
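A small worked example of the row-major convention from the first bullet: a tensor of shape {2, 3} is exposed as a flat buffer of six elements, with element (i, j) at offset i * 3 + j.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Row-major flat offset for a multi-dimensional index.
std::size_t flatIndex(const std::vector<std::size_t> &Shape,
                      const std::vector<std::size_t> &Idx) {
  assert(Shape.size() == Idx.size());
  std::size_t Flat = 0;
  for (std::size_t D = 0; D < Shape.size(); ++D)
    Flat = Flat * Shape[D] + Idx[D];
  return Flat;
}

// flatIndex({2, 3}, {1, 2}) == 1 * 3 + 2 == 5
```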
Mircea Trofin
1055c5e1d3 [MLGO] Make sure inliner logs when deleting callees
When using final reward (which is now the default), we were skipping
the logging of decisions that led to callee deletion. This fixes that.

Differential Revision: https://reviews.llvm.org/D108587
2021-08-23 14:54:46 -07:00
Christopher Di Bella
c874dd5362 [llvm][clang][NFC] updates inline licence info
Some files still contained the old University of Illinois Open Source
Licence header. This patch replaces that with the Apache 2 with LLVM
Exception licence.

Differential Revision: https://reviews.llvm.org/D107528
2021-08-11 02:48:53 +00:00
Mircea Trofin
ae1a2a09e4 [NFC][MLGO] Make logging more robust
1) add some self-diagnosis (when asserts are enabled) to check that all
features have the same number of entries

2) avoid storing pointers to mutable fields because the proto API
contract doesn't actually guarantee those stay fixed even if no further
mutation of the object occurs.

Differential Revision: https://reviews.llvm.org/D107594
2021-08-06 04:44:52 -07:00
Mircea Trofin
55e12f7080 [NFC][MLGO] Just use the underlying protobuf object for logging
Avoid buffering just to copy the buffered data, in 'development
mode', when logging. Instead, just populate the underlying protobuf.

Differential Revision: https://reviews.llvm.org/D106592
2021-07-23 10:56:48 -07:00
Mircea Trofin
0d06b14f59 [MLGO] Fix use of AM.invalidate post D100519
The ML inline advisors more aggressively invalidate certain analyses
after each call site inlining, to more accurately capture the problem
state.
2021-04-15 18:45:39 -07:00
Kazu Hirata
a3254904b2 [Analysis] Use llvm::append_range (NFC) 2021-01-22 23:25:01 -08:00
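An illustrative before/after for this cleanup:

```cpp
#include "llvm/ADT/STLExtras.h"
#include <vector>

void example(std::vector<int> &Dst, const std::vector<int> &Src) {
  // Before: Dst.insert(Dst.end(), Src.begin(), Src.end());
  llvm::append_range(Dst, Src); // same effect, clearer intent
}
```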
Mircea Trofin
e8049dc3c8 [NewPM][Inliner] Move the 'always inliner' case in the same CGSCC pass as 'regular' inliner
Expanding from D94808 - we ensure the same InlineAdvisor is used by both
InlinerPass instances. The notion of mandatory inlining is moved into
the core InlineAdvisor: advisors have to handle that case anyway, so
this change also factors it out a bit better.

Differential Revision: https://reviews.llvm.org/D94825
2021-01-15 17:59:38 -08:00
Mircea Trofin
8ab2353a4c [NFC][TFUtils] also include output specs lookup logic in loadOutputSpecs
The lookup logic is also reusable.

Also refactored the API to return the loaded vector; this makes it
clearer what state it is in on error (the vector simply won't be
returned).

Differential Revision: https://reviews.llvm.org/D91759
2020-11-18 21:20:21 -08:00
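A simplified sketch of the API shape described above; OutputSpec and the parsing are stand-ins, and std::optional stands in for whatever error-signaling mechanism the real code uses:

```cpp
#include <fstream>
#include <optional>
#include <string>
#include <vector>

struct OutputSpec { std::string Name; }; // stand-in for the real spec type

// Before (roughly): bool loadOutputSpecs(std::vector<OutputSpec> &Out, ...);
// After: the vector is returned, so an error can't leak a partial fill.
std::optional<std::vector<OutputSpec>>
loadOutputSpecs(const std::string &Path) {
  std::ifstream In(Path);
  if (!In)
    return std::nullopt; // error: nothing is returned at all
  std::vector<OutputSpec> Specs;
  std::string Line;
  while (std::getline(In, Line))
    Specs.push_back({Line});
  return Specs;
}
```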
Mircea Trofin
b51e844f7a [NFC][TFUtils] Extract out the output spec loader
It's generic for the 'development mode', not specific to the inliner
case.

Differential Revision: https://reviews.llvm.org/D91751
2020-11-18 20:03:20 -08:00
Mircea Trofin
ac2018da61 [NFC][MLInliner] Getters should return by reference 2020-10-07 13:55:38 -07:00
Mircea Trofin
36bb1fb1fe [MLInliner] Factor out logging
Factored out the logging facility, to allow its reuse outside the
inliner.

Differential Revision: https://reviews.llvm.org/D88770
2020-10-05 18:09:17 -07:00
Mircea Trofin
8c63df2416 [MLInliner] Support training that doesn't require partial rewards
If we use training algorithms that don't need partial rewards, we don't
need to worry about an ir2native model. In that case, training logs
won't contain a 'delta_size' feature either (since that's the partial
reward).

Differential Revision: https://reviews.llvm.org/D86481
2020-08-24 17:36:29 -07:00
Mircea Trofin
62fc44ca3c [MLInliner] In development mode, obtain the output specs from a file
Different training algorithms may produce models that, besides the main
policy output (i.e. inline/don't inline), produce additional outputs
that are necessary for the next training stage. To facilitate this, in
development mode, we require the training policy infrastructure produce
a description of the outputs that are interesting to it, in the form of
a JSON file. We special-case the first entry in the JSON file as the
inlining decision - we care about its value, so we can guide inlining
during training - but treat the rest as opaque data that we just copy
over to the training log.

Differential Revision: https://reviews.llvm.org/D85674
2020-08-17 16:56:47 -07:00
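A hypothetical illustration of such a JSON file (field names and exact schema are illustrative, not the real format): the first entry is special-cased as the inlining decision, and the remaining entries are copied opaquely to the training log.

```json
[
  { "name": "inlining_decision", "type": "int64_t", "shape": [1] },
  { "name": "discount",          "type": "float",   "shape": [1] },
  { "name": "step_type",         "type": "int64_t", "shape": [1] }
]
```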
Mircea Trofin
211117b660 [NFC][MLInliner] remove curly braces for a few single-line loops 2020-08-10 09:32:21 -07:00
Mircea Trofin
d5c81be3ca [NFC][MLInliner] Set up the logger outside the development mode advisor
This allows us to subsequently configure the logger for the case when we
use a model evaluator and want to log additional outputs.

Differential Revision: https://reviews.llvm.org/D85577
2020-08-10 09:22:17 -07:00
Mircea Trofin
64372d93bc [NFC][MLInliner] Refactor logging implementation
This prepares it for logging externally-specified outputs.

Differential Revision: https://reviews.llvm.org/D85451
2020-08-07 14:56:56 -07:00
Mircea Trofin
87fb7aa137 [llvm][MLInliner] Don't log 'mandatory' events
We don't want mandatory events in the training log. We do want to handle
them, to keep the native size accounting accurate, but that's all.

Fixed the code and also expanded the test to capture this.

Differential Revision: https://reviews.llvm.org/D85373
2020-08-06 09:04:15 -07:00
Mircea Trofin
65b6dbf939 [llvm][NFC] Moved implementation of TrainingLogger outside of its decl
Also renamed a method, printTensor, to print, and added comments.
2020-08-04 14:35:35 -07:00
Mircea Trofin
71059257bd [llvm][NFC] TensorSpec abstraction for ML evaluator
Further abstracting the specification of a tensor, to more easily
support different types and shapes of tensor, and also to perform
initialization up-front, at TFModelEvaluator construction time.

Differential Revision: https://reviews.llvm.org/D84685
2020-07-29 16:29:21 -07:00
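A sketch of what the abstraction provides, using the TensorSpec API as it exists in-tree today (at the time of this commit the type lived in a different header):

```cpp
#include "llvm/Analysis/TensorSpec.h"

using namespace llvm;

void example() {
  // A 2x3 tensor of int64_t named "edge_count": the evaluator can derive
  // the element count and byte size from the spec instead of hardcoding
  // them per feature.
  TensorSpec Spec = TensorSpec::createSpec<int64_t>("edge_count", {2, 3});
  size_t NumElts = Spec.getElementCount(); // 6
  (void)NumElts;
}
```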
Nico Weber
4fe912f186 Build: Move TF source file inclusion from build system to source files
Outside of compiler-rt (where it's arguably an anti-pattern too),
LLVM tries to keep its build files as simple as possible. See e.g.
llvm/docs/SupportLibrary.rst, "Code Organization".

Differential Revision: https://reviews.llvm.org/D84243
2020-07-21 13:02:34 -04:00
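A minimal illustration of the pattern: the source file guards itself, so the build file lists it unconditionally and the file compiles to an empty translation unit when TF support is off.

```cpp
// Simplified illustration of a self-guarding TF-dependent source file.
#if defined(LLVM_HAVE_TF_API)

// ... all TensorFlow-dependent code lives inside the guard ...

#endif // LLVM_HAVE_TF_API
```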
Mircea Trofin
70f8d0ac8a [llvm] Development-mode InlineAdvisor
Summary:
This is the InlineAdvisor used in 'development' mode. It enables two
scenarios:

 - loading models via a command-line parameter, thus allowing for rapid
 training iteration, where models can be used for the next exploration
 phase without requiring recompilation of the compiler. This trades off
 some compilation speed for the added flexibility.

 - collecting training logs, in the form of tensorflow.SequenceExample
 protobufs. We generate these as textual protobufs, which simplifies
 generation and testing. The protobufs may then be readily consumed by a
 tensorflow-based training algorithm.

To speed up training, training logs may also be collected from the
'default' training policy. In that case, this InlineAdvisor does not
use a model.

RFC: http://lists.llvm.org/pipermail/llvm-dev/2020-April/140763.html

Reviewers: jdoerfert, davidxl

Subscribers: mgorny, hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D83733
2020-07-20 11:01:56 -07:00