llvm-project

Author	SHA1	Message	Date
martinboehme	526c9b7e37	[clang][nullability] Use `proves()` and `assume()` instead of deprecated synonyms. (#70297 )	2023-10-30 13:18:57 +01:00
martinboehme	14b039c1dd	[clang][dataflow] Remove `declToLocConsistent()` assertion. (#69819 ) As described [here](https://discourse.llvm.org/t/70086/6), there are legitimate non-bug scenarios where two `DeclToLoc` maps to be joined contain different storage locations for the same declaration. This patch also adds a test containing an example of such a situation. (The test fails without the other changes in this patch.) With the assertion removed, the existing logic in `intersectDenseMaps()` will remove the corresponding declaration from the joined DeclToLoc map. We also remove `removeDecl()`'s precondition (that the declaration must be associated with a storage location) because this may no longer hold if the declaration was previously removed during a join, as described above.	2023-10-24 08:42:30 +02:00
Sam McCall	7338eb561c	Reapply "[dataflow] use true/false literals in formulas, rather than variables" This reverts commit 3353f7dd3d91c9b2b6a15ba9229bee53e0cb8196. Fixed test bug (unspecified order of arg evaluation)	2023-10-19 11:34:08 +02:00
Yitzhak Mandelbaum	342dca7528	[clang][dataflow] Check for backedges directly (instead of loop statements). (#68923 ) Widen on backedge nodes, instead of nodes with a loop statement as terminator. This fixes #67834 and a precision loss from assignment in a loop condition. The commit contains tests for both of these issues.	2023-10-16 14:07:16 -04:00
Stanislav Gatev	52d0696355	[clang][dataflow] Add support for lambda captures (#68558 ) This adds support for copy, ref, and this lambda captures to the core framework and also adds relevant tests in UncheckedOptionalAccessTest.	2023-10-11 22:18:46 +02:00
martinboehme	834cb919b3	[clang][dataflow] Remove declarations from `DeclToLoc` when their lifetime ends. (#67300 ) After https://reviews.llvm.org/D153273, we're now able to use `CFGLifetimeEnds` together with the other CFG options we use.	2023-09-26 08:41:09 +02:00
Douglas Yung	3353f7dd3d	Revert "[dataflow] use true/false literals in formulas, rather than variables" This reverts commit 36bd5bd888f193b70abf43a09bb4fc04cd2a2ff1. This change is causing a test failure on several build bots: - https://lab.llvm.org/buildbot/#/builders/139/builds/50255 - https://lab.llvm.org/buildbot/#/builders/216/builds/27735 - https://lab.llvm.org/buildbot/#/builders/247/builds/9334	2023-09-22 11:43:27 -07:00
Sam McCall	36bd5bd888	[dataflow] use true/false literals in formulas, rather than variables And simplify formulas containing true/false It's unclear to me how useful this is, it does make formulas more conveniently self-contained now (we can usefully print them without carrying around the "true/false" labels) (while here, simplify !!X to X, too) Differential Revision: https://reviews.llvm.org/D153485	2023-09-22 17:12:20 +02:00
martinboehme	1d7b59ca8d	[clang][dataflow] Fix two null pointer dereferences in `getMemberForAccessor()`. (#66742 ) The additions to the test trigger crashes without the fixes.	2023-09-19 09:03:20 +02:00
Kinuko Yasuda	03be486ecc	[clang][dataflow] Model the fields that are accessed via inline accessors (#66368 ) So that the values that are accessed via such accessors can be analyzed as a limited version of context-sensitive analysis. We can potentially do this only when some option is set, but doing additional modeling like this won't be expensive and intrusive, so we do it by default for now.	2023-09-18 10:46:36 +02:00
martinboehme	0069004856	[clang][dataflow] Add a test for context-sensitive analysis on a self-referential class. (#66359 ) The test demonstrates that the `this` pointer seen in the constructor has the same value as the address of the variable the object is constructed into.	2023-09-15 14:31:10 +02:00
Kinuko Yasuda	0612c9b09a	[clang][dataflow] Ignore assignment where base class's operator is used (#66364 ) In C++ it seems it is legit to use base class's operator (e.g. `using Base::operator=`) to perform copy if the base class is the common ancestor of the source and destination object. In such a case we shouldn't try to access fields beyond that of the base class, however such a case seems to be very rare (typical code would implement a copy constructor instead), and could add complexities, so in this patch we simply bail if the method operator's parent class is different from the type of the destination object that this framework recognizes.	2023-09-14 20:45:56 +02:00
martinboehme	e65e94fddc	[clang][dataflow] Rename test target function to `target()`. (#66195 ) Otherwise, the test doesn't actually do anything.	2023-09-13 15:07:44 +02:00
martinboehme	7cf20f156f	[clang][dataflow] Eliminate `RecordValue::getChild()`. (#65586 ) We want to eliminate the `RecordStorageLocation` from `RecordValue` and, ultimately, eliminate `RecordValue` entirely (see the discussion linked in the `RecordValue` class comment). This is one step in that direction. To eliminate `RecordValue::getChild()`, we also eliminate the last remaining caller, namely the `getFieldValue(const RecordValue , ...)` overload. Calls to this overload have been rewritten to use the `getFieldValue(const RecordStorageLocation , ...)` overload. Note that this also makes the code slightly simpler in many cases.	2023-09-12 09:17:38 +02:00
Tianlan Zhou	057564fec5	Fix some typos in comments: evalute -> evaluate (NFC) (#65906 )	2023-09-11 04:11:06 +08:00
Kinuko Yasuda	8e1d2f2f12	[clang][dataflow] Don't crash when BlockToState is called from unreachable path (#65732 ) When we call `getEnvironment`, `BlockToState[BlockId]` for the block can return null even if CFCtx.isBlockReachable(B) returns true if it is called from a particular block that is marked unreachable to the block.	2023-09-08 10:24:08 -04:00
Yitzhak Mandelbaum	80f0dc3aa4	[clang][dataflow] Unsoundly treat "Unknown" as "Equivalent" in widening. This change makes widening act the same as equivalence checking. When the analysis does not provide an answer regarding the equivalence of two distinct values, the framework treats them as equivalent. This is an unsound choice that enables convergence. Differential Revision: https://reviews.llvm.org/D159355	2023-09-07 19:06:35 +00:00
Kinuko Yasuda	f9026cfb76	[clang][dataflow] Fix Record initialization with InitListExpr and inheritances Usually RecordValues for record objects (e.g. struct) are initialized with `Environment::createValue()` which internally calls `getObjectFields()` to collects all fields from the current and base classes, and then filter them with `ModeledValues` via `DACtx::getModeledFields()` so that the fields that are actually referenced are modeled. The consistent set of fields should be initialized when a record is initialized with an initializer list (InitListExpr), however the existing code's behavior was different. Before this patch: * When a struct is initialized with InitListExpr, its fields are initialized based on what is returned by `getFieldsForInitListExpr()`, which only collects the direct fields in the current class, but not from the base classes. Moreover, if the base classes have their own InitListExpr, values that are initialized by their InitListExpr's weren't merged into the child objects. After this patch: * When a struct is initialized with InitListExpr, it collects and merges the fields in the base classes that were initialized by their InitListExpr's. The code also asserts that the consistent set of fields are initialized with the ModeledFields. Reviewed By: mboehme Differential Revision: https://reviews.llvm.org/D159284	2023-09-07 07:37:50 +00:00
martinboehme	c0703eaec1	[clang][dataflow] Emit an error if source code is not compiled as C++. (#65301 ) The shape of certain elements of the AST can vary depending on the langugage. We currently only support C++.	2023-09-06 10:02:21 +02:00
martinboehme	f470c361d9	[clang][dataflow] Eliminate uses of `RecordValue::getChild()`. (#65329 ) We want to work towards eliminating the `RecordStorageLocation` from `RecordValue`. These particular uses of `RecordValue::getChild()` can simply be replaced with `RecordStorageLocation::getChild()`.	2023-09-06 09:43:05 +02:00
Martin Braenne	37458c66bf	[clang][dataflow] Eliminate deprecated `DataflowAnalysis` constructor. Reviewed By: ymandel, xazax.hun Differential Revision: https://reviews.llvm.org/D159261	2023-09-04 07:00:11 +00:00
Kai Luo	09ccc5563e	Fix [-Werror,-Wsign-compare] error. NFC.	2023-08-24 07:56:43 +00:00
Martin Braenne	4866a6e1d3	[clang][dataflow] Produce pointer values for callees of member operator calls. Calls to member operators are a special case in that their callees have pointer type. The callees of non-operator non-static member functions are not pointers. See the comments in the code for details. This issue came up in the Crubit nullability check; the fact that we weren't modeling the `PointerValue` caused non-convergence. Reviewed By: ymandel, xazax.hun Differential Revision: https://reviews.llvm.org/D158592	2023-08-24 07:12:14 +00:00
Martin Braenne	a1a63d68a4	[clang][dataflow] Add two repros for non-convergence involving pointers in loops. These are broken out from https://reviews.llvm.org/D156658, which it now seems obvious isn't the right way to solve the non-convergence. Instead, my plan is to address the non-convergence through pointer value widening, but the exact way this should be implemented is TBD. In the meantime, I think there's value in getting these repros submitted to record the current undesirable behavior. Reviewed By: ymandel, xazax.hun Differential Revision: https://reviews.llvm.org/D158513	2023-08-23 07:03:16 +00:00
Martin Braenne	9ecdbe3855	[clang][dataflow] Rename `AggregateStorageLocation` to `RecordStorageLocation` and `StructValue` to `RecordValue`. - Both of these constructs are used to represent structs, classes, and unions; Clang uses the collective term "record" for these. - The term "aggregate" in `AggregateStorageLocation` implies that, at some point, the intention may have been to use it also for arrays, but it don't think it's possible to use it for arrays. Records and arrays are very different and therefore need to be modeled differently. Records have a fixed set of named fields, which can have different type; arrays have a variable number of elements, but they all have the same type. - Futhermore, "aggregate" has a very specific meaning in C++ (https://en.cppreference.com/w/cpp/language/aggregate_initialization). Aggregates of class type may not have any user-declared or inherited constructors, no private or protected non-static data members, no virtual member functions, and so on, but we use `AggregateStorageLocations` to model all objects of class type. In addition, for consistency, we also rename the following: - `getAggregateLoc()` (in `RecordValue`, formerly known as `StructValue`) to simply `getLoc()`. - `refreshStructValue()` to `refreshRecordValue()` We keep the old names around as deprecated synonyms to enable clients to be migrated to the new names. Reviewed By: ymandel, xazax.hun Differential Revision: https://reviews.llvm.org/D156788	2023-08-01 20:29:40 +00:00
Martin Braenne	e95134b9cb	[clang][dataflow] Reverse course on `getValue()` deprecation. In the [value categories RFC](https://discourse.llvm.org/t/70086), I proposed that the end state of the migration should be that `getValue()` should only be legal to call on prvalues. As a stepping stone, to allow migrating off existing calls to `getValue()`, I proposed introducing `getValueStrict()`, which would already have the new semantics. However, I've now reconsidered this. Any expression, whether prvalue or glvalue, has a value, so really there isn't any reason to forbid calling `getValue()` on glvalues. I'm therefore removing the deprecation from `getValue()` and transitioning existing `getValueStrict()` calls back to `getValue()`. The other "strict" accessors are a different case. `setValueStrict()` should only be called on prvalues because glvalues need to have a storage location associated with them; it doesn't make sense to only set a value for them. And, of course, `getStorageLocationStrict()` and `setStorageLocationStrict()` should obviously only be called on glvalues because prvalues don't have storage locations. Reviewed By: ymandel, xazax.hun Differential Revision: https://reviews.llvm.org/D155921	2023-07-27 13:14:49 +00:00
Martin Braenne	e6e83cbcc7	[clang][dataflow] Don't crash when constructing an array of records. When I wrote https://reviews.llvm.org/D155446, I assumed that a `CXXConstructExpr` would always have record type, but this isn't true: It can have array type when constructing an array of records. The code would crash in this situation because `createValue()` would return null. This patch includes a test that reproduces the crash without the other changes in the patch. Reviewed By: sammccall Differential Revision: https://reviews.llvm.org/D156402	2023-07-27 12:46:13 +00:00
Paul Semel	bc37893433	[clang][dataflow] fix bug for transparent ListInitExpr handling This fixes the handling of "transparent" ListInitExpr, when they're only used as a copy constructor for records. Without the fix, the two tests are crashing the process.	2023-07-26 08:50:28 +00:00
Martin Braenne	c3cf630d80	[clang][dataflow] Remove checks that test for consistency between `StructValue` and `AggregateStorageLocation`. Now that the redundancy between these two classes has been eliminated, these checks aren't needed any more. Reviewed By: ymandel, xazax.hun Differential Revision: https://reviews.llvm.org/D155813	2023-07-24 13:20:04 +00:00
Martin Braenne	44f98d0101	[clang][dataflow] Eliminate duplication between `AggregateStorageLocation` and `StructValue`. After this change, `StructValue` is just a wrapper for an `AggregateStorageLocation`. For the wider context, see https://discourse.llvm.org/t/70086. ## How to review - Start by looking at the comments added / changed in Value.h, StorageLocation.h, and DataflowEnvironment.h. This will give you a good overview of the semantic changes. - Look at the corresponding .cpp files that implement the semantic changes. - Transfer.cpp, TypeErasedDataflowAnalysis.cpp, and RecordOps.cpp show how the core of the framework is affected by the semantic changes. - UncheckedOptionalAccessModel.cpp shows how this complex model is affected by the changes. - Many of the changes in the rest of the patch are mechanical in nature. Reviewed By: ymandel, xazax.hun Differential Revision: https://reviews.llvm.org/D155446	2023-07-24 13:20:01 +00:00
Martin Braenne	6236bf5341	[clang][dataflow] Strengthen flow condition assertions. Instead of asserting merely that the flow condition doesn't imply that a variable is true, make the stronger assertion that the flow condition implies that the variable is false. Reviewed By: ymandel, xazax.hun Differential Revision: https://reviews.llvm.org/D155067	2023-07-18 13:41:43 +00:00
Martin Braenne	4782597e3c	[clang][dataflow] Add a test for not explicitly initialized fields in aggregate initialization. Reviewed By: ymandel Differential Revision: https://reviews.llvm.org/D155074	2023-07-17 07:26:08 +00:00
Sam McCall	6272226b9f	[dataflow] Remove deprecated BoolValue flow condition accessors Use the Formula versions instead, now. Differential Revision: https://reviews.llvm.org/D155152	2023-07-13 09:39:23 +02:00
Martin Braenne	bd9b57de4f	[clang][dataflow] Fix initializing a reference field with an `InitListExpr`. I added a test for this as the ongoing migration to strict handling of value categories (see https://discourse.llvm.org/t/70086) will change the code that handles this case. It turns out we already didn't handle this correctly, so I fixed the existing implementation. Depends On D154961 Reviewed By: xazax.hun Differential Revision: https://reviews.llvm.org/D154965	2023-07-12 04:52:30 +00:00
Martin Braenne	b47bdcbc72	[clang][dataflow] Include fields initialized in an `InitListExpr` in `getModeledFields()`. Previously, we were including these fields only in the specific instance that was initialized by the `InitListExpr`, but not in other instances of the same type. This is inconsistent and error-prone. Depends On D154952 Reviewed By: xazax.hun, gribozavr2 Differential Revision: https://reviews.llvm.org/D154961	2023-07-12 04:52:29 +00:00
Martin Braenne	103a0fc084	[clang][dataflow] Use `getFieldValue()` in TransferTest.cpp. For context, see https://reviews.llvm.org/D154935. Depends On D154935 Reviewed By: ymandel, xazax.hun Differential Revision: https://reviews.llvm.org/D154949	2023-07-12 04:52:25 +00:00
Sam McCall	fc9821a877	Reland "[dataflow] Add dedicated representation of boolean formulas" This reverts commit 7a72ce98224be76d9328e65eee472381f7c8e7fe. Test problems were due to unspecified order of function arg evaluation. Reland "[dataflow] Replace most BoolValue subclasses with references to Formula (and AtomicBoolValue => Atom and BoolValue => Formula where appropriate)" This properly frees the Value hierarchy from managing boolean formulas. We still distinguish AtomicBoolValue; this type is used in client code. However we expect to convert such uses to BoolValue (where the distinction is not needed) or Atom (where atomic identity is intended), and then fold AtomicBoolValue into FormulaBoolValue. We also distinguish TopBoolValue; this has distinct rules for widen/join/equivalence, and top-ness is not represented in Formula. It'd be nice to find a cleaner representation (e.g. the absence of a formula), but no immediate plans. For now, BoolValues with the same Formula are deduplicated. This doesn't seem desirable, as Values are mutable by their creators (properties). We can probably drop this for FormulaBoolValue immediately (not in this patch, to isolate changes). For AtomicBoolValue we first need to update clients to stop using value pointers for atom identity. The data structures around flow conditions are updated: - flow condition tokens are Atom, rather than AtomicBoolValue* - conditions are Formula, rather than BoolValue Most APIs were changed directly, some with many clients had a new version added and the existing one deprecated. The factories for BoolValues in Environment keep their existing signatures for now (e.g. makeOr(BoolValue, BoolValue) => BoolValue) and are not deprecated. These have very many clients and finding the most ergonomic API & migration path still needs some thought. Differential Revision: https://reviews.llvm.org/D153469	2023-07-07 11:58:33 +02:00
Martin Braenne	ca01be54c1	[clang][dataflow] Bug fix: `BuiltinFnToFnPtr` cast does not produce a pointer. See comments in the code for details. Reviewed By: xazax.hun Differential Revision: https://reviews.llvm.org/D154479	2023-07-06 06:56:02 +00:00
Sam McCall	2d8cd19512	Revert "Reland "[dataflow] Add dedicated representation of boolean formulas" and followups These changes are OK, but they break downstream stuff that needs more time to adapt :-( This reverts commit 71579569f4399d3cf6bc618dcd449b5388d749cc. This reverts commit 5e4ad816bf100b0325ed45ab1cfea18deb3ab3d1. This reverts commit 1c3ac8dfa16c42a631968aadd0396cfe7f7778e0.	2023-07-05 14:32:25 +02:00
Sam McCall	71579569f4	[dataflow] use true/false literals in formulas, rather than variables And simplify formulas containing true/false It's unclear to me how useful this is, it does make formulas more conveniently self-contained now (we can usefully print them without carrying around the "true/false" labels) (while here, simplify !!X to X, too) Differential Revision: https://reviews.llvm.org/D153485	2023-07-05 14:06:48 +02:00
Sam McCall	5e4ad816bf	[dataflow] Replace most BoolValue subclasses with references to Formula (and AtomicBoolValue => Atom and BoolValue => Formula where appropriate) This properly frees the Value hierarchy from managing boolean formulas. We still distinguish AtomicBoolValue; this type is used in client code. However we expect to convert such uses to BoolValue (where the distinction is not needed) or Atom (where atomic identity is intended), and then fold AtomicBoolValue into FormulaBoolValue. We also distinguish TopBoolValue; this has distinct rules for widen/join/equivalence, and top-ness is not represented in Formula. It'd be nice to find a cleaner representation (e.g. the absence of a formula), but no immediate plans. For now, BoolValues with the same Formula are deduplicated. This doesn't seem desirable, as Values are mutable by their creators (properties). We can probably drop this for FormulaBoolValue immediately (not in this patch, to isolate changes). For AtomicBoolValue we first need to update clients to stop using value pointers for atom identity. The data structures around flow conditions are updated: - flow condition tokens are Atom, rather than AtomicBoolValue* - conditions are Formula, rather than BoolValue Most APIs were changed directly, some with many clients had a new version added and the existing one deprecated. The factories for BoolValues in Environment keep their existing signatures for now (e.g. makeOr(BoolValue, BoolValue) => BoolValue) and are not deprecated. These have very many clients and finding the most ergonomic API & migration path still needs some thought. Differential Revision: https://reviews.llvm.org/D153469	2023-07-05 13:54:32 +02:00
Martin Braenne	d0be47c51c	[clang][dataflow] Make `runDataflowReturnError()` a non-template function. It turns out this didn't need to be a template at all. Likewise, change callers to they're non-template functions. Also, correct / clarify some comments in RecordOps.h. This is in response to post-commit comments on https://reviews.llvm.org/D153006. Reviewed By: gribozavr2 Differential Revision: https://reviews.llvm.org/D154339	2023-07-04 12:44:49 +00:00
Martin Braenne	880f306226	[clang][dataflow] Add a test for a struct that is directly self-referential through a reference. The ongoing migration to strict handling of value categories (see https://discourse.llvm.org/t/70086) will change the way we handle fields of reference type, and I want to put a test in place that makes sure we continue to handle this special case correctly. Depends On D154420 Reviewed By: gribozavr2, xazax.hun Differential Revision: https://reviews.llvm.org/D154421	2023-07-04 12:06:13 +00:00
Martin Braenne	1e7329cd79	[clang][dataflow] Model variables / fields / funcs used in default initializers. The newly added test fails without the other changes in this patch. Reviewed By: sammccall, gribozavr2 Differential Revision: https://reviews.llvm.org/D154420	2023-07-04 12:06:10 +00:00
Martin Braenne	1d7f9ce61f	[clang][dataflow] Don't crash when creating pointers to members. The newly added tests crash without the other changes in this patch. Reviewed By: sammccall, xazax.hun, gribozavr2 Differential Revision: https://reviews.llvm.org/D153960	2023-06-29 07:12:55 +00:00
Martin Braenne	74d8455ba6	[clang][dataflow] Make `getThisPointeeStorageLocation()` return an `AggregateStorageLocation`. This avoids the need for casts at callsites. Depends On D153852 Reviewed By: sammccall, xazax.hun, gribozavr2 Differential Revision: https://reviews.llvm.org/D153854	2023-06-29 04:07:08 +00:00
Martin Braenne	d36324866e	[clang][dataflow] Initialize fields of anonymous records correctly. Previously, the newly added test would crash. Depends On D153851 Reviewed By: gribozavr2 Differential Revision: https://reviews.llvm.org/D153852	2023-06-29 04:07:04 +00:00
Martin Braenne	abc8367413	[clang][dataflow] Don't crash if copy constructor arg doesn't have a storage location. I accidentally used `cast` instead of `cast_or_null`. Reviewed By: sammccall, xazax.hun Differential Revision: https://reviews.llvm.org/D153956	2023-06-28 11:15:52 +00:00
Martin Braenne	d04b198973	[clang][dataflow] Add a test that we can access fields of anonymous records. Reviewed By: sammccall, ymandel, gribozavr2, xazax.hun Differential Revision: https://reviews.llvm.org/D153409	2023-06-28 09:12:03 +00:00
Martin Braenne	f2123af1e7	[clang][dataflow] Perform deep copies in copy and move operations. This serves two purposes: - Because, today, we only copy the `StructValue`, modifying the destination of the copy also modifies the source. This is demonstrated by the new checks added to `CopyConstructor` and `MoveConstructor`, which fail without the deep copy. - It lays the groundwork for eliminating the redundancy between `AggregateStorageLocation` and `StructValue`, which will happen as part of the ongoing migration to strict handling of value categories (seeo https://discourse.llvm.org/t/70086 for details). This will involve turning `StructValue` into essentially just a wrapper for `AggregateStorageLocation`; under this scheme, the current "shallow" copy (copying a `StructValue` from one `AggregateStorageLocation` to another) will no longer be possible. Because we now perform deep copies, tests need to perform a deep equality comparison instead of just comparing for equal identity of the `StructValue`s. The new function `recordsEqual()` provides such a deep equality comparison. Reviewed By: xazax.hun Differential Revision: https://reviews.llvm.org/D153006	2023-06-26 13:52:56 +00:00

1 2 3

150 Commits