Sanitise: Resolve free range indices when resolving associates #455

mlange05 · 2024-12-02T13:23:44Z

The current Associate resolution does not account for array index ranges if they are specific in the associate clauses (index remapping). This PR adds a partial resolution step to ResolveAssociates, which resolves free index ranges (:) in association clause against explicit indices from the original expression, should should both exist. The respective changes in the expression mapper require some care with respect to early termination and recursion into .dimension and .type properties of the given array expression.

Note: It does not resolve index shifts, where offsets are introduced by using bounded ranges (eg. 3:8). Instead it warns about such (ab)use.

github-actions · 2024-12-02T13:27:04Z

Documentation for this branch can be viewed at https://sites.ecmwf.int/docs/loki/455/index.html

This is subtle, but to avoid false positives on the index-range matching, we need to terminate early if the symbol scope is not an association. However, to get this right, we still need to recurse over prior expression dimensions and type symbols.

Also slightly adjust the simply associate resolver test case to cover this basic use case.

codecov · 2024-12-06T04:46:46Z

Codecov Report

Attention: Patch coverage is 98.00000% with 1 line in your changes missing coverage. Please review.

Project coverage is 93.29%. Comparing base (e9760e8) to head (e350f50).
Report is 19 commits behind head on main.

Files with missing lines	Patch %	Lines
loki/transformations/sanitise/associates.py	95.23%	1 Missing ⚠️

Additional details and impacted files

@@           Coverage Diff           @@
##             main     #455   +/-   ##
=======================================
  Coverage   93.28%   93.29%           
=======================================
  Files         220      220           
  Lines       41224    41269   +45     
=======================================
+ Hits        38457    38502   +45     
  Misses       2767     2767

Flag	Coverage Δ
lint_rules	`96.39% <ø> (ø)`
loki	`93.25% <98.00%> (+<0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

reuterbal

Thanks for this! I have some questions about the recursion and potentially a missing test but not sure on either, so marking this as a comment only for now

reuterbal · 2024-12-06T11:54:25Z

loki/transformations/sanitise/associates.py

+        expr_dims = self.rec(expr.dimensions, *args, **kwargs)

        # Recurse over the type's shape
        _type = expr.type
        if expr.type.shape:
            new_shape = self.rec(expr.type.shape, *args, **kwargs)
            _type = expr.type.clone(shape=new_shape)

+        # Stop if scope is not an associate
+        if not isinstance(expr.scope, ir.Associate):
+            return expr.clone(dimensions=expr_dims, type=_type)
+
+        new = self.map_scalar(expr, *args, **kwargs)
+
        # Recurse over array dimensions
-        new_dims = self.rec(expr.dimensions, *args, **kwargs)
+        if isinstance(new, sym.Array) and new.dimensions:
+            # Resolve unbound range symbols form existing indices
+            new_dims = self.rec(new.dimensions, *args, **kwargs)
+            new_dims = self._match_range_indices(new_dims, expr_dims)
+        else:
+            new_dims = expr_dims
+


I'm not sure I'm following the logic here. The first recursion to expr_dims seems to only be used if the array's scope isn't an associate - could we not move it into the conditional just before the return statement?

Or alternatively, in which situation do we expect the return value of the recursion on new.dimensions to differ from the first recursion? Could we maybe use the previous recursion value here?

I suspect the second wouldn't work but the first should allow us to save some recursion maybe?

Ah, yes, this was quite fiddly. The initial recursion is required for resolving associations in the dimensions of arrays that are not actually scope to the Associate; that's also why it has to happen before the short-circuiting. For example, consider

associate(a => b) some%obj(i, :, a) = ... end associate

The second recursion is then done on the potentially replaced symbol (new), which now might have entirely different .dimensions. For example consider

associate(a => b(:, : i)) a = 42.0 ! or even a(j, k) = 66.6 end associate

In the first line of this example expr == 'a' and expr.dimensions == None and new.dimensions == (:, :, i), the latter of which we have not recursed into yet. In the second example line new.dimensions then also needs to be matched against (j, k) to yield b(j, k, i). Does that make sense?

EDIT: Scratch this, I'm seeing my error 🤦

Thanks, I suspected something along these lines.

My question was mostly on the grounds of avoiding redundant recursion, purely judged from the dependencies within this method. ~~Something like this:~~

# Recurse over the type's shape _type = expr.type if _type.shape: new_shape = self.rec(_type.shape, *args, **kwargs) _type = expr.type.clone(shape=new_shape) # Stop if scope is not an associate, but still recurse on the dimensions # in case an associated symbol is in there if not isinstance(expr.scope, ir.Associate): expr_dims = self.rec(expr.dimensions, *args, **kwargs) return expr.clone(dimensions=expr_dims, type=_type) # For arrays that represent an associate symbol, re-use the # scalar implementation to resolve the symbol new = self.map_scalar(expr, *args, **kwargs) # Recurse over array dimensions if isinstance(new, sym.Array) and new.dimensions: # Resolve unbound range symbols form existing indices new_dims = self.rec(new.dimensions, *args, **kwargs) new_dims = self._match_range_indices(new_dims, expr_dims) else: new_dims = expr_dims return new.clone(dimensions=new_dims, type=_type)

~~This should save the need to do the recursion on the dimensions for symbols from the associate since you're recursing on the new_dims anyway~~

reuterbal · 2024-12-06T11:57:13Z

loki/transformations/sanitise/tests/test_associates.py

+  associate (a => arr2d(:, 1), b=>arr2d(:, idx_a) )
+    b(:) = 42.0
+    do i=1, 5
+      a(i) = b(i+2)
+      call another_routine(i, a(2:4), b)
+    end do
+  end associate


Do we also have a test that covers the reverse situation, where a dimension expression contains an associated symbol but the array is defined outside?

Something like:

real, intent(inout) :: arr2d(:,:) associate(var_idx=>some%type%idx) do i=1,5 arr2d(i, var_idx) = 1.0 end do end associate

Yes, agreed, good point! I've extended the array slicing test to cover this example.

When using dict-mapping to match symbols, the range keys might be `:`, which alias and mean we'd miss susequent `:` matches. The test has been updated accordingly.

mlange05 · 2024-12-11T12:55:47Z

Hi @reuterbal, thanks for the review and apologies for the delay. In addition to the things you flagged, I've independently found another bug in this, which I've now fixed and extended the slice array test for. This pertains to the matching of ':' when matching the free range symbols in the utility method of the mapper class. Please have another look. 🙏

reuterbal

Many thanks for the additional testing and explanations. I was a bit slow but finally got the double recursion... (scratch my comment)

mlange05 requested a review from reuterbal December 2, 2024 13:23

mlange05 force-pushed the naml-resolve-assoc-array-indices branch from 88e014b to 6874999 Compare December 2, 2024 13:24

mlange05 added 5 commits December 6, 2024 04:17

Sanitise: Add test for matching range indices in array expressions

f4b9036

Sanitise: Add utility method to match indices to free range symbols

6618203

Sanitise: Apply partial range resolution when resolving associates

2f71142

Sanitise: Only apply partial range resolution if new symbols has dims

c055d93

Also slightly adjust the simply associate resolver test case to cover this basic use case.

mlange05 force-pushed the naml-resolve-assoc-array-indices branch from 6874999 to c055d93 Compare December 6, 2024 04:18

mlange05 marked this pull request as ready for review December 6, 2024 04:54

reuterbal reviewed Dec 6, 2024

View reviewed changes

mlange05 added 2 commits December 11, 2024 10:52

Sanitise: Use iterator, when matching free symbols in assoc resolve

bed1ae8

When using dict-mapping to match symbols, the range keys might be `:`, which alias and mean we'd miss susequent `:` matches. The test has been updated accordingly.

Sanitise: Add recursion + symbol-matching to array slice test

e350f50

reuterbal approved these changes Dec 12, 2024

View reviewed changes

reuterbal added the ready for merge This PR has been approved and is ready to be merged label Dec 12, 2024

reuterbal merged commit 64b0bbf into main Dec 13, 2024
13 checks passed

reuterbal deleted the naml-resolve-assoc-array-indices branch December 13, 2024 13:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sanitise: Resolve free range indices when resolving associates #455

Sanitise: Resolve free range indices when resolving associates #455

mlange05 commented Dec 2, 2024

github-actions bot commented Dec 2, 2024

codecov bot commented Dec 6, 2024 •

edited

Loading

reuterbal left a comment

reuterbal Dec 6, 2024

mlange05 Dec 11, 2024

reuterbal Dec 12, 2024 •

edited

Loading

reuterbal Dec 6, 2024

mlange05 Dec 11, 2024

mlange05 commented Dec 11, 2024

reuterbal left a comment

Sanitise: Resolve free range indices when resolving associates #455

Sanitise: Resolve free range indices when resolving associates #455

Conversation

mlange05 commented Dec 2, 2024

github-actions bot commented Dec 2, 2024

codecov bot commented Dec 6, 2024 • edited Loading

Codecov Report

reuterbal left a comment

Choose a reason for hiding this comment

reuterbal Dec 6, 2024

Choose a reason for hiding this comment

mlange05 Dec 11, 2024

Choose a reason for hiding this comment

reuterbal Dec 12, 2024 • edited Loading

Choose a reason for hiding this comment

reuterbal Dec 6, 2024

Choose a reason for hiding this comment

mlange05 Dec 11, 2024

Choose a reason for hiding this comment

mlange05 commented Dec 11, 2024

reuterbal left a comment

Choose a reason for hiding this comment

codecov bot commented Dec 6, 2024 •

edited

Loading

reuterbal Dec 12, 2024 •

edited

Loading