Optimize AngleMask #3072

SirYwell · 2025-01-10T08:51:45Z

Overview

Description

AngleMask caused unnecessarily slow block lookups due to its access patterns. That can be improved. I'll add some inline comments about specific changes.

Submitter Checklist

Give feedback

Make sure you are opening from a topic branch (/feature/fix/docs/ branch (right side)) and not your main branch.
Ensure that the pull request title represents the desired changelog entry.
New public fields and methods are annotated with @since TODO.
I read and followed the contribution guidelines.
Options

...t-core/src/main/java/com/fastasyncworldedit/core/queue/implementation/blocks/CharBlocks.java

worldedit-core/src/main/java/com/fastasyncworldedit/core/function/mask/AngleMask.java

dordsor21 · 2025-01-11T11:11:00Z

worldedit-core/src/main/java/com/fastasyncworldedit/core/function/mask/AngleMask.java

            return true;
        }
-        if (!mask.test(extent, mutable.setComponents(x, y, z + 1))) {
+        // other positions might be in different chunks, go a slower, cached path there


Potentially worth coming back to this after the merge of this and #3051 - if the cache performance is still poor then probably worth testing if x/z are actually on the edges of the chunk. Maybe this could be implemented into the cache mask constructor - we can potentially look at ways of compressing the data further.

dordsor21 · 2025-01-11T11:13:11Z

worldedit-core/src/main/java/com/fastasyncworldedit/core/function/mask/AngleMask.java

    }

    @Override
    public boolean test(BlockVector3 vector) {

-        if (!mask.test(vector)) {
+        if (!fastMask.test(vector)) {


I think we should cache this at least if on the edge of the chunk, as there's little point using the cache mask on x/z +/- 1 if there's unlikely to be a value already in the cache. We know we're going to be testing every block here

Not sure if it's worth the complexity to have different code paths for those cases. I tried again to revert this change to use the cached mask and it makes the whole mask slower even in the overlay scenario.

I also replaced the usages of the CachedMask in adjacentAir, bringing time spent in adjacentAir down from ~25s (when using the cached mask everywhere) to ~10s in my test... so I guess we really need to look into CachedMask after your PR.

Okay that's definitely concerning then. I suppose the caching makes it slower if the chunks are already available, and we will already have those GET chunks loaded in FAWE

Yes that code path doesn't seem to run into chunk loading, so any access is more or less direct (going through STQE)

github-actions · 2025-01-18T17:57:04Z

Please take a moment and address the merge conflicts of your pull request. Thanks!

SirYwell · 2025-01-21T13:44:55Z

After #3051 got merged, it still seems to be a performance win avoiding the CachedMask usage completely:
(old is the current state of this PR, new is replacing the remaining usages of mask with fastMask.

dordsor21 · 2025-01-26T18:35:13Z

I suppose it's probably worth removing the CachedMask in this case entirely?

SirYwell · 2025-01-27T07:29:16Z

I think so, yeah. Maybe it also comes down to how chunks are spread across threads, that might make caching values of surrounding chunks less useful too.

The CachedMask field is protected, do you think it's still fine to remove it?

SirYwell requested a review from a team as a code owner January 10, 2025 08:51

SirYwell commented Jan 10, 2025

View reviewed changes

...t-core/src/main/java/com/fastasyncworldedit/core/queue/implementation/blocks/CharBlocks.java Outdated Show resolved Hide resolved

worldedit-core/src/main/java/com/fastasyncworldedit/core/function/mask/AngleMask.java Show resolved Hide resolved

dordsor21 reviewed Jan 11, 2025

View reviewed changes

github-actions bot added the unresolved-merge-conflict label Jan 18, 2025

SirYwell added 4 commits January 18, 2025 19:01

check mask instead of extent

df19bd5

skip cached mask when possible

9031566

remove CharBlocks change

df34450

resolve rebase issue

2ea3626

SirYwell force-pushed the perf/AngleMask-improvements branch from 72e49fc to 2ea3626 Compare January 18, 2025 18:14

github-actions bot removed the unresolved-merge-conflict label Jan 18, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize AngleMask #3072

Optimize AngleMask #3072

SirYwell commented Jan 10, 2025

Submitter Checklist

dordsor21 Jan 11, 2025

dordsor21 Jan 11, 2025

SirYwell Jan 11, 2025

dordsor21 Jan 11, 2025

SirYwell Jan 11, 2025

github-actions bot commented Jan 18, 2025

SirYwell commented Jan 21, 2025

dordsor21 commented Jan 26, 2025

SirYwell commented Jan 27, 2025

Optimize AngleMask #3072

Are you sure you want to change the base?

Optimize AngleMask #3072

Conversation

SirYwell commented Jan 10, 2025

Overview

Description

Submitter Checklist

dordsor21 Jan 11, 2025

Choose a reason for hiding this comment

dordsor21 Jan 11, 2025

Choose a reason for hiding this comment

SirYwell Jan 11, 2025

Choose a reason for hiding this comment

dordsor21 Jan 11, 2025

Choose a reason for hiding this comment

SirYwell Jan 11, 2025

Choose a reason for hiding this comment

github-actions bot commented Jan 18, 2025

SirYwell commented Jan 21, 2025

dordsor21 commented Jan 26, 2025

SirYwell commented Jan 27, 2025