Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Analyze Solr heap dump #3601

Closed
2 of 6 tasks
maxkadel opened this issue May 31, 2023 · 4 comments
Closed
2 of 6 tasks

Analyze Solr heap dump #3601

maxkadel opened this issue May 31, 2023 · 4 comments
Assignees
Labels
performance Tickets related to the application performance

Comments

@maxkadel
Copy link
Contributor

maxkadel commented May 31, 2023

What maintenance needs to be done?

Follow the instructions in Heap Dump section of pul_solr README to see why the heap is filling up.

Level of urgency

  • High
  • Moderate
  • Low

Why is this maintenance needed?

The heap on production Solr is periodically getting high, forcing it to restart and impacting performance.

Acceptance criteria

  • Run the task described
  • Document findings
  • Create any follow-up tickets stemming from findings
@maxkadel maxkadel added the performance Tickets related to the application performance label May 31, 2023
@sandbergja
Copy link
Member

In reviewing heap usage on staging, the heap is not increasing to a concerning amount. Therefore, we decided not to prioritize this analysis for now, but can revisit it when we want to investigate further.

@christinach christinach added the template update Tickets that need to be updated using the appropriate issue template label Apr 16, 2024
@maxkadel maxkadel removed the template update Tickets that need to be updated using the appropriate issue template label May 28, 2024
@sandbergja sandbergja self-assigned this May 28, 2024
@sandbergja
Copy link
Member

We did some preliminary exploration of the heap dump, with the goal of understanding why our heap fills up so fast. We noticed that 89.59% of the heap is solr's cache, including deprecated classes like solr.search.LRUCache (which has been replaced with a more memory-efficient solr.search.CaffeineCache). We came up with a plan to test CaffeineCache. There are other cache settings which bear further investigation.

@sandbergja
Copy link
Member

pulibrary/pul_solr#180 also seems like it would help us to make better use of our heap. Some problems that might get in the way of trying out the changes iteratively/scientifically:

  • it requires a full reindex to see the effects
  • our current solr load test plan doesn't change the sort order or the fq

@sandbergja
Copy link
Member

One more ticket that seems like it would help with our heap usage, with thanks to @maxkadel for identifying the issue: pulibrary/pul_solr#406

Closing this ticket now, so we can focus on the tickets regarding making better use of the cache.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
performance Tickets related to the application performance
Projects
None yet
Development

No branches or pull requests

3 participants