-
Notifications
You must be signed in to change notification settings - Fork 1.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Ruby/performance queries #18304
Draft
yoff
wants to merge
7
commits into
github:main
Choose a base branch
from
yoff:ruby/performance-queries
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Draft
Ruby/performance queries #18304
Changes from all commits
Commits
Show all changes
7 commits
Select commit
Hold shift + click to select a range
a300eed
ruby: first sketches
yoff f6cb090
ruby: more tweaking
yoff 51ecf48
ruby: first draft qhelp plus some tweaks
yoff 5d3a541
Ruby: revome cases where the loop may be interrupted
yoff 7d5fbf5
ruby: first draft of thread-unsafe modification
yoff 6ef6427
ruby: add comment from triage session
yoff c28c35c
ruby: add md help file to ierate on
yoff File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,14 @@ | ||
import ruby | ||
import codeql.ruby.ast.Expr | ||
import codeql.ruby.ast.Constant | ||
import codeql.ruby.ast.Variable | ||
|
||
predicate relevantModification(Expr e) { | ||
e instanceof ClassVariableWriteAccess | ||
or | ||
exists(e.(ConstantAssignment).getScopeExpr()) | ||
} | ||
|
||
from Expr e | ||
where relevantModification(e) and exists(e.getEnclosingCallable()) | ||
select e |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,24 @@ | ||
<!DOCTYPE qhelp PUBLIC | ||
"-//Semmle//qhelp//EN" | ||
"qhelp.dtd"> | ||
<qhelp> | ||
|
||
<overview> | ||
<p> | ||
When an AcriveRecord query is executed synchronously, the application is blocked until the query completes. This can lead to poor performance, especially when the query is slow or when many queries are executed in sequence. This query identifies ActiveRecord queries that could be executed asynchronously to improve performance. | ||
Specifically, this query identifies ActiveRecord queries that are executed in a loop. If each query is independent of the others, the queries could be executed in parallel by using the built-in Ruby <code>load_async</code> method. | ||
In those cases, where the query includes a <code>pluck</code> call, the query could be executed asynchronously by using the <code>async_pluck</code> method. | ||
</p> | ||
</overview> | ||
<recommendation> | ||
<p>If possible, split the loop into two. The first creates a map of promises resolving to the async query results. The second runs throuhg this map, waiting on each promise, and does whatever the original loop did with the result of the query.</p> | ||
</recommendation> | ||
<example> | ||
<p>The following (suboptimal) example code executes a series of ActiveRecord queries in a loop. The queries are independent of each other, so they could be executed in parallel to improve performance.</p> | ||
<sample src="examples/straight_loop.rb" /> | ||
<p>To be able to fetch the necessary information asynchronously, we first pull it out into its own (implicit) loop: | ||
<sample src="examples/preload.rb" /> | ||
<p>We can now use the <code>async_pluck</code> method to execute the queries in parallel.</p> | ||
<sample src="examples/async_pluck.rb" /> | ||
</example> | ||
</qhelp> |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,99 @@ | ||
/** | ||
* @name Could be async | ||
* @description Use `ActiveRecord::Relation#load_async` to load records asynchronously. | ||
* @kind problem | ||
* @problem.severity info | ||
* @precision high | ||
* @id rb/could-be-async | ||
* @tags performance | ||
*/ | ||
|
||
// Consider also Associations. | ||
// Associations are lazy-loading by default, so something like | ||
// in a loop over article do | ||
// article.book | ||
// if you have 1000 articles it will do a 1000 calls to book. | ||
// If you already did article includes book, there should be no problem. | ||
import ruby | ||
private import codeql.ruby.AST | ||
import codeql.ruby.Concepts | ||
import codeql.ruby.frameworks.ActiveRecord | ||
private import codeql.ruby.TaintTracking | ||
|
||
string loopMethodName() { | ||
result in [ | ||
"each", "reverse_each", "map", "map!", "foreach", "flat_map", "in_batches", "one?", "all?", | ||
"collect", "collect!", "select", "select!", "reject", "reject!" | ||
] | ||
} | ||
|
||
class LoopingCall extends DataFlow::CallNode { | ||
DataFlow::CallableNode loopBlock; | ||
|
||
LoopingCall() { | ||
this.getMethodName() = loopMethodName() and loopBlock = this.getBlock().asCallable() | ||
} | ||
|
||
DataFlow::CallableNode getLoopBlock() { result = loopBlock } | ||
} | ||
|
||
predicate happensInLoop(LoopingCall loop, DataFlow::CallNode e) { | ||
loop.getLoopBlock().asCallableAstNode() = e.asExpr().getScope() | ||
} | ||
|
||
predicate happensInOuterLoop(LoopingCall outerLoop, DataFlow::CallNode e) { | ||
exists(LoopingCall innerLoop | | ||
happensInLoop(outerLoop, innerLoop) and | ||
happensInLoop(innerLoop, e) | ||
) | ||
} | ||
|
||
predicate happensInInnermostLoop(LoopingCall loop, DataFlow::CallNode e) { | ||
happensInLoop(loop, e) and | ||
not happensInOuterLoop(loop, e) | ||
} | ||
|
||
private class PluckCall extends ActiveRecordInstanceMethodCall { | ||
PluckCall() { this.getMethodName() in ["pluck"] } | ||
|
||
ActiveRecordInstance chaines() { result = getChain(this) } | ||
} | ||
|
||
private ActiveRecordInstance getChain(ActiveRecordInstanceMethodCall c) { | ||
result = c.getInstance() | ||
or | ||
result = getChain(c.getInstance()) | ||
} | ||
|
||
// The ActiveRecord instance is used to potentially control the loop | ||
predicate usedInLoopControlGuard(ActiveRecordInstance ar, DataFlow::Node guard) { | ||
TaintTracking::localTaint(ar, guard) and | ||
guard = guardForLoopControl(_, _) | ||
} | ||
|
||
// A guard for controlling the loop | ||
DataFlow::Node guardForLoopControl(ConditionalExpr cond, Stmt control) { | ||
result.asExpr().getAstNode() = cond.getCondition().getAChild*() and | ||
( | ||
control.(MethodCall).getMethodName() = "raise" | ||
or | ||
control instanceof NextStmt | ||
) and | ||
control = cond.getBranch(_).getAChild() | ||
} | ||
|
||
from LoopingCall loop, DataFlow::CallNode call, string message | ||
where | ||
not call.getLocation().getFile().getAbsolutePath().matches("%test%") and | ||
not call = any(PluckCall p).chaines() and | ||
not usedInLoopControlGuard(call, _) and | ||
happensInInnermostLoop(loop, call) and | ||
( | ||
call instanceof ActiveRecordModelFinderCall and | ||
not call.getMethodName() in ["new", "create"] and | ||
message = "could be chained with load_async" | ||
or | ||
call instanceof PluckCall and | ||
message = "could be async_pluck" | ||
) | ||
select call, "This call happens inside $@, and " + message, loop, "this loop" |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,16 @@ | ||
require 'async_pluck' | ||
|
||
# Preload User data in parallel | ||
user_data = User.where(login: repo_names_by_owner.keys).async_pluck(:login, :id, :type).to_h do |login, id, type| | ||
[login, { id: id, type: type == "User" ? "USER" : "ORGANIZATION" }] | ||
end | ||
|
||
repo_names_by_owner.each do |owner_slug, repo_names| | ||
owner_info = user_data[owner_slug] | ||
owner_id = owner_info[:id] | ||
owner_type = owner_info[:type] | ||
rel_conditions = { owner_id: owner_id, name: repo_names } | ||
|
||
nwo_rel = nwo_rel.or(RepositorySecurityCenterConfig.where(rel_conditions)) unless neg | ||
nwo_rel = nwo_rel.and(RepositorySecurityCenterConfig.where.not(rel_conditions)) if neg | ||
end |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,14 @@ | ||
# Preload User data | ||
user_data = User.where(login: repo_names_by_owner.keys).pluck(:login, :id, :type).to_h do |login, id, type| | ||
[login, { id: id, type: type == "User" ? "USER" : "ORGANIZATION" }] | ||
end | ||
|
||
repo_names_by_owner.each do |owner_slug, repo_names| | ||
owner_info = user_data[owner_slug] | ||
owner_id = owner_info[:id] | ||
owner_type = owner_info[:type] | ||
rel_conditions = { owner_id: owner_id, name: repo_names } | ||
|
||
nwo_rel = nwo_rel.or(RepositorySecurityCenterConfig.where(rel_conditions)) unless neg | ||
nwo_rel = nwo_rel.and(RepositorySecurityCenterConfig.where.not(rel_conditions)) if neg | ||
end |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,8 @@ | ||
repo_names_by_owner.map do |owner_slug, repo_names| | ||
owner_id, owner_type = User.where(login: owner_slug).pluck(:id, :type).first | ||
owner_type = owner_type == "User" ? "USER" : "ORGANIZATION" | ||
rel_conditions = { owner_id: owner_id, name: repo_names } | ||
|
||
nwo_rel = nwo_rel.or(RepositorySecurityCenterConfig.where(rel_conditions)) unless neg | ||
nwo_rel = nwo_rel.and(RepositorySecurityCenterConfig.where.not(rel_conditions)) if neg | ||
end |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,46 @@ | ||
# ActiveRecord performance optimisation | ||
|
||
## Description | ||
|
||
When an AcriveRecord query is executed in a loop, it potentially an n+1 problem. | ||
This query identifies situations where an ActiveRecord query execution could be pulled out of a loop. | ||
|
||
## Recommendation | ||
Pull the query out of the loop, thus replacing the many calls with a single one. | ||
|
||
## Examples | ||
|
||
### Suboptimal | ||
|
||
The following (suboptimal) example code queries the User object in each iteration of the loop: | ||
|
||
```ruby | ||
repo_names_by_owner.map do |owner_slug, repo_names| | ||
owner_id, owner_type = User.where(login: owner_slug).pluck(:id, :type).first | ||
owner_type = owner_type == "User" ? "USER" : "ORGANIZATION" | ||
rel_conditions = { owner_id: owner_id, name: repo_names } | ||
|
||
nwo_rel = nwo_rel.or(RepositorySecurityCenterConfig.where(rel_conditions)) unless neg | ||
nwo_rel = nwo_rel.and(RepositorySecurityCenterConfig.where.not(rel_conditions)) if neg | ||
end | ||
``` | ||
### Fixed | ||
|
||
To improve the performance, we instead query the User object once outside the loop, gathereing all necessary information: | ||
|
||
```ruby | ||
# Preload User data | ||
user_data = User.where(login: repo_names_by_owner.keys).pluck(:login, :id, :type).to_h do |login, id, type| | ||
[login, { id: id, type: type == "User" ? "USER" : "ORGANIZATION" }] | ||
end | ||
|
||
repo_names_by_owner.each do |owner_slug, repo_names| | ||
owner_info = user_data[owner_slug] | ||
owner_id = owner_info[:id] | ||
owner_type = owner_info[:type] | ||
rel_conditions = { owner_id: owner_id, name: repo_names } | ||
|
||
nwo_rel = nwo_rel.or(RepositorySecurityCenterConfig.where(rel_conditions)) unless neg | ||
nwo_rel = nwo_rel.and(RepositorySecurityCenterConfig.where.not(rel_conditions)) if neg | ||
end | ||
``` |
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Check warning
Code scanning / CodeQL
Singleton set literal Warning