⚡️ Speed up function aggregate_bagging by 40%
#33
📄 40% (0.40x) speedup for `aggregate_bagging` in `framework/py/flwr/serverapp/strategy/strategy_utils.py`
⏱️ Runtime: 13.1 milliseconds → 9.35 milliseconds (best of 275 runs)
📝 Explanation and details
The optimization achieves a roughly 40% speedup by eliminating redundant JSON parsing operations.

Key optimization: The original code called `_get_tree_nums()` twice, once for each input model, so the JSON data was parsed four times in total (twice in the main function, twice in the helper). The optimized version extracts tree numbers directly from the already-parsed JSON objects, reducing JSON parsing from 4 operations to just 2.

Why this matters: From the line profiler results, JSON parsing (`json.loads(bytearray(...))`) was the most expensive operation, consuming 25.9% of total runtime in the original version. Eliminating the redundant calls to `_get_tree_nums()` removes half of the parse calls.

Performance characteristics: This optimization is most effective for larger models where JSON parsing overhead dominates. The test cases show benefits across all scales, from single trees to 500+ tree models, because the redundant parses are eliminated regardless of model size. The optimization maintains identical behavior while halving the parsing work, from four passes over the JSON to two. The parsing phase remains linear in the JSON size; only the constant factor drops.
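The parse-once pattern described above can be sketched as follows. This is a minimal illustration, not the code from this PR: the function names `tree_nums_from_parsed`, `tree_nums_reparse`, and `load_pair_parse_once` are hypothetical, and the nested key path is an assumption about the JSON model layout.

```python
import json


def tree_nums_from_parsed(model: dict) -> tuple[int, int]:
    """Read (num_trees, num_parallel_tree) from an already-parsed model dict.

    The key path below is an assumed layout, used for illustration only.
    """
    params = model["learner"]["gradient_booster"]["model"]["gbtree_model_param"]
    return int(params["num_trees"]), int(params["num_parallel_tree"])


def tree_nums_reparse(model_bytes: bytes) -> tuple[int, int]:
    """Helper in the style of the original: it parses the raw bytes itself,
    so a caller that also parses the same bytes pays for two parses."""
    return tree_nums_from_parsed(json.loads(bytearray(model_bytes)))


def load_pair_parse_once(prev_bytes: bytes, curr_bytes: bytes):
    """Parse each model exactly once and reuse the parsed objects."""
    prev = json.loads(bytearray(prev_bytes))  # parse 1
    curr = json.loads(bytearray(curr_bytes))  # parse 2
    tree_num_prev, _ = tree_nums_from_parsed(prev)        # no extra parse
    _, paral_tree_num_curr = tree_nums_from_parsed(curr)  # no extra parse
    return prev, curr, tree_num_prev, paral_tree_num_curr
```

Both routes return the same tree counts for the same input bytes; the difference is only in how many times `json.loads` runs per model.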
✅ Correctness verification report:
🌀 Generated Regression Tests and Runtime
🔎 Concolic Coverage Tests and Runtime
To edit these changes, run `git checkout codeflash/optimize-aggregate_bagging-mh9kfwe2` and push.