langchain-ai
deepagents
feat(sdk): `delete_file` tool
#3691
Merged
Comparing nm/backend-delete-file-tool (a97bb94) with v0.7 (f741293)
Overall performance change: -10%
Benchmarks: 94 total (15 untouched, 79 skipped)
Untouched benchmarks, all measured with the Wall Time instrument in libs/deepagents/tests/benchmarks/test_benchmark_create_deep_agent.py:

| Benchmark | Class | Change | Base | Head |
| --- | --- | --- | --- | --- |
| test_create_deep_agent_minimal | TestCreateDeepAgentBenchmark | +1% | 53.3 ms | 53 ms |
| test_with_tools | TestCreateDeepAgentBenchmark | 0% | 54 ms | 54 ms |
| test_scaling_subagents[3_subagents] | TestCreateDeepAgentScaling | 0% | 132.5 ms | 132.5 ms |
| test_full_featured | TestCreateDeepAgentBenchmark | 0% | 191.1 ms | 191.4 ms |
| test_with_one_subagent | TestCreateDeepAgentBenchmark | 0% | 79.7 ms | 79.9 ms |
| test_scaling_subagents[5_subagents] | TestCreateDeepAgentScaling | 0% | 185 ms | 185.7 ms |
| test_scaling_subagents[10_subagents] | TestCreateDeepAgentScaling | 0% | 316.3 ms | 317.4 ms |
| test_scaling_subagents[1_subagents] | TestCreateDeepAgentScaling | 0% | 79.6 ms | 79.9 ms |
| test_with_string_model_resolution | TestCreateDeepAgentBenchmark | 0% | 53.5 ms | 53.7 ms |
| test_scaling_tools[20_tools] | TestCreateDeepAgentScaling | 0% | 56.7 ms | 57 ms |
| test_scaling_tools[10_tools] | TestCreateDeepAgentScaling | 0% | 55.1 ms | 55.3 ms |
| test_with_multiple_subagents | TestCreateDeepAgentBenchmark | 0% | 184.8 ms | 185.6 ms |
| test_scaling_tools[5_tools] | TestCreateDeepAgentScaling | -1% | 54.1 ms | 54.4 ms |
| test_scaling_tools[1_tools] | TestCreateDeepAgentScaling | -1% | 53.4 ms | 53.7 ms |
| test_filesystem_init | TestCreateDeepAgentBenchmark | -8% | 371.1 µs | 405.3 µs |
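The percentages shown for each benchmark appear consistent with a speed ratio of the first (base) measurement over the second (head) measurement, minus one, so that a slower head yields a negative value. The sketch below is a hedged illustration of that arithmetic only: the formula is an assumption inferred from the rows on this page, and `relative_change` is a hypothetical helper, not part of CodSpeed or deepagents.

```python
def relative_change(base: float, head: float) -> float:
    """Assumed formula: base / head - 1. Negative means head is slower than base."""
    if head <= 0:
        raise ValueError("head measurement must be positive")
    return base / head - 1.0


# (base, head) pairs in seconds, copied from three rows of this report.
rows = {
    "test_filesystem_init": (371.1e-6, 405.3e-6),
    "test_create_deep_agent_minimal": (53.3e-3, 53.0e-3),
    "test_scaling_tools[5_tools]": (54.1e-3, 54.4e-3),
}

for name, (base, head) in rows.items():
    # Print the change with one decimal; the report rounds to whole percents.
    print(f"{name}: {relative_change(base, head):+.1%}")
```

Rounded to whole percents, these three rows reproduce the -8%, +1%, and -1% values listed for them. Because the displayed times are themselves rounded, a row near a rounding boundary may not reproduce exactly from the displayed values.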
The benchmarks below were skipped, so their baseline results are used instead. If they were deleted in your codebase, archive them to remove them from the performance reports.
| Benchmark | Location | Instrument | Status | Baseline |
| --- | --- | --- | --- | --- |
| test_repl_memory_peak[console_log-8_threads] | libs/partners/quickjs/tests/benchmarks/test_quickjs_memory.py::TestQuickJSMemoryBenchmarks | Memory | Skipped | 6.7 MB |
| test_repl_memory_peak[console_log-32_threads] | libs/partners/quickjs/tests/benchmarks/test_quickjs_memory.py::TestQuickJSMemoryBenchmarks | Memory | Skipped | 15.3 MB |
| test_repl_memory_peak[ptc_tools-8_threads] | libs/partners/quickjs/tests/benchmarks/test_quickjs_memory.py::TestQuickJSMemoryBenchmarks | Memory | Skipped | 5.2 MB |
| test_repl_memory_peak[console_log-1_threads] | libs/partners/quickjs/tests/benchmarks/test_quickjs_memory.py::TestQuickJSMemoryBenchmarks | Memory | Skipped | 1 MB |
| test_repl_memory_peak[ptc_tools-32_threads] | libs/partners/quickjs/tests/benchmarks/test_quickjs_memory.py::TestQuickJSMemoryBenchmarks | Memory | Skipped | 19.4 MB |
| test_repl_memory_peak[ptc_tools-1_threads] | libs/partners/quickjs/tests/benchmarks/test_quickjs_memory.py::TestQuickJSMemoryBenchmarks | Memory | Skipped | 1 MB |
| test_repl_memory_peak[console_log-64_threads] | libs/partners/quickjs/tests/benchmarks/test_quickjs_memory.py::TestQuickJSMemoryBenchmarks | Memory | Skipped | 16.4 MB |
| test_repl_memory_peak[ptc_tools-64_threads] | libs/partners/quickjs/tests/benchmarks/test_quickjs_memory.py::TestQuickJSMemoryBenchmarks | Memory | Skipped | 38.7 MB |
| test_multi_turn_snapshot_throughput[snapshot_disabled-50_turns] | libs/partners/quickjs/tests/benchmarks/test_quickjs_throughput.py::TestQuickJSThroughputBenchmarks | Wall Time | Skipped | 6.1 s |
| test_multi_turn_snapshot_throughput[snapshot_enabled-10_turns] | libs/partners/quickjs/tests/benchmarks/test_quickjs_throughput.py::TestQuickJSThroughputBenchmarks | Wall Time | Skipped | 1.2 s |
Commits

Base: main (f741293)

| Change | Commit | SHA | Date | Author |
| --- | --- | --- | --- | --- |
| +0.03% | add evals | e18c8ef | 2 days ago | imnishitha |
| -10.18% | chore(evals): regenerate EVAL_CATALOG.md for delete_file evals | a97bb94 | 2 days ago | imnishitha |
© 2026 CodSpeed Technology