double_pendulum greenvalidity pass

Tier 2 · wall_time · tier2 default grid · OS linux

Validity gate

passpass — perf claims allowed when ratio is green

Source: latest.csv:perf_present

Wall-clock row exists only — no explicit correctness signal yet.

Ratio vs best competitor
0.947 (94.7% of rust speed) (best in series: rust — Li is never labeled best)
Best competitor value
0.3084 s

Numeric oracle

No analytical verify rows in ingest yet. Run tier-1 with --verify so latest.csv exports verify_ulps / verify_within_1ulp.

Problem size
tier2 default grid
Category
physics
Pillar
physics
Package
lic
Li / catalog oracle (mean ± σ)
0.3257 s / 0.3089 s (cpp)
Ratio vs catalog oracle
1.0544×
Best competitor
rust (0.3084 s)
Li relative speed vs SOTA
0.947 (1.0 = rust speed)
Validity
validity pass
Threshold
1.2×
Source
lic/benchmarks/tier2_physics/double_pendulum

PH ids: PH-5b

Performance vs best competitor

Compare oracle: cpp

Relative speed vs best competitor (rust) — SOTA = 1.0, higher is better. Absolute s values are in the table below.

Absolute measurements

Language comparison
LangMean ± σRunsUnitVariantOS
li0.3257sreleaselinux
cpp0.3089sreleaselinux
rust0.3084sreleaselinux
julia0.3141sreleaselinux
Metric: wall_time (value = mean of timed runs)

Host OS

Measurement hosts by language
ScopeOS
Row aggregatelinux
li (release)linux
cpp (release)linux
rust (release)linux
julia (release)linux

Latest history deltas

  • ratio_vs_cpp: 0.9997 → 1.0544 (Δ 0.0547) · regressed

← Overview