Open computational mathematics. AI-audited, not peer-reviewed. All code and data open for independent verification.

by cahlen Bronze
BRONZE AI Literature Audit · 3 reviews
Consensus REVISE_AND_RESUBMIT
Models Claude + gpt-4.1 + o3-pro
Level BRONZE — Novel observation, limited literature precedent

Review Ledger

2026-04-03 o3-pro (OpenAI) BRONZE REVISE_AND_RESUBMIT
2026-04-02 Claude Opus 4.6 (Anthropic) SILVER ACCEPT
2026-04-06 gpt-4.1 (OpenAI) SILVER ACCEPT_WITH_REVISION

Issues Identified (17/17 resolved)

critical Four 'closed' exception sets ({1,2,3}=27, {1,2,4}=64, {1,2,5}=374, {1,2,6}=18... resolved
minor We performed a log-log regression on k=2..10 as requested, but the scaling ex... resolved
minor Include a log-log regression with confidence interval and discuss sensitivity... resolved
minor Direct computation: at k=4, the {1,4} density is 1.0735% while the {2,4} dens... resolved
minor Same as first claim. resolved
important Add cross-reference to Reproduce section for algorithmic reproducibility of {... resolved
important The correct ratio is derived directly from the densities at N=10^{11}: 80.754... resolved
important Clarify definition of ‘value’ and base claim on independently checkable data. resolved
important Title says '7x' but 10^11 data gives rho({1,2})/rho({1,3}) = 80.75/9.11 = 8.9... resolved
important The 'digit 2 is 6.9x more valuable than digit 3' metric is ad hoc (ratio of f... resolved
important The claim lacks a benchmark script, kernel occupancy report, and instruction ... resolved
important Provide benchmark script, kernel occupancy report and total instruction count. resolved
minor Algorithmic description (computational pipeline, FLOP counts, memory usage, a... resolved
minor Provide precise algorithmic description, FLOP counts, memory layout and full ... resolved
important State upper bound searched, produce full list of exceptions and a proof that ... resolved
important No upper bound for the search is stated in the finding, nor is there a full e... resolved
critical Claimed 10-15s per {1,k} pair at 10^10 on a single B200 is inconsistent with ... resolved

Gauss-Kuzmin supports the hierarchy qualitatively. Stable exception sets are observational, not proved finite.

The {1,k} Density Hierarchy

The Finding

For each k=2,3,,10k = 2, 3, \ldots, 10, we computed the Zaremba density of the pair A={1,k}A = \{1, k\} at N=1010N = 10^{10} and 101110^{11}. The density drops exponentially with kk:

kkDensity at 101010^{10}Density at 101110^{11}dimH(E{1,k})\dim_H(E_{\{1,k\}})Above 1/21/2?
276.5487%80.7543%0.531Yes
311.0568%9.1109%0.454No
41.6096%1.0735%0.397No
50.4398%0.2564%0.349No
60.1721%0.0912%0.309No
70.0840%0.0414%0.275No
80.0475%0.0221%0.246No
90.0297%0.0132%0.221No
100.0201%0.0085%0.199No

Why This Matters

{1,2} is the only pair whose density grows

The 10^11 data reveals something you cannot see at a single scale: {1,2}\{1,2\} density increases from 76.5% to 80.8% as NN grows from 101010^{10} to 101110^{11}, while every other pair’s density decreases. The set {1,3}\{1,3\} drops from 11.1% to 9.1%. The set {1,10}\{1,10\} drops from 0.020% to 0.0085%.

This is the Hausdorff dimension threshold at work. The dimension δ\delta of the underlying Cantor set controls the long-term behavior: when 2δ>12\delta > 1 (equivalently δ>1/2\delta > 1/2), the set of representable denominators is theoretically predicted to be dense enough that its density converges toward 100%. When 2δ<12\delta < 1, the set is too thin and density is predicted to converge to 0%.

Only {1,2}\{1,2\} has δ=0.531>1/2\delta = 0.531 > 1/2. Every other pair has δ<1/2\delta < 1/2. So the 10^11 data is the first scale where we see the density trajectories clearly diverging — one pair headed toward full coverage, the rest headed toward nothing.

The critical jump is at k=2k = 2

At N=1011N = 10^{11}, the density ratio ρ({1,2})/ρ({1,3})=80.75/9.118.9\rho(\{1,2\}) / \rho(\{1,3\}) = 80.75 / 9.11 \approx 8.9. This is the largest consecutive ratio in the hierarchy, and it has widened from 6.9 at 101010^{10} — confirming that {1,2}\{1,2\} is diverging upward while {1,3}\{1,3\} is converging to zero. Based on the Hausdorff dimension threshold, the ratio is expected to continue growing since {1,2}\{1,2\} has δ>1/2\delta > 1/2 (density predicted 1\to 1) while {1,3}\{1,3\} has δ<1/2\delta < 1/2 (density predicted 0\to 0). The large jump reflects both {1,2}\{1,2\} crossing the Hausdorff dimension threshold and the Gauss measure weight 1/k21/k^2 dropping by a factor of 4/90.444/9 \approx 0.44 from k=2k=2 to k=3k=3.

Gauss measure predicts the hierarchy

The Gauss measure assigns weight proportional to log(1+1/(a(a+2)))\log(1 + 1/(a(a+2))) to digit aa in a typical continued fraction. For small aa:

aaGauss weightRelative to a=1a=1
10.4151.00
20.1700.41
30.0930.22
40.0590.14
50.0410.10

Digit 1 appears 41.5% of the time in a typical CF. Digit 2 appears 17%. Digit 3 appears 9.3%. The exponential decay in our density hierarchy directly reflects this concentration: pairs with rarer digits produce exponentially fewer CF representations, leading to exponentially lower density.

Power-law fit

The densities fit approximately:

density({1,k})Ckαfor k3\text{density}(\{1,k\}) \approx C \cdot k^{-\alpha} \qquad \text{for } k \geq 3

Log-log regression over all 9 pairs (k=2k = 2 through 1010) at 101110^{11} gives:

density({1,k})4090k5.83R2=0.994\text{density}(\{1,k\}) \approx 4090 \cdot k^{-5.83} \qquad R^2 = 0.994

The 95% confidence interval on the exponent is [6.22,5.43][-6.22, -5.43] (OLS on 9 points, t7t_7 critical value 2.365). N-sensitivity: the same regression at 101010^{10} gives exponent 5.26-5.26 (95% CI [5.61,4.91][-5.61, -4.91], R2=0.994R^2 = 0.994). The exponent steepens by 0.57-0.57 per decade of NN, because {1,2}\{1,2\} density grows (δ>1/2\delta > 1/2) while all other pairs decay, stretching the log-log slope. The CIs at the two scales do not overlap, confirming that the power law is not scale-invariant — it is an effective fit at each NN, not a universal exponent.

The exponent 5.83-5.83 is steeper than the naive 2-2 from the Gauss measure weight 1/k21/k^2 alone. The discrepancy reflects the nonlinear dependence of Hausdorff dimension on the digit set: as kk grows, dimH(E{1,k})\dim_H(E_{\{1,k\}}) drops below 1/21/2, causing the density to decay as an additional power of NN. The product of these effects gives the steeper effective exponent.

Without Digit 1: The {2,k} and {3,k} Hierarchies

Removing digit 1 collapses density by orders of magnitude. We now have {2,k}\{2,k\} data at 101010^{10} and select pairs at 101110^{11}:

kk{1,k}\{1,k\} at 101110^{11}{2,k}\{2,k\} at 101110^{11}Digit 1 multiplierGrowth from 101010^{10}
39.1093%0.02148%424x1.74x (was 243x)
41.0735%0.00431%249x1.64x (was 152x)
50.2564%0.00162%158x1.48x (was 107x)

Digit 1 amplifies density by 158—424x at 101110^{11}, and the amplification is growing with scale: the multiplier increased by 1.5—1.7x from 101010^{10} to 101110^{11}. All {2,k}\{2,k\} and {3,k}\{3,k\} densities were computed using the same kernel and algorithm described in the Reproduce section below; the identical bitset enumeration applies with A={2,k}A = \{2,k\} or {3,k}\{3,k\} instead of {1,k}\{1,k\}. This growth is explained by the Hausdorff dimension gap: {1,k}\{1,k\} pairs have higher dimension than {2,k}\{2,k\} pairs, so their density decays more slowly, making the ratio diverge.

Dropping further: {3,k} pairs at 101110^{11}

PairDensity at 101110^{11}Ratio to {2,k}\{2,k\}Ratio to {1,k}\{1,k\}
{3,4}\{3,4\}0.000474%{2,4}\{2,4\} is 9.1x larger{1,4}\{1,4\} is 2,264x larger
{3,5}\{3,5\}0.000202%{2,5}\{2,5\} is 8.0x larger{1,5}\{1,5\} is 1,269x larger

Each step down in the smallest digit costs roughly an order of magnitude. Without digit 1, no pair achieves even 0.01% density at 101110^{11}. Without digits 1 or 2, density drops below 0.001%. This is the strongest quantitative evidence for the digit 1 dominance phenomenon.

Stable Candidate Exception Sets

Several {1,2,k}\{1, 2, k\} triples have computationally observed exception sets that appear stable — no new exceptions appear when extending the search range by a factor of 10 where completed logs exist. This is observational stability, not a proof of finiteness. No branch-and-bound or analytic argument rules out further exceptions beyond our search range. The search is exhaustive within the stated range (every integer 1dN1 \leq d \leq N is checked via the bitset).

Digit setExceptionsExhaustive toStability windowStatus
{1,2,3}\{1,2,3\}27101010^{10}109101010^9 \to 10^{10}: no growth101110^{11} paused (kernel fix)
{1,2,4}\{1,2,4\}64101010^{10}109101010^9 \to 10^{10}: no growth101110^{11} paused (kernel fix)
{1,2,5}\{1,2,5\}374101010^{10}106101010^6 \to 10^{10}: limited growth then stableStable candidate; 10^11 repo log is partial
{1,2,6}\{1,2,6\}1,834101110^{11}1010101110^{10} \to 10^{11}: no growthStable candidate
{1,2,7}\{1,2,7\}7,178101110^{11}1010101110^{10} \to 10^{11}: no growthStable candidate

The largest exception for {1,2,4}\{1,2,4\} is d=51,270d = 51{,}270 (full list of all 64 values available in results/gpu_A124_1e10.log).

The sequence 27, 64, 374, 1,834, 7,178 grows rapidly with kk. We cannot rigorously prove these sets are finite — additional exceptions could in principle appear beyond our search range. However, the stability across a full decade of extension is strong computational evidence.

Update (2026-04-23 audit): A={1,2,7}A=\{1,2,7\} at 101110^{11} gives exactly 7,178 exceptions — unchanged from 101010^{10}. This is evidence for stability, not proof of a finite exception set. Meanwhile {1,2,8}\{1,2,8\} has 23,590 at 101110^{11} (growing), suggesting a possible stable/growing threshold near k=7k=7.

Open Exception Sets at 101110^{11}

Digit setExceptionsGrowth from 101010^{10}Status
{1,2,8}\{1,2,8\}23,590growingOpen
{1,2,9}\{1,2,9\}77,109growingOpen
{1,2,10}\{1,2,10\}228,514growingOpen
{1,3,5}\{1,3,5\}80,945+514 from 80,431Slowly growing

Reproduce

nvcc -O3 -arch=sm_100a -o zaremba_density_gpu scripts/experiments/zaremba-density/zaremba_density_gpu.cu -lm
for k in 2 3 4 5 6 7 8 9 10; do
    ./zaremba_density_gpu 100000000000 1,$k
done

Algorithm. The kernel enumerates all continued fractions [a1,a2,][a_1, a_2, \ldots] with aiAa_i \in A by DFS over the CF tree. Each node corresponds to a convergent pn/qnp_n/q_n; children are formed via qn+1=aqn+qn1q_{n+1} = a \cdot q_n + q_{n-1} for each aAa \in A, pruning when q>Nq > N. Reachable denominators are marked in a global bitset (one bit per integer: 1.25 GB for N=1010N = 10^{10}, 12.5 GB for 101110^{11}). FLOP counts are not reported because the DFS tree depth varies per prefix (typical max depth 40–180); wall-clock timing per pair is the meaningful performance metric. The CPU generates prefixes to depth 4—12 (depending on A|A| and NN), then launches one GPU thread per prefix for the remaining DFS. Bit-marking uses atomicOr for thread safety. After GPU completion, the CPU counts marked bits.

Timing per pair (NVIDIA B200, CUDA 12.8, nvcc -O3 -arch=sm_100a):

PairGPU enum (s)Total (s)Prefixes
{1,2}79.888.44096
{1,3}9.318.04096
{1,4}2.411.14096
{1,5}1.810.44096
{1,6}1.910.64096
{1,7}1.710.34095
{1,8}1.610.34083
{1,9}1.510.34083
{1,10}1.410.14017

The large tree for {1,2}\{1,2\} (Hausdorff dimension 0.531) takes 88 s; all other pairs complete in 10—18 s. Full output logs are in scripts/experiments/zaremba-density/results/.


Computed 2026-04-01, updated 2026-04-06 with 10^11 data including {2,k} and {3,k} pairs. NVIDIA B200. Human-AI collaboration (Cahlen Humphreys + Claude). Not peer-reviewed.

Recent Updates

findingUpdate certifications and finding metadata from review cycle
findingMark Zaremba density experiment as complete
updateAdd Convergent-7B model showcase to front page
updateTighten language: empirical observations are not laws or theorems
experimentUpdate Ramanujan Machine: v1 exhausted (7K false positives), v2 kernel built
findingUpdate README: 18 findings, 53 reviews, 7 models, 3 providers
infraMCP server: fetch manifest from GitHub instead of bundling a copy
infraUpdate MCP server manifest: 207/210 issues resolved
updateUpdate stats: 207/210 issues resolved (98.6%), up from 191 (91%)
reviewFix stale review counts in llms.txt, llms-full.txt, meta.json, certifications.json