Open computational mathematics. AI-audited, not peer-reviewed. All code and data open for independent verification.

by cahlen Bronze
BRONZE AI Literature Audit · 2 reviews
Consensus REVISE_AND_RESUBMIT
Models Claude + o3-pro
Level BRONZE — Novel observation, limited literature precedent

Review Ledger

2026-04-03 o3-pro (OpenAI) BRONZE REVISE_AND_RESUBMIT
2026-04-02 Claude Opus 4.6 (Anthropic) SILVER ACCEPT

Issues Identified (9/9 resolved)

critical Four 'closed' exception sets ({1,2,3}=27, {1,2,4}=64, {1,2,5}=374, {1,2,6}=18... resolved
minor Include a log-log regression with confidence interval and discuss sensitivity... resolved
minor Same as first claim. resolved
important Clarify definition of ‘value’ and base claim on independently checkable data. resolved
important The 'digit 2 is 6.9x more valuable than digit 3' metric is ad hoc (ratio of f... resolved
important Provide benchmark script, kernel occupancy report and total instruction count. resolved
minor Provide precise algorithmic description, FLOP counts, memory layout and full ... resolved
important State upper bound searched, produce full list of exceptions and a proof that ... resolved
critical Claimed 10-15s per {1,k} pair at 10^10 on a single B200 is inconsistent with ... resolved

Gauss-Kuzmin supports theory. 4 stable exception sets observed (27, 64, 374, 1834). {1,k} hierarchy clean data.

The {1,k} Density Hierarchy

The Finding

For each k=2,3,,10k = 2, 3, \ldots, 10, we computed the Zaremba density of the pair A={1,k}A = \{1, k\} at N=1010N = 10^{10} and 101110^{11}. The density drops exponentially with kk:

kkDensity at 101010^{10}Density at 101110^{11}dimH(E{1,k})\dim_H(E_{\{1,k\}})Above 1/21/2?
276.5487%80.7543%0.531Yes
311.0568%9.1109%0.454No
41.6096%1.0735%0.397No
50.4398%0.2564%0.349No
60.1721%0.0876%0.309No
70.0840%0.0387%0.275No
80.0475%0.0202%0.246No
90.0297%0.0117%0.221No
100.0201%0.0074%0.199No

Why This Matters

{1,2} is the only pair whose density grows

The 10^11 data reveals something you cannot see at a single scale: {1,2}\{1,2\} density increases from 76.5% to 80.8% as NN grows from 101010^{10} to 101110^{11}, while every other pair’s density decreases. The set {1,3}\{1,3\} drops from 11.1% to 9.1%. The set {1,10}\{1,10\} drops from 0.020% to 0.007%.

This is the Hausdorff dimension threshold at work. The dimension δ\delta of the underlying Cantor set controls the long-term behavior: when 2δ>12\delta > 1 (equivalently δ>1/2\delta > 1/2), the set of representable denominators is dense enough that its density converges toward 100%. When 2δ<12\delta < 1, the set is too thin and density converges to 0%.

Only {1,2}\{1,2\} has δ=0.531>1/2\delta = 0.531 > 1/2. Every other pair has δ<1/2\delta < 1/2. So the 10^11 data is the first scale where we see the density trajectories clearly diverging — one pair headed toward full coverage, the rest headed toward nothing.

The critical jump is at k=2k = 2

At N=1010N = 10^{10}, the density ratio ρ({1,2})/ρ({1,3})=76.55/11.066.9\rho(\{1,2\}) / \rho(\{1,3\}) = 76.55 / 11.06 \approx 6.9. This is the largest consecutive ratio in the hierarchy. It measures the density gap between two specific digit pairs at a fixed search range, not an intrinsic “value” of digit 2 vs. digit 3; at different NN the ratio may shift (though we expect it to stabilize as NN \to \infty because both sets have positive Hausdorff dimension). The large jump reflects both {1,2}\{1,2\} crossing the Hausdorff dimension threshold (δ>1/2\delta > 1/2) and the Gauss measure weight 1/k21/k^2 dropping by a factor of 4/90.444/9 \approx 0.44 from k=2k=2 to k=3k=3.

Gauss measure predicts the hierarchy

The Gauss measure assigns weight proportional to log(1+1/(a(a+2)))\log(1 + 1/(a(a+2))) to digit aa in a typical continued fraction. For small aa:

aaGauss weightRelative to a=1a=1
10.4151.00
20.1700.41
30.0930.22
40.0590.14
50.0410.10

Digit 1 appears 41.5% of the time in a typical CF. Digit 2 appears 17%. Digit 3 appears 9.3%. The exponential decay in our density hierarchy directly reflects this concentration: pairs with rarer digits produce exponentially fewer CF representations, leading to exponentially lower density.

Power-law fit

The densities fit approximately:

density({1,k})Ckαfor k3\text{density}(\{1,k\}) \approx C \cdot k^{-\alpha} \qquad \text{for } k \geq 3

with α\alpha in the range 3—4 (a formal log-log regression with confidence intervals has not been performed; the exponent estimate is approximate given the small number of data points). The rough magnitude is consistent with twice the Gauss measure exponent 2-2 (from 1/k21/k^2), which is expected since density depends on the product of the two digits’ contributions.

Without Digit 1: The {2,k} Hierarchy

For comparison, we computed all {2,k}\{2, k\} pairs at 101010^{10}:

kk{1,k}\{1,k\} density{2,k}\{2,k\} densityDigit 1 multiplier
311.06%0.0455%243x
41.61%0.0106%152x
50.44%0.0041%107x
60.172%0.0023%75x
70.084%0.0013%65x
80.047%0.0009%55x
90.030%0.0006%47x
100.020%0.0005%42x

Digit 1 amplifies density by 42—243x over the equivalent pair with digit 2 (ratios computed from the same GPU kernel at N=1010N = 10^{10}; see results/gpu_A1k_1e10.log and results/gpu_A2k_1e10.log for raw counts). The amplification is strongest for small kk (where digit 1’s presence lifts the Hausdorff dimension above the critical threshold) and weakest for large kk (where both sets have such low dimension that density is near zero regardless).

Without digit 1, no pair achieves even 0.1% density. This is the strongest quantitative evidence for the digit 1 dominance phenomenon.

Closed Exception Sets

Four {1,2,k}\{1, 2, k\} triples have computationally observed exception sets that appear stable — no new exceptions appear when extending the search range by a factor of 10. This is observational stability, not a proof of finiteness. No branch-and-bound or analytic argument rules out further exceptions beyond our search range. The search is exhaustive within the stated range (every integer 1dN1 \leq d \leq N is checked via the bitset).

Digit setExceptionsExhaustive toStability windowStatus
{1,2,3}\{1,2,3\}27101010^{10}109101010^9 \to 10^{10}: no growth101110^{11} in progress
{1,2,4}\{1,2,4\}64101010^{10}109101010^9 \to 10^{10}: no growth101110^{11} in progress
{1,2,5}\{1,2,5\}374101110^{11}1010101110^{10} \to 10^{11}: no growthClosed
{1,2,6}\{1,2,6\}1,834101110^{11}1010101110^{10} \to 10^{11}: no growthClosed
{1,2,7}\{1,2,7\}7,178101110^{11}1010101110^{10} \to 10^{11}: no growthNEW — Closed

The largest exception for {1,2,4}\{1,2,4\} is d=51,270d = 51{,}270 (full list of all 64 values available in results/gpu_A124_1e10.log).

The sequence 27, 64, 374, 1,834, 7,178 grows rapidly with kk. We cannot rigorously prove these sets are finite — additional exceptions could in principle appear beyond our search range. However, the stability across a full decade of extension is strong computational evidence.

Update (2026-04-05): A={1,2,7}A=\{1,2,7\} at 101110^{11} confirms exactly 7,178 exceptions — unchanged from 101010^{10}, making this the fifth closed exception set. Meanwhile {1,2,8}\{1,2,8\} has 23,590 at 101110^{11} (growing), suggesting a sharp closed/open threshold at k=7k=7.

Open Exception Sets at 101110^{11}

Digit setExceptionsGrowth from 101010^{10}Status
{1,2,8}\{1,2,8\}23,590growingOpen
{1,2,9}\{1,2,9\}77,109growingOpen
{1,2,10}\{1,2,10\}228,514growingOpen
{1,3,5}\{1,3,5\}80,945+514 from 80,431Slowly growing

Reproduce

nvcc -O3 -arch=sm_100a -o zaremba_density_gpu scripts/experiments/zaremba-density/zaremba_density_gpu.cu -lm
for k in 2 3 4 5 6 7 8 9 10; do
    ./zaremba_density_gpu 10000000000 1,$k
done

Algorithm. The kernel enumerates all continued fractions [a1,a2,][a_1, a_2, \ldots] with aiAa_i \in A by DFS over the CF tree. Each node corresponds to a convergent pn/qnp_n/q_n; children are formed via qn+1=aqn+qn1q_{n+1} = a \cdot q_n + q_{n-1} for each aAa \in A, pruning when q>Nq > N. Reachable denominators are marked in a global bitset (1.25 GB for N=1010N = 10^{10}, one bit per integer). The CPU generates prefixes to depth 4—12 (depending on A|A| and NN), then launches one GPU thread per prefix for the remaining DFS. Bit-marking uses atomicOr for thread safety. After GPU completion, the CPU counts marked bits.

Timing per pair (NVIDIA B200, CUDA 12.8, nvcc -O3 -arch=sm_100a):

PairGPU enum (s)Total (s)Prefixes
{1,2}79.888.44096
{1,3}9.318.04096
{1,4}2.411.14096
{1,5}1.810.44096
{1,6}1.910.64096
{1,7}1.710.34095
{1,8}1.610.34083
{1,9}1.510.34083
{1,10}1.410.14017

The large tree for {1,2}\{1,2\} (Hausdorff dimension 0.531) takes 88 s; all other pairs complete in 10—18 s. Full output logs are in scripts/experiments/zaremba-density/results/.


Computed 2026-04-01 on NVIDIA B200. Human-AI collaboration (Cahlen Humphreys + Claude). Not peer-reviewed.

Recent Updates

updateDigit pair hierarchy: add 10^11 column for all {1,k} pairs
findingZaremba findings: fifth closed set {1,2,7}, open sets at 10^11, {1,2,6} confirmed
updateUpdate recent updates changelog from git log
findingZaremba density: add 10^11 results + fifth closed exception set {1,2,7}=7178
findingFix findings nav: centered flex, mobile-first, aligned with cards
findingFix findings nav: aligned grid instead of messy flex buttons
reviewTone audit: fix overclaiming across About, Interactive, Verification
updateTone down 'For Students' section: less grandiose, more genuine
updateAbout: add 'For Students and Researchers' section
updateFix overclaiming language in all experiment pages