Open computational mathematics. AI-audited, not peer-reviewed. All code and data open for independent verification.

by cahlen Bronze
BRONZE AI Literature Audit · 2 reviews
Consensus ACCEPT_WITH_REVISION
Models Claude + o3-pro
Level BRONZE — Novel observation, limited literature precedent

Review Ledger

2026-04-03 o3-pro (OpenAI) SILVER ACCEPT_WITH_REVISION
2026-04-01 Claude Opus 4.6 (Anthropic) BRONZE ACCEPT_WITH_REVISION

Issues Identified (11/14 resolved)

minor Publish the full sweep results (or compressed checksum) so others can reprodu... resolved
minor The data does not include the full sweep results or checksums, so reproductio... acknowledged
important The provided data does not include log–log regression results or error bars f... acknowledged
important Perform a log–log regression with confidence intervals and quantify goodness-... resolved
minor A full list or digest of uncovered denominators and timing logs is currently ... acknowledged
minor Provide the list (or hash digest) of uncovered denominators and timing logs s... resolved
minor The five logarithmic fit points are all empirical (N=10^6, 10^9, 10^10, 10^11... resolved
minor Clarify which points are empirical and which are forecasts; re-label the five... resolved
important The phrase '2.35× faster' is not rigorously supported in the absence of a mat... resolved
important Remove the numerical '2.35× faster' comparison or supply a rigorous bridge fr... resolved
important The 2.35x claim compares apples and oranges. Revise to acknowledge the distin... resolved
minor Five decades of extrapolation from 3 data points. The word 'predicted' should... resolved
minor Clarify that BK proves density 1 non-effectively, and that the exponent 2δ-1 ... resolved
minor IMPORTANT: The finding states the BK framework predicts power-law convergence... resolved

Peer-reviewed by Claude Opus 4.6. ACCEPT WITH REVISION: R(d) growth vs density convergence distinguished, logarithmic claim weakened to 'consistent with', 10^15 prediction marked speculative.

A={1,2} Density Grows Logarithmically, Not Power-Law

The Finding

For the digit set A={1,2}A = \{1,2\} (Hausdorff dimension δ=0.531\delta = 0.531, barely above the critical threshold 1/21/2), the Zaremba density as a function of range NN fits a logarithmic model almost exactly:

density(N)30.1+4.65log10(N)\text{density}(N) \approx 30.1 + 4.65 \cdot \log_{10}(N)

Range NNObserved densityPredicted (log model)Residual
10610^657.98%57.96%-0.02%
10910^972.06%71.97%+0.09%
101010^{10}76.55%76.62%-0.07%
101110^{11}80.75%80.64%+0.11%
101210^{12}84.58%85.10%-0.52%

With five data points spanning 6 decades (all five are empirical measurements at 10610^6, 10910^9, 101010^{10}, 101110^{11}, and 101210^{12}), the logarithmic fit is:

density(N)31.5+4.47log10(N)\text{density}(N) \approx 31.5 + 4.47 \cdot \log_{10}(N)

Regression statistics (OLS on 5 data points, x=log10(N)x = \log_{10}(N), y=density %y = \text{density \%}): a=31.5a = 31.5, b=4.47b = 4.47, R2=0.998R^2 = 0.998, max residual =0.52%= 0.52\% at N=1012N = 10^{12}. The fit explains 99.8% of variance across 6 decades. All raw density logs and data are published at cahlen/zaremba-density on Hugging Face.

Predictions

RangePredicted density
101310^{13}89.6%
101410^{14}94.0%
101510^{15}98.5%
100% at1015.310^{15.3}

The logarithmic model has held across 5 data points. Full density at 1015\sim 10^{15} remains the prediction.

Why This Matters

Relationship to BK Framework

The Bourgain-Kontorovich transfer operator framework predicts the representation count grows as R(d)d2δ1R(d) \sim d^{2\delta - 1}. For A={1,2}A = \{1,2\}:

2δ1=2(0.531)1=0.0622\delta - 1 = 2(0.531) - 1 = 0.062

Important distinction: the exponent 0.062 describes the growth of R(d)R(d) (how many CF representations each dd has), not the rate at which density (the fraction of dd with R(d)1R(d) \geq 1) converges to 100%. Density convergence depends on the full distribution of R(d)R(d) across integers, not just its mean growth. These are related but different quantities.

Our density data fits a logarithmic model (R² = 0.9984 across 5 points), but this cannot definitively rule out other functional forms (e.g., power-law with logarithmic corrections). The 5-point fit predicted 85.1% at 101210^{12}; the measured value is 84.58%, a residual of −0.53% — the largest deviation so far, possibly signaling the onset of sub-logarithmic curvature.

What this could mean

  1. Pre-asymptotic regime: At N1010N \leq 10^{10}, the system hasn’t yet reached the true asymptotic behavior. The logarithmic fit may break down at N>1012N > 10^{12} and transition to the slower power-law predicted by BK.

  2. Corrections to the leading term: The BK counting formula has error terms. If the error term is O(N2δ1ε)O(N^{2\delta - 1 - \varepsilon}) with ε\varepsilon small, the effective growth rate could appear faster than the leading exponent suggests.

  3. Logarithmic corrections: Some number-theoretic counting functions have log(N)\log(N) corrections. If R(d)d2δ1(logd)cR(d) \sim d^{2\delta-1} \cdot (\log d)^c for some c>0c > 0, this could produce the observed logarithmic density growth.

Tested prediction

The model made a sharp prediction for 101210^{12}. The run is now complete: the five-point log model predicts 85.10%85.10\%, while the observed value is 84.58%84.58\% (residual 0.52%-0.52\%). This is not a refutation, but it is the largest residual so far and should be treated as evidence that curvature may be appearing.

The next useful tests are 101310^{13} and beyond, but the current bitset implementation requires 1.25 TB for 101310^{13} and the committed 101310^{13}/101410^{14} logs currently show out-of-memory failures, not completed runs.

The Digit 1 Advantage: A Sigmoid

Our complete density sweep of all 1,023 subsets of {1,,10}\{1, \ldots, 10\} at N=106N = 10^6 reveals that digit 1’s advantage follows a sigmoid that peaks at cardinality 4. Full results: density_all_subsets_n10_1e6.csv (SHA-256: 4b052ecb952b..., 1,023 rows).

CardinalityAvg density (with 1)Avg density (without 1)Gap
211.0%0.1%10.9 pp
358.9%1.8%57.1 pp
492.1%12.7%79.4 pp
599.5%39.4%60.0 pp
6100.0%70.9%29.1 pp
7100.0%91.8%8.2 pp
8100.0%99.2%0.8 pp

At cardinality 4, digit 1 is worth 79 percentage points of density. By cardinality 8, the advantage shrinks to under 1 point — enough other digits compensate.

Exception Scaling: {1,2,k}

For the family A={1,2,k}A = \{1, 2, k\} at N=106N = 10^6, the number of uncovered integers grows rapidly with kk. A log–log OLS regression gives exponent β^=6.42\hat{\beta} = 6.42 (95% CI: [5.80, 7.04], R2=0.986R^2 = 0.986), i.e., exceptions 0.02k6.4\sim 0.02 \cdot k^{6.4}. The fit is good but the consecutive ratios fluctuate (1.5–5.8), indicating the power law is approximate:

kkExceptionsRatio to k1k-1
327
4642.4
53735.8
61,7204.6
75,3883.1
811,7462.2
921,7961.9
1033,0251.5

Adding larger third digits helps rapidly less. The “sweet spot” is k=3k = 3 (27 exceptions) — adding digit 4 gives 64 exceptions (2.4×), but adding digit 10 gives 33,025 (1,223×).

Reproduce

git clone https://github.com/cahlen/idontknow
cd idontknow

# GPU computation
nvcc -O3 -arch=sm_100a -o zaremba_density_gpu scripts/experiments/zaremba-density/zaremba_density_gpu.cu -lm
./zaremba_density_gpu 10000000000 1,2     # A={1,2} at 10^10
./zaremba_density_gpu 1000000 1,2,3       # A={1,2,3} at 10^6

Verification Hashes and Timing

All raw GPU output logs are committed to the repository. SHA-256 digests and wall-clock times:

NNCoveredDensityGPU timeLog file SHA-256
10610^6579,82057.982%< 1 s (CPU)14c69b3c0885...
10910^9720,615,32772.062%28.0 secc0c96d5817...
101010^{10}7,654,868,19176.549%88.4 s68a9512d8147...
101110^{11}80,754,334,63880.754%1,012 sbd5e57d5ef20...
101210^{12}845,791,333,63384.579%12,375 s5115d64d8c6b...

Full hashes: sha256sum scripts/experiments/zaremba-density/results/gpu_A12_*.log scripts/experiments/zaremba-density/results/density_A12_*.log

References

  1. Bourgain, J. and Kontorovich, A. (2014). “On Zaremba’s conjecture.” Annals of Mathematics, 180(1), pp. 137–196.
  2. Hensley, D. (1996). “A polynomial time algorithm for the Hausdorff dimension of continued fraction Cantor sets.” J. Number Theory, 58(1), pp. 9–45.
  3. Jenkinson, O. and Pollicott, M. (2001). “Computing the dimension of dynamically defined sets.” Ergodic Theory Dynam. Systems, 21(5), pp. 1429–1445.

Computed 2026-04-01 on 8× NVIDIA B200. This work was produced through human–AI collaboration (Cahlen Humphreys + Claude). Not independently peer-reviewed. All code and data open for verification.

Caveat: Five data points make a fit, not a proof. The logarithmic model has now been tested through 101210^{12}, and the largest residual occurs at the newest point. The prediction of 100% near 101510^{15} is speculative until more data points are collected.

Recent Updates

findingUpdate certifications and finding metadata from review cycle
findingMark Zaremba density experiment as complete
updateAdd Convergent-7B model showcase to front page
updateTighten language: empirical observations are not laws or theorems
experimentUpdate Ramanujan Machine: v1 exhausted (7K false positives), v2 kernel built
findingUpdate README: 18 findings, 53 reviews, 7 models, 3 providers
infraMCP server: fetch manifest from GitHub instead of bundling a copy
infraUpdate MCP server manifest: 207/210 issues resolved
updateUpdate stats: 207/210 issues resolved (98.6%), up from 191 (91%)
reviewFix stale review counts in llms.txt, llms-full.txt, meta.json, certifications.json