Zaremba’s Conjecture (A=5): Proof Framework
Statement
Zaremba’s Conjecture (1972). For every integer , there exists with such that has all .
Status
- Certified computation: for all from the hardened v6.1 kernel with an explicit no-overflow certificate.
- Large-scale evidence: the original v6 B200 run reports for all , but this is strong computational evidence, not a certified result, because the kernel could silently clip frontiers.
- Conditional analytic framework: the Magee-Oh-Winter congruence-counting route suggests a threshold , but this depends on unresolved operator truncation, theorem-matching, and constant-propagation gaps.
- Rigor level: finite-matrix arithmetic is partially certified with arb/MPFR, but the transport from finite Galerkin matrices to the infinite-dimensional transfer operators is not yet proved. Paper is a proof framework, not a complete proof.
Full paper: PDF · LaTeX source · Verification manifest
Proof Architecture
The framework combines three ingredients (see paper PDF for full details):
1. Brute-Force Verification ()
GPU matrix enumeration (v6 multi-pass kernel) reports every integer from 1 to 210 billion as marked. Zero uncovered. Runtime: 6,962.2 seconds (116 minutes) on 8× NVIDIA B200 (Blackwell, 183 GB each, CUDA 13.0, driver 580.126.09). Chunk configuration: 256 rounds × 8 GPUs, 119,210 seeds per chunk.
Exact invocation: ./matrix_v6 210000000000 (compiled with nvcc -O3 -arch=sm_100a scripts/experiments/zaremba-effective-bound/matrix_enum_multipass.cu -lpthread). Input: single argument . Output: log reports zero uncovered integers. SHA256 checksums of the source and log are recorded in the verification manifest. The original run did not produce a per-level no-overflow certificate.
Certification status. The original v6 kernel increments out_count for every expansion but only writes the matrix if pos < max_out, then clips the next frontier to min(h_out, BUF_SLOTS) rather than aborting on overflow. The manifest records Uncovered: 0, but the kernel did not emit a machine-checkable no-overflow certificate.
Empirical update (2026-04-22). Local v6.1 probe runs on a single RTX 5090 at the exact 210B chunk size (119,210 seeds per chunk) measure:
max_d = 10⁸: true unclipped Phase B peak frontier (no overflow; 95.5% of the B200BUF_SLOTS)max_d = 10⁹: (17.5 trillion) overflow events at localBUF_SLOTS, confirming the true per-level frontier repeatedly exceeded — but the exact true peak cannot be recovered from a clipped probe
Because per-chunk peak is monotonically non-decreasing in max_d under fixed chunk size, the true peak at is at least and very likely larger. Whether the excess is small (keeping it under B200’s ) or large (past ) cannot be determined by our local probes — that requires a B200 v6.1 re-run.
What this does not say. Clipping does not directly invalidate Uncovered = 0. The bitset atomicOr mark fires before the pos < max_out check, so a clipped matrix still marks its denominator in the bitset; what is lost is its descendants. Because the 244 million Phase A seeds produce massively redundant CF coverage, denominators whose CF paths were clipped are almost always marked by other, unclipped paths. It is entirely consistent that Uncovered = 0 is correct even with significant clipping. The problem is that the v6 kernel does not prove this — it emits no machine-checkable certificate. This is a software-audit gap, not a mathematical one.
Path to certification. A hardened replacement, matrix_enum_multipass_v6_1.cu, adds a hard overflow abort and a final no-overflow certificate block. Re-running the 210B configuration with v6.1 on 8× B200 (or equivalent ≥ 1.5 TB aggregate GPU memory) and checking that the certificate reports All peaks < BUF_SLOTS: YES and No-overflow abort fired: NO upgrades the claim to certified status. If that run aborts with No-overflow abort fired: YES, the correct response is to increase num_rounds (e.g. to 512 or 1024) to reduce per-chunk seeds until the chunk size is safe, and record the new configuration as canonical. See paper/CERTIFICATE.md for the full probe log trail and audit procedure.
2. Spectral Gap Computation (11 primes, FP64)
For each prime , the spectral gap of the congruence transfer operator is computed at FP64 precision ( Chebyshev collocation, cuBLAS power iteration). All gaps . Note: 256-bit MPFR arithmetic controls round-off, but a rigorous bound on the truncation error between the discretization and the infinite-dimensional operator (e.g., via the Keller-Liverani perturbation framework) has not been established. The “certification” applies to the finite Galerkin matrix, not rigorously to the full operator.
| Error bound | ||
|---|---|---|
| 2 | 0.845 | 0.183 |
| 3 | 0.745 | 0.342 |
| 5 | 0.956 | 0.046 |
| 7 | 0.978 | 0.022 |
| 11 | 0.886 | 0.129 |
| 13 | 0.530 | 0.887 |
| 17 | 0.912 | 0.097 |
| 19 | 0.957 | 0.045 |
| 23 | 0.861 | 0.161 |
| 29 | 0.616 | 0.623 |
| 31 | 0.780 | 0.282 |
3. Covering Argument (Frolenkov-Kan Sieve)
Sieve Bound. For coprime to prime with spectral gap , the Frolenkov-Kan sieve gives:
where (from the Lalley renewal theorem, Appendix A of the paper) and . For and any covering prime with : the main term exceeds the error . So for all coprime to .
Layered Covering. The covering proceeds in layers. For any integer , exactly one of the following holds:
-
Layer 1: is coprime to some prime . The sieve at gives . This covers all (since such cannot be divisible by all 11 primes), PLUS all larger that happen to miss at least one of these primes.
-
Layer 2: is divisible by every prime but coprime to some prime (all verified at FP64 with ). The sieve at gives . This covers all that weren’t covered by Layer 1.
-
Layer 3: is divisible by every prime but coprime to some prime (489 primes verified at FP64). The sieve at gives . This covers all not covered by previous layers.
-
Layer 4 (non-constructive tail): is divisible by every prime . Then . By Bourgain-Gamburd (2008), property () holds: there exists with for all primes . Since has finitely many prime factors, there exists a prime with . The F-K sieve at this gives for sufficiently large (depending on the non-effective constant ).
Critical note on Layer 4: This layer uses the non-constructive Bourgain-Gamburd property (), making it non-effective. However, no integer smaller than can reach this layer, and the brute-force verification to provides a massive additional safety margin for Layers 1-3.
Note: The layered covering above is the supplementary BK/sieve perspective (Appendix B of the paper). The main proof of Theorem 2 uses the MOW framework instead, which avoids the non-constructive Layer 4 entirely — see “The Magee-Oh-Winter Framework” section below.
Mathematical Setup
The Semigroup
For each digit , define . The continued fraction has numerator and denominator given by the matrix product . The representation count is . Zaremba’s Conjecture is equivalent to for all .
The Transfer Operator
The Ruelle transfer operator at parameter :
The Hausdorff dimension is the unique where . The leading eigenfunction (Patterson-Sullivan density) satisfies with .
Computed Constants
| Quantity | Value | Method |
|---|---|---|
| Hausdorff dimension | Chebyshev N=40, bisection | |
| Eigenfunction | Power iteration, 1000 steps | |
| Pressure derivative | Hellmann-Feynman formula | |
| Renewal constant | Lalley renewal theorem | |
| Untwisted spectral gap | Deflated power iteration | |
| Dolgopyat bound | Numerical (arb ball arithmetic on N=80 Galerkin matrix, not a computer-assisted Dolgopyat proof; no a-posteriori bound transporting to the full operator) | |
| Power savings | $-\log(\rho_\eta)/ |
Transitivity (Algebraic Proof)
Theorem. The semigroup acts transitively on for every prime .
Proof. Let . By Dickson’s classification:
- Not Borel: has -entry for all primes.
- Not Cartan normalizer: and have characteristic polynomials and . If they shared an eigenvector, subtracting gives , but . Contradiction.
- Not exceptional for : .
- Small primes : verified by exhaustive BFS.
Therefore , and every integer is admissible (no local obstructions).
The Magee-Oh-Winter Framework (Theorem 2)
The key upgrade from previous approaches: the Magee-Oh-Winter uniform congruence counting theorem (Crelle 2019) gives a pointwise power-saving error for the continued fractions semigroup, avoiding the circle-method minor-arc barrier entirely.
MOW Theorem: For the continued fractions semigroup :
with and independent of . This uses Dolgopyat-type transfer operator estimates, not the circle method.
From norm balls to denominators (Tauberian):
For : need , giving threshold .
The Calderón-Magee explicit spectral gap (JEMS 2025) applies to Schottky subgroups with (our qualifies), making computable in principle.
Result: , , . Margin: .
Important caveats (DISPUTED — not yet rigorous):
- The original MOW paper does not supply all constants explicitly; our constant extraction is a plausible numerical estimate, not a theorem.
- The Calderón-Magee result gives an explicit gap only for untwisted operators; extending to twisted operators at all requires additional argument.
- No detailed constant-tracking appendix (Sage or arb notebook) reproducing every numeric inequality is available. Until such a notebook is produced and independently verified, is an estimate, not a rigorous bound.
- The claim that “all load-bearing spectral data” are arb-certified applies only to the finite Galerkin matrices, not to the full infinite-dimensional operators.
Representation Growth
From our GPU computation (5.3 seconds, one B200):
Theoretical prediction: . The slight undercount (0.654 vs 0.674) is expected from finite-depth effects. Minimum at . Zero exceptions in . Full dataset: 1M rows CSV.
Reproduction
git clone https://github.com/cahlen/idontknow
cd idontknow
# Step 1: Brute force (requires 8× NVIDIA B200 or similar for 210B).
# Use v6.1 to get a no-overflow certificate in the final output.
nvcc -O3 -arch=sm_100a -o matrix_v6_1 \
scripts/experiments/zaremba-effective-bound/matrix_enum_multipass_v6_1.cu -lpthread
./matrix_v6_1 210000000000
# Step 2: Spectral gaps (requires cuBLAS)
nvcc -O3 -arch=sm_100a -o extract_ef \
scripts/experiments/zaremba-effective-bound/extract_eigenfunction.cu -lcublas -lm
./extract_ef # outputs h(0) and gaps for primes ≤ 97
New Computations (2026-03-29)
MPFR-Certified Spectral Gaps
All 11 covering primes certified at 256-bit MPFR precision (77 decimal digits) with guaranteed rounding. All gaps . This upgrades FP64 measurements to rigorous bounds.
Dolgopyat Spectral Profile (Exact Eigendecomposition)
We computed the spectral radius of via exact eigendecomposition (LAPACK ZGEEV on the full 80×80 complex matrix) for 100,000 -values.
Critical finding: Power iteration is unreliable for the twisted transfer operator — at certain values, multiple eigenvalues of similar magnitude with different phases cause oscillation instead of convergence. Full eigendecomposition is required.
- (arb-certified on , MOW kernel decay for tail). Note: this bound applies to the Chebyshev discretization of the transfer operator, not the full infinite-dimensional operator. A rigorous a-posteriori truncation error bound (e.g., via Keller-Liverani framework) transporting this result to the true operator has not been established.
- At : (70 certified digits)
- For all : uniformly
Renewal Constant
from the Lalley renewal theorem, computed via the Hellmann-Feynman formula using the left eigenmeasure and right eigenfunction of .
D₀ Calculation
With the corrected Dolgopyat bound, the MOW constant extraction gives:
- Optimal contour shift:
- — a factor of 6× below the brute-force frontier of
- (conservative above Naud bound ), all constants arb/MPFR-certified except (mpmath, 10% margin)
Relation to Shkredov (2026)
Independently and two weeks prior, Ilya Shkredov (arXiv:2603.14116, March 14, 2026) proved that for sufficiently large primes , there exists coprime to with all partial quotients of bounded by . This is a major theoretical advance but does not resolve Zaremba’s Conjecture as originally stated:
| Shkredov (2026) | This work | |
|---|---|---|
| Bound on partial quotients | (growing) | (constant) |
| Denominators | Sufficiently large primes | All integers |
| Method | Analytic number theory | GPU computation + F-K sieve |
| Computational component | None | 8× NVIDIA B200, ~2 hours |
| Status | Partial (asymptotic) | Conditional framework (computational) |
The two results are independent and complementary. Shkredov’s purely analytic approach validates the spectral/semigroup framework from a theoretical direction. Our computation provides, to our knowledge, the largest brute-force verification and the most explicit spectral data computed for this problem.
References
- Zaremba, S.K. (1972). “La méthode des ‘bons treillis’ pour le calcul des intégrales multiples.” Applications of Number Theory to Numerical Analysis, pp. 39–119.
- Shkredov, I.D. (2026). “On some results of Korobov and Larcher and Zaremba’s conjecture.” arXiv:2603.14116.
- Frolenkov, D.A. and Kan, I.D. (2014). “A strengthening of a theorem of Bourgain-Kontorovich II.” Moscow Journal of Combinatorics and Number Theory, 4(1), pp. 24–117.
- Bourgain, J. and Kontorovich, A. (2014). “On Zaremba’s conjecture.” Annals of Mathematics, 180(1), pp. 137–196.
- Bourgain, J. and Gamburd, A. (2008). “Uniform expansion bounds for Cayley graphs of .” Annals of Mathematics, 167(2), pp. 625–642.
- Dickson, L.E. (1901). Linear Groups with an Exposition of the Galois Field Theory. B.G. Teubner, Leipzig.
- Huang, ShinnYih (2015). “An improvement to Zaremba’s conjecture.” Geometric and Functional Analysis, 25(3), pp. 860–914.
- Magee, M., Oh, H., and Winter, D. (2019). “Uniform congruence counting for Schottky semigroups in SL₂(Z).” J. reine angew. Math. (Crelle), 753, pp. 89–135.
- Calderón, I. and Magee, M. (2025). “Explicit spectral gap for Schottky subgroups of SL(2,Z).” J. Eur. Math. Soc.
- Lalley, S.P. (1989). “Renewal theorems in symbolic dynamics.” Acta Math., 163, pp. 1–55.
Computed 2026-03-29 on 8× NVIDIA B200 (1.43 TB VRAM) + RTX 5090. This work was produced through human–AI collaboration: the proof strategy, CUDA kernels, interval arithmetic, and documentation were developed jointly by Cahlen Humphreys and AI agents (Claude). The mathematical arguments have not been independently peer-reviewed. All code and data are open for verification at github.com/cahlen/idontknow. Published at bigcompute.science.