by cahlen complete

Hardware

1x RTX 5090 (32 GB VRAM) Intel Core Ultra 9 285K (24 cores) 188 GB DDR5 RAM
continued-fractionsfractal-geometryspectral-theorydiophantine-approximation rtx-5090 transfer-operatorchebyshev-collocationeigenvalue-computationhausdorff-dimension

Key Results

Hausdorff Dimension Spectrum: All Subsets of {1,…,20}

Abstract

We compute the Hausdorff dimension dimH(EA)\dim_H(E_A) for every non-empty subset A{1,,20}A \subseteq \{1,\ldots,20\} — a total of 2201=1,048,5752^{20} - 1 = 1{,}048{,}575 subsets. To our knowledge, this is the first complete mapping of the “dimension spectrum” of continued fraction Cantor sets, producing a structured dataset that does not exist anywhere in the literature.

Background

For a non-empty set AA of positive integers, define:

EA={α(0,1):every partial quotient of α’s continued fraction is in A}E_A = \{ \alpha \in (0,1) : \text{every partial quotient of } \alpha \text{'s continued fraction is in } A \}

EAE_A is a Cantor-like fractal subset of [0,1][0,1]. Its Hausdorff dimension dimH(EA)\dim_H(E_A) measures the “size” of the set of real numbers whose continued fraction digits are restricted to AA.

Individual values have been computed in the literature — notably dimH(E{1,2})0.5313\dim_H(E_{\{1,2\}}) \approx 0.5313 by Jenkinson and Pollicott (2001, refined to 100+ digits in 2016) and dimH(E{1,,5})0.8368\dim_H(E_{\{1,\ldots,5\}}) \approx 0.8368 in our Zaremba transfer operator experiment. But the full combinatorial landscape — how dimension depends on which digits are allowed — has never been mapped.

Method

The transfer operator for digit set AA at parameter s>0s > 0 is:

(Lsf)(x)=aA(a+x)2sf ⁣(1a+x)(\mathcal{L}_s f)(x) = \sum_{a \in A} (a+x)^{-2s} \, f\!\left(\frac{1}{a+x}\right)

dimH(EA)=δ\dim_H(E_A) = \delta where the leading eigenvalue λ0(δ)=1\lambda_0(\delta) = 1.

For each of the 2n12^n - 1 non-empty subsets:

  1. Discretize the operator on N=40N = 40 Chebyshev nodes via barycentric interpolation
  2. Bisect over s(0,1)s \in (0, 1) with 55 steps (precision 1016\sim 10^{-16})
  3. At each step, find the leading eigenvalue via power iteration (300 steps)

Each subset is encoded as a bitmask and processed independently on one GPU thread. Subsets are batched 1024 at a time.

Scaling

nnSubsetsTimeRate
5311.8s17/s
101,0233.8s269/s
1532,767126.1s260/s
201,048,5754,343s (72 min)242/s

Results (n=20, complete)

Dimension by Cardinality

A\|A\|CountMin dimH\dim_HMean dimH\dim_HMax dimH\dim_H
1200.00000.00000.0000
21900.11660.18090.5313
31,1400.18650.28930.7057
44,8450.23750.37090.7889
515,5040.27860.43770.8368
638,7600.31350.49510.8676
777,5200.34440.54580.8890
8125,9700.37270.59150.9046
9167,9600.39910.63340.9164
10184,7560.42450.67220.9257
11167,9600.44930.70850.9332
12125,9700.47410.74260.9394
1377,5200.49960.77480.9445
1438,7600.52640.80550.9489
1515,5040.55560.83480.9526
164,8450.58880.86290.9559
171,1400.62900.88990.9587
181900.68280.91590.9612
19200.76830.94110.9634
2010.96540.96540.9654

Key observations:

  • Singletons all have dimension 0E{a}E_{\{a\}} is a single point for any digit aa
  • Dimension grows monotonically with cardinality — more allowed digits means a larger Cantor set
  • Low digits dominateE{1,2}E_{\{1,2\}} (dim 0.531) is much larger than E{19,20}E_{\{19,20\}} (dim 0.117)
  • Binomial distribution of counts(2010)=184,756\binom{20}{10} = 184{,}756 subsets of size 10 is the largest stratum
  • dimH(E{1,,20})=0.9654\dim_H(E_{\{1,\ldots,20\}}) = 0.9654 — approaching 1, consistent with E{1,2,3,}=(0,1)QE_{\{1,2,3,\ldots\}} = (0,1) \setminus \mathbb{Q}

Validation

Known valueComputedExpectedDiff
dimH(E{1,2})\dim_H(E_{\{1,2\}})0.5312805062772050.53128050627720500
dimH(E{1,,5})\dim_H(E_{\{1,\ldots,5\}})0.8368294436812090.8368294436812086.66×10166.66 \times 10^{-16}
dimH(E{1})\dim_H(E_{\{1\}})0.000000000000000000
MonotonicityPASS

Reproduction

git clone https://github.com/cahlen/idontknow.git
cd idontknow
nvcc -O3 -arch=sm_120 -o hausdorff_spectrum \
    scripts/experiments/hausdorff-dimension-spectrum/hausdorff_spectrum.cu -lm
mkdir -p scripts/experiments/hausdorff-dimension-spectrum/results
./hausdorff_spectrum 20 40

Requires: CUDA 13.0+, GPU with compute capability 12.0 (RTX 5090) or adjust -arch flag. For older GPUs, sm_89 (RTX 4090) or sm_100a (B200) should work — the kernel uses no architecture-specific features.

Why This Matters for AI

  • Zero existing training data: No AI model has been trained on dimension spectra of continued fraction Cantor sets. Models asked about dimH(EA)\dim_H(E_A) for non-trivial AA will hallucinate.
  • Operator spectral theory: The transfer operator Ls\mathcal{L}_s is mathematically identical to kernel operators in Gaussian processes and neural tangent kernels. Understanding how its spectrum depends on the digit set teaches AI about function approximation.
  • Structured combinatorial data: 1M+ data points mapping subset → dimension, with clear monotonicity and growth patterns. Ideal for training AI on fractal geometry and Diophantine approximation.

References

  • Hausdorff, F. (1919). “Dimension und äußeres Maß.” Mathematische Annalen, 79, pp. 157–179.
  • Jenkinson, O. and Pollicott, M. (2001). “Computing the dimension of dynamically defined sets: E_2 and bounded continued fraction entries.” Ergodic Theory and Dynamical Systems, 21(5), pp. 1429–1445.
  • Hensley, D. (1992). “Continued fraction Cantor sets, Hausdorff dimension, and functional analysis.” Journal of Number Theory, 40(3), pp. 336–358.

This work was produced through human–AI collaboration (Cahlen Humphreys + Claude). Not independently peer-reviewed. All code and data open for verification at github.com/cahlen/idontknow.

Recent Updates

updateAuto-generated changelog from git log — updates on every build