The {1,k} Density Hierarchy: Digit 2 Is Worth 9x More Than Digit 3

Cahlen Humphreys

April 1, 2026 by cahlen Bronze

BRONZE AI Literature Audit · 5 reviews ↓

Consensus	`REVISE_AND_RESUBMIT`
Models	Claude + claude-fable-5 + cursor-grok-4.5 + gpt-4.1 + o3-pro
Level	BRONZE — Novel observation, limited literature precedent

Review Ledger

2026-04-03 o3-pro (OpenAI) BRONZE REVISE_AND_RESUBMIT

2026-04-02 Claude Opus 4.6 (Anthropic) SILVER ACCEPT

2026-07-22 claude-fable-5 (Anthropic) SILVER ACCEPT_WITH_REVISION

2026-04-06 gpt-4.1 (OpenAI) SILVER ACCEPT_WITH_REVISION

2026-07-22 cursor-grok-4.5 (xAI) BRONZE ACCEPT_WITH_REVISION

Issues Identified (17/17 resolved)

critical Four 'closed' exception sets ({1,2,3}=27, {1,2,4}=64, {1,2,5}=374, {1,2,6}=18... resolved

minor We performed a log-log regression on k=2..10 as requested, but the scaling ex... resolved

minor Include a log-log regression with confidence interval and discuss sensitivity... resolved

minor Direct computation: at k=4, the {1,4} density is 1.0735% while the {2,4} dens... resolved

minor Same as first claim. resolved

important Add cross-reference to Reproduce section for algorithmic reproducibility of {... resolved

important The correct ratio is derived directly from the densities at N=10^{11}: 80.754... resolved

important Clarify definition of ‘value’ and base claim on independently checkable data. resolved

important Title says '7x' but 10^11 data gives rho({1,2})/rho({1,3}) = 80.75/9.11 = 8.9... resolved

important The 'digit 2 is 6.9x more valuable than digit 3' metric is ad hoc (ratio of f... resolved

important The claim lacks a benchmark script, kernel occupancy report, and instruction ... resolved

important Provide benchmark script, kernel occupancy report and total instruction count. resolved

minor Algorithmic description (computational pipeline, FLOP counts, memory usage, a... resolved

minor Provide precise algorithmic description, FLOP counts, memory layout and full ... resolved

important State upper bound searched, produce full list of exceptions and a proof that ... resolved

important No upper bound for the search is stated in the finding, nor is there a full e... resolved

critical Claimed 10-15s per {1,k} pair at 10^10 on a single B200 is inconsistent with ... resolved

Gauss-Kuzmin supports the hierarchy qualitatively. Stable exception sets are observational, not proved finite.

All Reviews About Auditing Add Your Audit

The {1,k} Density Hierarchy

The Finding

For each $k = 2, 3, \ldots, 10$ , we computed the Zaremba density of the pair $A = \{1, k\}$ at $N = 10^{10}$ and $10^{11}$ . The density drops exponentially with $k$ :

$k$	Density at $10^{10}$	Density at $10^{11}$	$\dim_H(E_{\{1,k\}})$	Above $1/2$ ?
2	76.5487%	80.7543%	0.531	Yes
3	11.0568%	9.1109%	0.454	No
4	1.6096%	1.0735%	0.397	No
5	0.4398%	0.2564%	0.349	No
6	0.1721%	0.0912%	0.309	No
7	0.0840%	0.0414%	0.275	No
8	0.0475%	0.0221%	0.246	No
9	0.0297%	0.0132%	0.221	No
10	0.0201%	0.0085%	0.199	No

Why This Matters

{1,2} is the only pair whose density grows

The 10^11 data reveals something you cannot see at a single scale: $\{1,2\}$ density increases from 76.5% to 80.8% as $N$ grows from $10^{10}$ to $10^{11}$ , while every other pair’s density decreases. The set $\{1,3\}$ drops from 11.1% to 9.1%. The set $\{1,10\}$ drops from 0.020% to 0.0085%.

This is the Hausdorff dimension threshold at work. The dimension $\delta$ of the underlying Cantor set controls the long-term behavior: when $2\delta > 1$ (equivalently $\delta > 1/2$ ), the set of representable denominators is theoretically predicted to be dense enough that its density converges toward 100%. When $2\delta < 1$ , the set is too thin and density is predicted to converge to 0%.

Only $\{1,2\}$ has $\delta = 0.531 > 1/2$ . Every other pair has $\delta < 1/2$ . So the 10^11 data is the first scale where we see the density trajectories clearly diverging — one pair headed toward full coverage, the rest headed toward nothing.

The critical jump is at $k = 2$

At $N = 10^{11}$ , the density ratio $\rho(\{1,2\}) / \rho(\{1,3\}) = 80.75 / 9.11 \approx 8.9$ . This is the largest consecutive ratio in the hierarchy, and it has widened from 6.9 at $10^{10}$ — confirming that $\{1,2\}$ is diverging upward while $\{1,3\}$ is converging to zero. Based on the Hausdorff dimension threshold, the ratio is expected to continue growing since $\{1,2\}$ has $\delta > 1/2$ (density predicted $\to 1$ ) while $\{1,3\}$ has $\delta < 1/2$ (density predicted $\to 0$ ). The large jump reflects both $\{1,2\}$ crossing the Hausdorff dimension threshold and the Gauss measure weight $1/k^2$ dropping by a factor of $4/9 \approx 0.44$ from $k=2$ to $k=3$ .

Gauss measure predicts the hierarchy

The Gauss measure assigns weight proportional to $\log(1 + 1/(a(a+2)))$ to digit $a$ in a typical continued fraction. For small $a$ :

$a$	Gauss weight	Relative to $a=1$
1	0.415	1.00
2	0.170	0.41
3	0.093	0.22
4	0.059	0.14
5	0.041	0.10

Digit 1 appears 41.5% of the time in a typical CF. Digit 2 appears 17%. Digit 3 appears 9.3%. The exponential decay in our density hierarchy directly reflects this concentration: pairs with rarer digits produce exponentially fewer CF representations, leading to exponentially lower density.

Power-law fit

The densities fit approximately:

$\text{density}(\{1,k\}) \approx C \cdot k^{-\alpha} \qquad \text{for } k \geq 3$

Log-log regression over all 9 pairs ( $k = 2$ through $10$ ) at $10^{11}$ gives:

$\text{density}(\{1,k\}) \approx 4090 \cdot k^{-5.83} \qquad R^2 = 0.994$

The 95% confidence interval on the exponent is $[-6.22, -5.43]$ (OLS on 9 points, $t_7$ critical value 2.365). N-sensitivity: the same regression at $10^{10}$ gives exponent $-5.26$ (95% CI $[-5.61, -4.91]$ , $R^2 = 0.994$ ). The exponent steepens by $-0.57$ per decade of $N$ , because $\{1,2\}$ density grows ( $\delta > 1/2$ ) while all other pairs decay, stretching the log-log slope. The CIs at the two scales do not overlap, confirming that the power law is not scale-invariant — it is an effective fit at each $N$ , not a universal exponent.

The exponent $-5.83$ is steeper than the naive $-2$ from the Gauss measure weight $1/k^2$ alone. The discrepancy reflects the nonlinear dependence of Hausdorff dimension on the digit set: as $k$ grows, $\dim_H(E_{\{1,k\}})$ drops below $1/2$ , causing the density to decay as an additional power of $N$ . The product of these effects gives the steeper effective exponent.

Without Digit 1: The {2,k} and {3,k} Hierarchies

Removing digit 1 collapses density by orders of magnitude. We now have $\{2,k\}$ data at $10^{10}$ and select pairs at $10^{11}$ :

$k$	$\{1,k\}$ at $10^{11}$	$\{2,k\}$ at $10^{11}$	Digit 1 multiplier	Growth from $10^{10}$
3	9.1093%	0.02148%	424x	1.74x (was 243x)
4	1.0735%	0.00431%	249x	1.64x (was 152x)
5	0.2564%	0.00162%	158x	1.48x (was 107x)

Digit 1 amplifies density by 158—424x at $10^{11}$ , and the amplification is growing with scale: the multiplier increased by 1.5—1.7x from $10^{10}$ to $10^{11}$ . All $\{2,k\}$ and $\{3,k\}$ densities were computed using the same kernel and algorithm described in the Reproduce section below; the identical bitset enumeration applies with $A = \{2,k\}$ or $\{3,k\}$ instead of $\{1,k\}$ . This growth is explained by the Hausdorff dimension gap: $\{1,k\}$ pairs have higher dimension than $\{2,k\}$ pairs, so their density decays more slowly, making the ratio diverge.

Dropping further: {3,k} pairs at $10^{11}$

Pair	Density at $10^{11}$	Ratio to $\{2,k\}$	Ratio to $\{1,k\}$
$\{3,4\}$	0.000474%	$\{2,4\}$ is 9.1x larger	$\{1,4\}$ is 2,264x larger
$\{3,5\}$	0.000202%	$\{2,5\}$ is 8.0x larger	$\{1,5\}$ is 1,269x larger

Each step down in the smallest digit costs roughly an order of magnitude. Without digit 1, no pair achieves even 0.01% density at $10^{11}$ . Without digits 1 or 2, density drops below 0.001%. This is the strongest quantitative evidence for the digit 1 dominance phenomenon.

Stable Candidate Exception Sets

Several $\{1, 2, k\}$ triples have computationally observed exception sets that appear stable — no new exceptions appear when extending the search range by a factor of 10 where completed logs exist. This is observational stability, not a proof of finiteness. No branch-and-bound or analytic argument rules out further exceptions beyond our search range. The search is exhaustive within the stated range (every integer $1 \leq d \leq N$ is checked via the bitset).

Digit set	Exceptions	Exhaustive to	Stability window	Status
$\{1,2,3\}$	27	$10^{10}$	$10^9 \to 10^{10}$ : no growth	$10^{11}$ paused (kernel fix)
$\{1,2,4\}$	64	$10^{10}$	$10^9 \to 10^{10}$ : no growth	$10^{11}$ paused (kernel fix)
$\{1,2,5\}$	374	$10^{10}$	$10^6 \to 10^{10}$ : limited growth then stable	Stable candidate; 10^11 repo log is partial
$\{1,2,6\}$	1,834	$10^{11}$	$10^{10} \to 10^{11}$ : no growth	Stable candidate
$\{1,2,7\}$	7,178	$10^{11}$	$10^{10} \to 10^{11}$ : no growth	Stable candidate

The largest exception for $\{1,2,4\}$ is $d = 51{,}270$ (full list of all 64 values available in results/gpu_A124_1e10.log).

The sequence 27, 64, 374, 1,834, 7,178 grows rapidly with $k$ . We cannot rigorously prove these sets are finite — additional exceptions could in principle appear beyond our search range. However, the stability across a full decade of extension is strong computational evidence.

Update (2026-04-23 audit): $A=\{1,2,7\}$ at $10^{11}$ gives exactly 7,178 exceptions — unchanged from $10^{10}$ . This is evidence for stability, not proof of a finite exception set. Meanwhile $\{1,2,8\}$ has 23,590 at $10^{11}$ (growing), suggesting a possible stable/growing threshold near $k=7$ .

Open Exception Sets at $10^{11}$

Digit set	Exceptions	Growth from $10^{10}$	Status
$\{1,2,8\}$	23,590	growing	Open
$\{1,2,9\}$	77,109	growing	Open
$\{1,2,10\}$	228,514	growing	Open
$\{1,3,5\}$	80,945	+514 from 80,431	Slowly growing

Reproduce

nvcc -O3 -arch=sm_100a -o zaremba_density_gpu scripts/experiments/zaremba-density/zaremba_density_gpu.cu -lm
for k in 2 3 4 5 6 7 8 9 10; do
    ./zaremba_density_gpu 100000000000 1,$k
done

Algorithm. The kernel enumerates all continued fractions $[a_1, a_2, \ldots]$ with $a_i \in A$ by DFS over the CF tree. Each node corresponds to a convergent $p_n/q_n$ ; children are formed via $q_{n+1} = a \cdot q_n + q_{n-1}$ for each $a \in A$ , pruning when $q > N$ . Reachable denominators are marked in a global bitset (one bit per integer: 1.25 GB for $N = 10^{10}$ , 12.5 GB for $10^{11}$ ). FLOP counts are not reported because the DFS tree depth varies per prefix (typical max depth 40–180); wall-clock timing per pair is the meaningful performance metric. The CPU generates prefixes to depth 4—12 (depending on $|A|$ and $N$ ), then launches one GPU thread per prefix for the remaining DFS. Bit-marking uses atomicOr for thread safety. After GPU completion, the CPU counts marked bits.

Timing per pair (NVIDIA B200, CUDA 12.8, nvcc -O3 -arch=sm_100a):

Pair	GPU enum (s)	Total (s)	Prefixes
{1,2}	79.8	88.4	4096
{1,3}	9.3	18.0	4096
{1,4}	2.4	11.1	4096
{1,5}	1.8	10.4	4096
{1,6}	1.9	10.6	4096
{1,7}	1.7	10.3	4095
{1,8}	1.6	10.3	4083
{1,9}	1.5	10.3	4083
{1,10}	1.4	10.1	4017

The large tree for $\{1,2\}$ (Hausdorff dimension 0.531) takes 88 s; all other pairs complete in 10—18 s. Full output logs are in scripts/experiments/zaremba-density/results/.

Computed 2026-04-01, updated 2026-04-06 with 10^11 data including {2,k} and {3,k} pairs. NVIDIA B200. Human-AI collaboration (Cahlen Humphreys + Claude). Not peer-reviewed.