Zcash Block Time Reduction Appears Safe for NU7 w/ Zebra Only Devnet

evan-forbes · May 4, 2026, 4:41pm

Authors: Evan Forbes, Dev Ojha (@ValarDragon)

Valar Group

Summary

A key question when choosing PoW block times is what happens to stale block rates and fork/re-org rates. Lower block times improve UX for users and market makers by giving them faster confirmation times for small transactions. However, lower block times increase the stale block rate, because block propagation delay takes up a larger percentage of the block time. We want to understand how stale block rates behave on very decentralized, Zebra-only networks at a block time of 25s.

We empirically measure block propagation delay, block times, stale heights, and re-org rates in experiments using 100 geographically distributed Zebra nodes. The nodes are split across 19 regions, including the US, Europe, India, Australia, and Singapore. They are also split across cloud providers. Hashpower is evenly distributed across these nodes. With properly configured TCP connections, the experiment stays within a safe operating range: a sub-5% reach-based expected stale-block rate, a sub-5% observed stale-height rate, and a sub-0.5% observed higher-cumulative-work branch-switch rate. This keeps us at roughly the same stale rate that Ethereum PoW had on mainnet with its 12.5 second blocks. A mainnet comparison is consistent with the expectation that mainnet should have somewhat lower stale and fork rates than these deliberately decentralized devnets.

In tandem we build a theoretical model for how to estimate stale rate, and validate it with the experimental data.

Stale-block rates and fork/reorg rates can be modeled from the time it takes a new block to propagate through the network. In an experiment with 100 geographically distributed Zebra nodes, measured block propagation, block times, stale-height events, and higher-cumulative-work branch switches all remained in the expected safe operating range: a sub-5% expected stale-block rate, a sub-5% observed stale-height rate, and a sub-0.5% observed higher-cumulative-work branch-switch rate.

This leads us to conclude that it is safe for NU7 to decrease the target spacing to 25 seconds, pending a large portion of the network adjusting their TCP configurations.

Theory

Note: This model is a baseline for honest propagation-induced stale blocks. It assumes miners publish blocks when found, mine the best tip they currently know, and that the target block rate is approximately stationary over the time window of interest.

Core Model

The core quantity is tau_eff, the network's work-weighted old-tip mining time after a new block is found. "Target spacing" is the average block time as defined in Zebra:

expected stale blocks per accepted block = tau_eff / target spacing

or:

phi = tau_eff / T

where:

T is the target spacing
phi is the expected stale blocks per accepted block
tau_eff is measured in old-tip work-seconds

For discrete miners, if miner i has work share w_i and receives and verifies the block after d_i seconds:

tau_eff = sum_i(w_i * d_i)

A miner with 10% of expected block-producing work that keeps mining the old tip for 8s adds 0.8s to tau_eff. A miner with 30% of expected block-producing work that keeps mining the old tip for 2.5s adds 0.75s. Add those weighted times and divide by target spacing to get expected stale blocks per accepted block.
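As a minimal sketch of the sum above (not the companion model.py), the two miners from the example can be plugged in directly. The helper names are illustrative, and the third entry assumes the remaining 60% of work switches tips instantly, which is an assumption added only so the shares sum to 1:

```python
def tau_eff(miners):
    """Sum w_i * d_i over (work_share, old_tip_seconds) pairs."""
    return sum(w * d for w, d in miners)


def stale_rate(miners, target_spacing):
    """phi = tau_eff / T, the expected stale blocks per accepted block."""
    return tau_eff(miners) / target_spacing


# (work share, seconds spent mining the old tip)
miners = [
    (0.10, 8.0),   # from the text: adds 0.8s
    (0.30, 2.5),   # from the text: adds 0.75s
    (0.60, 0.0),   # assumption: everyone else switches instantly
]
T = 25.0  # target spacing in seconds

print(f"tau_eff = {tau_eff(miners):.2f}s")
print(f"phi     = {stale_rate(miners, T):.2%}")
```

In practice no miner switches instantly, so this understates tau_eff; it only shows how the weighted terms combine.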

[Figure: tau_eff_intuition]

Equivalently, let u(t) be the fraction of total work still mining the old tip t seconds after a new block is found. Then:

tau_eff = integral_0^inf u(t) dt

So tau_eff compresses the full propagation curve into one effective old-tip mining time. The model is linear up to this point: if one miner keeps mining the old tip twice as long, that miner’s contribution to tau_eff and phi doubles.
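A short sketch of that equivalence, assuming a handful of discrete miners so that u(t) is a step function. The miner values and function names are made up for illustration:

```python
def old_tip_fraction(miners, t):
    """u(t): total work share of miners whose delay d_i is still ahead of t."""
    return sum(w for w, d in miners if d > t)


def integrate_old_tip_curve(miners):
    """Exact area under the step function u(t) from 0 to the largest delay."""
    area, prev = 0.0, 0.0
    for d in sorted({d for _, d in miners}):
        # u(t) is constant on [prev, d), so the slice is a rectangle.
        area += old_tip_fraction(miners, prev) * (d - prev)
        prev = d
    return area


def weighted_sum(miners):
    """tau_eff as the discrete sum of w_i * d_i."""
    return sum(w * d for w, d in miners)


# Hypothetical (work share, delay in seconds) pairs.
miners = [(0.10, 8.0), (0.30, 2.5), (0.60, 0.5)]
# Same network, but the first miner stays on the old tip twice as long.
slow = [(0.10, 16.0), (0.30, 2.5), (0.60, 0.5)]

print(f"integral of u(t): {integrate_old_tip_curve(miners):.2f}s")
print(f"sum of w_i * d_i: {weighted_sum(miners):.2f}s")
print(f"extra tau_eff from doubling one delay: "
      f"{weighted_sum(slow) - weighted_sum(miners):.2f}s")
```

The extra tau_eff from doubling the first miner's delay equals that miner's original contribution (0.10 * 8s), which is the linearity described above.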

What The Model Predicts

Under the standard Poisson approximation, if S is the number of stale competing blocks caused by one accepted block, then S ~ Poisson(phi). This gives the following quantities:

quantity	formula
expected stale blocks per accepted block	`E[S] = phi = tau_eff / T`
probability of at least one stale competing block	`P[S >= 1] = 1 - e^(-phi)`
probability of exactly one stale competing block	`P[S = 1] = e^(-phi) * phi`
probability of two or more stale competing blocks	`P[S >= 2] = 1 - e^(-phi) * (1 + phi)`

Throughout, stale rate means expected stale blocks per accepted block: E[S] = phi. The probability of one or more stale competing blocks is a different metric. The two are close when phi is small because 1 - e^(-phi) ~= phi, but P[S >= 1] is bounded by 1 while E[S] continues to grow linearly.
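The table can be evaluated directly. This is a small sketch using only the formulas above; the function name and the sample phi values are illustrative:

```python
import math


def stale_quantities(phi):
    """Evaluate the Poisson(phi) quantities from the table above."""
    p_zero = math.exp(-phi)  # P[S = 0]
    return {
        "expected": phi,                           # E[S]
        "p_at_least_one": 1 - p_zero,              # P[S >= 1]
        "p_exactly_one": p_zero * phi,             # P[S = 1]
        "p_two_or_more": 1 - p_zero * (1 + phi),   # P[S >= 2]
    }


for phi in (0.01, 0.05, 0.5, 2.0):
    q = stale_quantities(phi)
    print(f"phi={phi:<5} E[S]={q['expected']:.4f}  "
          f"P[S>=1]={q['p_at_least_one']:.4f}  "
          f"P[S=1]={q['p_exactly_one']:.4f}  "
          f"P[S>=2]={q['p_two_or_more']:.4f}")
```

For small phi the first two columns nearly coincide; for the larger sample values P[S >= 1] flattens toward 1 while E[S] keeps growing.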

[Figure: stale_rate_probability_model]

Fork And Reorg Boundary

A stale block is a valid discovered block that does not end up on the eventual canonical chain. In the simplest propagation race:

H
|-- A1
|-- B1

A1 and B1 are competing siblings. Whichever branch later loses leaves the other block stale.

A fork is the temporary state where different parts of the network are mining different valid tips. A fork can produce stale blocks, but a stale block is an outcome, while a fork is the competing-branch state that exists before the network converges. In the experimental section below, we call the observable fraction of canonical heights with at least one competing non-canonical block the stale-height rate. That is the binary event P[S >= 1].

A reorg happens when a node switches from the branch it had accepted to a different branch with more cumulative work. With roughly constant difficulty, “more cumulative work” is approximately “more blocks.” For example:

H
|-- A1
|-- B1 -- B2

A node that had accepted A1 will switch to B2 once it learns the B1 -> B2 branch. That creates a one-block reorg for that node, because A1 is removed from its active chain. It is a two-block competing branch, but not a two-block reorg.
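A small sketch of this example, treating each chain as a list of block names. It assumes constant difficulty, so cumulative work is taken to be proportional to chain length, and the helper names are hypothetical:

```python
def best_chain(chains):
    """Pick the chain with the most cumulative work.

    Assumption: constant difficulty, so work is proportional to length.
    """
    return max(chains, key=len)


def reorg_depth(old_chain, new_chain):
    """Number of blocks removed from old_chain when switching to new_chain."""
    common = 0
    for old_block, new_block in zip(old_chain, new_chain):
        if old_block != new_block:
            break
        common += 1
    return len(old_chain) - common


accepted = ["H", "A1"]           # what the node had accepted
competing = ["H", "B1", "B2"]    # branch it learns about later

winner = best_chain([accepted, competing])
print("node switches to:", winner[-1])
print("reorg depth:", reorg_depth(accepted, winner))
print("competing branch length:", len(competing) - 1)
```

The reorg depth is 1 because only A1 leaves the active chain, even though the competing branch is two blocks long.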

So modeling reorg depth requires more than phi = tau_eff / T. The stale-block model only counts sibling blocks found during the old-tip mining window. A fork or reorg model must track the branch race after the sibling exists: which miners know about each branch, which tip they are mining, pairwise propagation delays, tie-breaking, work shares, and cumulative-work selection. For honest propagation-induced forks, the natural next model is an event-driven network simulation. The phi model remains the baseline input for how often the first competing sibling appears; the fork/reorg model adds the branch race that follows.

Block-Time Expectations

For block times themselves, PoW block discovery is modeled as a Poisson process, so the waiting time B until the next block is exponential:

P[B <= t] = 1 - e^(-t / T)

With target spacing T, healthy PoW block times have:

E[B] = T
stddev(B) = T
median(B) = T * ln(2) ~= 0.693 * T

So a healthy chain targeting 25s blocks should have a median block time around 17.3s. A median below the target spacing is normal; it is a consequence of exponential waiting times, not evidence that blocks are arriving too quickly.
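To see these expectations numerically, here is a quick simulation sketch. It only draws exponential waiting times with mean T; it does not model difficulty adjustment or propagation, and the seed and sample size are arbitrary:

```python
import math
import random
import statistics

T = 25.0  # target spacing in seconds
rng = random.Random(7)  # fixed seed so the run is repeatable

# expovariate takes the rate (1 / mean), so this samples block intervals
# with mean T.
intervals = [rng.expovariate(1.0 / T) for _ in range(100_000)]

mean = statistics.fmean(intervals)
stdev = statistics.pstdev(intervals)
median = statistics.median(intervals)
expected_median = T * math.log(2)

print(f"mean   = {mean:.2f}s (expected {T:.2f}s)")
print(f"stddev = {stdev:.2f}s (expected {T:.2f}s)")
print(f"median = {median:.2f}s (expected {expected_median:.2f}s)")
```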

[Figure: block_time_and_stale_expectations]

Reach-Based Proxy

The preferred input is per-miner propagation data, because it estimates tau_eff directly:

[{time_seconds: d_i, work_share: w_i}, ...]

where each entry says that miners with work share w_i kept mining the old tip for d_i seconds. The companion model.py includes stale_rate_expectation_from_propagation_points(...) for that form. It normalizes work_share, so callers may pass fractions, percentages, or expected-work weights.
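The companion model.py is not reproduced here. The following is a hypothetical stand-in, deliberately given a different name, that shows what normalizing work_share looks like for the input form above. The sample points are invented:

```python
def stale_rate_from_points(points, target_spacing):
    """Hypothetical helper: phi from [{time_seconds, work_share}, ...].

    work_share values are normalized by their total, so fractions,
    percentages, or raw expected-work weights give the same answer.
    """
    total = sum(p["work_share"] for p in points)
    if total <= 0:
        raise ValueError("work_share values must sum to a positive number")
    tau_eff = sum(p["time_seconds"] * p["work_share"] / total for p in points)
    return tau_eff / target_spacing


# Invented propagation points, expressed as fractions...
as_fractions = [
    {"time_seconds": 8.0, "work_share": 0.10},
    {"time_seconds": 2.5, "work_share": 0.30},
    {"time_seconds": 0.5, "work_share": 0.60},
]
# ...and the same points expressed as percentages.
as_percentages = [{**p, "work_share": p["work_share"] * 100} for p in as_fractions]

T = 25.0
print(f"fractions:   {stale_rate_from_points(as_fractions, T):.2%}")
print(f"percentages: {stale_rate_from_points(as_percentages, T):.2%}")
```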

When per-miner propagation data is unavailable, we can use node-level block distribution measurements as a fallback proxy. Here reach_90 means the time it takes for a block to be distributed to 90% of measured network nodes. For example, reach_90 = 2.5s says that 90% of measured nodes had received the block by 2.5s. It does not say that 90% of block-producing work had received it, and it does not mean all work kept mining the old tip for 2.5s.

Let D(t) be the fraction of measured nodes that have received the block by time t. Then 1 - D(t) is the fraction of measured nodes that do not yet have the block. Under the proxy assumption that measured nodes are representative of where block-producing work receives and validates blocks:

tau_eff_proxy ~= integral_0^inf (1 - D(t)) dt

and therefore:

expected stale blocks per accepted block ~= tau_eff_proxy / T

This is the same area-under-the-curve idea as tau_eff = integral u(t) dt, but with node-weighted distribution data used as the fallback input.
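A rough numerical sketch of the proxy integral, assuming we have one receive delay per measured node. The delays below are invented, and the step size is arbitrary. For an empirical curve like this, the area under 1 - D(t) is just the mean receive delay, which the sketch uses as a cross-check:

```python
def received_fraction(delays, t):
    """D(t): fraction of measured nodes that have the block by time t."""
    return sum(1 for d in delays if d <= t) / len(delays)


def tau_eff_proxy(delays, step=0.001):
    """Left Riemann sum of 1 - D(t) from 0 to the slowest node's delay."""
    steps = int(round(max(delays) / step))
    return sum((1 - received_fraction(delays, i * step)) * step
               for i in range(steps))


# Invented per-node receive delays in seconds (not experiment data).
delays = [0.2, 0.4, 0.5, 0.7, 0.9, 1.0, 1.2, 1.4, 1.5, 3.0]
T = 25.0

area = tau_eff_proxy(delays)
mean_delay = sum(delays) / len(delays)

print(f"area under 1 - D(t): {area:.3f}s")
print(f"mean receive delay:  {mean_delay:.3f}s")
print(f"proxy stale rate:    {area / T:.2%}")
```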

[Figure: reach_proxy_tau_eff]

If only reach_90 = r_90 is available, tau_eff is not determined by that single number, so we need a shape assumption. The headline examples below assume the first 90% of measured nodes receive the block evenly between 0 and r_90. In that case D(t) rises linearly from 0 to 0.90, so 1 - D(t) falls linearly from 1.00 to 0.10. The area through r_90 is:

((1.00 + 0.10) / 2) * r_90 = 0.55 * r_90

If the remaining 10% receives the block almost immediately after r_90, then tau_eff_proxy ~= 0.55 * r_90. If those nodes remain without the block longer, add the tail area:

tau_eff_proxy ~= 0.55 * r_90 + tail_area

Different tail assumptions give different stale-rate estimates. That is why reach_90 should produce a range, not one definitive stale-rate number.

The headline examples use target spacing T = 25s, assume measured nodes are a reasonable proxy for block-producing work, assume linear distribution from 0% at 0s to 90% at reach_90, and assume no material tail beyond reach_90:

`reach_90`	`tau_eff_proxy ~= 0.55 * reach_90`	expected stale rate
`2.5s`	`1.375s`	`5.50%`
`2.0s`	`1.100s`	`4.40%`
`1.0s`	`0.550s`	`2.20%`
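The table rows, plus one illustrative tail, can be reproduced with a few lines. The tail shape here is an assumption made up for this sketch: the last 10% of nodes is taken to receive the block linearly over tail_seconds after reach_90, which adds a triangle of area 0.5 * 0.10 * tail_seconds:

```python
T = 25.0  # target spacing in seconds


def tau_eff_proxy(reach_90, tail_seconds=0.0):
    """0.55 * reach_90 plus an assumed linear tail for the last 10%."""
    head_area = 0.55 * reach_90
    # Assumption: remaining 10% drains linearly to 0 over tail_seconds.
    tail_area = 0.5 * 0.10 * tail_seconds
    return head_area + tail_area


for reach_90 in (2.5, 2.0, 1.0):
    no_tail = tau_eff_proxy(reach_90) / T
    # Illustrative tail only: the slowest 10% takes another 2 * reach_90.
    with_tail = tau_eff_proxy(reach_90, tail_seconds=2 * reach_90) / T
    print(f"reach_90={reach_90:.1f}s  no tail: {no_tail:.2%}  "
          f"assumed tail: {with_tail:.2%}")
```

The two columns bracket a range for each reach_90, in line with the point that a single percentile gives a range and not one definitive number.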

Experimental Results

To compare a Zebra network against the above theory, we ran a 100-miner Zebra-only PoW network. Each node:

ran a single CPU miner
8 vCPU
16 GB RAM
1-3 Gbps connection
geographically distributed over 19 regions, including Australia, Singapore, India, Europe, and the US
modified TCP parameters per ZcashFoundation/zebra issue #10511 ("feature: add warnings and script to configure tcp for greatly improved full block propagation")

The propagation sample uses blocks of at least 1MiB. The stale-height, stale-block, block-time, and branch-switch analysis uses all observed canonical heights in the stabilized window.

For this comparison, the analysis uses canonical heights 400 through 1468 after waiting for block times to stabilize, with target spacing T = 25s.

[Figure: experiment_block_propagation]

The mean time to reach 90% of measured nodes was 1.48s, with median 1.44s. Using the simple reach proxy:

tau_eff_proxy ~= 0.55 * reach_90

the mean effective old-tip mining time is:

tau_eff_proxy ~= 0.55 * 1.48s ~= 0.82s

With T = 25s, that implies:

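phi_proxy ~= 0.82 / 25 ~= 3.26%

(Rounded inputs do not reproduce this exactly: 0.82 / 25 = 3.28%, while 0.55 * 1.48 / 25 = 0.814 / 25 = 3.26%. The 3.26% figure is the one carried through the rest of this section.)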

and the corresponding probability of one or more stale competing blocks is:

1 - e^(-phi_proxy) ~= 3.21%

This gives two theory-side quantities to compare against the trace:

phi_proxy ~= 3.26%, the expected stale blocks per accepted block
1 - e^(-phi_proxy) ~= 3.21%, the expected fraction of heights with at least one competing stale block

The experiment saw 52 heights with at least one competing non-canonical block out of 1069 observed canonical heights:

52 / 1069 ~= 4.86%

There were 52 total non-canonical competing blocks, so the observed stale blocks per accepted block were:

52 / 1069 ~= 4.86%

So the reach-based theory proxy underestimates the observed stale-height event rate in this run, but both the proxy and the observed rate remain below 5%:

metric	theory/proxy	observed
block-time mean (seconds)	`25.0`	`26.2`
block-time median (seconds)	`17.3`	`18.0`
stale blocks per accepted block	`3.26%`	`4.86%`
higher-cumulative-work branch-switch events	`0.25%`	`0.37%`

In count terms, the proxy predicts about 34.3 stale-height events and about 34.9 stale blocks over 1069 heights. The trace observed 52 stale-height events and 52 stale blocks. The reach proxy is intentionally simple: it uses one percentile, assumes no material tail after 90% reach, and treats measured nodes as a proxy for block-producing work. The stronger conclusion is that the propagation-based estimate and observed stale behavior remain in the same sub-5% operating range.
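The comparison can be re-derived from the reported figures with a short script. This is a sketch from the rounded numbers in this post, not from the raw trace, so the last digit can differ slightly from the values quoted above:

```python
import math

T = 25.0                      # target spacing in seconds
reach_90_mean = 1.48          # reported mean reach_90 in seconds (rounded)
heights = 1468 - 400 + 1      # canonical heights 400 through 1468
stale_heights = 52            # heights with at least one competing block
stale_blocks = 52             # total non-canonical competing blocks

phi_proxy = 0.55 * reach_90_mean / T
p_any_proxy = 1 - math.exp(-phi_proxy)

expected_stale_heights = heights * p_any_proxy
expected_stale_blocks = heights * phi_proxy

print(f"heights in window:        {heights}")
print(f"phi_proxy:                {phi_proxy:.2%}")
print(f"P[S >= 1] from proxy:     {p_any_proxy:.2%}")
print(f"expected stale heights:   {expected_stale_heights:.1f}")
print(f"expected stale blocks:    {expected_stale_blocks:.1f}")
print(f"observed stale-height rate: {stale_heights / heights:.2%}")
print(f"observed stale-block rate:  {stale_blocks / heights:.2%}")
```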

For the higher-cumulative-work branch-switch row, split the estimate into the probability of an initial competing sibling and the conditional branch-race outcome:

P[cumulative-work branch switch] ~= P[B1 exists] * P[B2 wins | B1 exists]

The reach-based stale model supplies the first term:

P[B1 exists] ~= 1 - e^(-phi_proxy) ~= 3.21%

The trace supplies the branch-race term. Among the 52 stale-height events, 4 produced a switch to a higher-cumulative-work, higher-height branch, so:

P[B2 wins | B1 exists] ~= 4 / 52 ~= 7.69%

If we use stale blocks rather than stale heights as the denominator, the branch race term is also 4 / 52 ~= 7.69%, because every stale height in this trace had one competing stale block.

That gives:

P[cumulative-work branch switch] ~= 3.21% * 7.69% ~= 0.25%

Over 1069 canonical heights, the model therefore expects about:

1069 * 0.25% ~= 2.63 higher-cumulative-work branch-switch events

The experiment observed 4, or:

4 / 1069 ~= 0.37%

Using the observed stale-height rate for the first term instead of the reach-based proxy gives 4.86% * 7.69% ~= 0.37%, exactly four events over the window. That is a trace-conditioned check, not an independent prediction, but it shows that the observed higher-cumulative-work switch events are consistent with the branch-race model.
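The two-term estimate and the trace-conditioned check can be written out as follows. This sketch starts from the reported phi_proxy of 3.26% and the counts in this post; the variable names are illustrative:

```python
import math

heights = 1069        # observed canonical heights
stale_heights = 52    # heights with a competing non-canonical block
switches = 4          # higher-cumulative-work branch switches
phi_proxy = 0.0326    # reported reach-based stale rate

# First term: probability that a competing sibling B1 exists.
p_sibling = 1 - math.exp(-phi_proxy)
# Second term: branch-race outcome, taken from the trace.
p_b2_wins = switches / stale_heights

p_switch = p_sibling * p_b2_wins
expected_switches = heights * p_switch
observed_rate = switches / heights

# Trace-conditioned check: use the observed stale-height rate instead.
p_switch_trace = (stale_heights / heights) * p_b2_wins

print(f"P[B1 exists]           {p_sibling:.2%}")
print(f"P[B2 wins | B1 exists] {p_b2_wins:.2%}")
print(f"model switch rate      {p_switch:.2%}")
print(f"expected switches      {expected_switches:.2f}")
print(f"observed switch rate   {observed_rate:.2%}")
print(f"trace-conditioned      {p_switch_trace:.2%}")
```

The trace-conditioned product reduces to 4 / 1069 by construction, which is why it recovers exactly four events.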

The difference between 3.26% and 4.86% is expected directionally: the reach_90 shortcut estimates the old-tip work area from one percentile, assumes no material tail after 90% reach, and uses only the subset of blocks with reach-90 propagation data. It also treats measured nodes as a proxy for block-producing work. The trace includes the full branch outcome over the observed height window.

The block-time data is also consistent with a Poisson process targeting 25s. The mean inter-block time was 26.2s, while the median was 18.0s, close to the expected healthy median of 25 * ln(2) ~= 17.3s.

[Figure: experiment_block_times]

The direct stale-height view is the count of competing blocks at each height. Each stale height in this trace had one competing block, so the stale-block count (52) is equal to the number of heights with a stale block (52).

[Figure: experiment_competing_blocks_per_height]

The stale-height-rate plot below separates the aggregate observed stale-height rate from the reach-based approximation. It does not use the rolling-window line from the earlier combined script. The orange line is the per-block approximation of P[S >= 1] from each block’s reach_90; the red line is the observed experiment-wide stale-height rate.

[Figure: experiment_stale_height_rates]

The trace-level fork-switch data adds more detail. It contained 56 unique switch episodes: 52 equal-work, same-height switches, and 4 higher_work_higher_height episodes. These higher-cumulative-work episodes switched from heights 408 to 409, 936 to 937, 1266 to 1267, and 1335 to 1336. These are the events counted in the 0.37% observed higher-cumulative-work branch-switch rate above.

Mainnet Comparison

When comparing runs with similar block sizes and block times, mainnet appears to have lower stale and fork rates than the devnets. Comparable devnet runs show stale-block rates around 1%, while recent mainnet observations are closer to 0.1%.

environment	observed stale/fork rate
comparable devnet runs	`~1%`
mainnet	`~0.1%`

There are two likely reasons for the difference.

Network Topology: Mainnet’s high mining concentration (<20 pools) minimizes propagation latency compared to the intentionally fragmented devnet. This aligns with observed devnet trends where reduced miner counts lower the fork rate by decreasing total old-tip mining time.

Measurement Gap: Devnet has 100% observability, whereas mainnet measurement is subject to selection bias. Because Zebra nodes only gossip the winning tip, a stale block has an approximately 50% chance of remaining invisible to any single observation point. Observed mainnet fork rates are therefore a lower bound on the true rate, even though they come in below the devnet results.

This means we can likely expect mainnet to have a lower fork rate than our "worst case" experiments above.

Conclusion

The main result is that the propagation model and the experiment agree at the scale that matters for safety. With the modified TCP configuration, the expected stale-block rate was 3.26%, the observed stale-height rate was 4.86%, and the observed higher-cumulative-work branch-switch rate was 0.37%. Mainnet shows a meaningfully lower stale-block rate than the devnets. Some of this could be due to a more centralized network, and some to the fact that we cannot measure every stale block without access to every miner.

Most importantly, even in the worst case, with blocks full of Orchard transactions and a highly geographically distributed network, we observe stale block rates lower than 5%. This suggests that it is safe to move forward with a block time reduction, pending network-wide TCP configuration changes.

Reproducing the Plots

The experiment data referenced in this report can be downloaded from this Google Drive folder.

Run the commands below from the repository root:

python3 archive/plot_tau_eff_intuition.py
python3 archive/plot_stale_rate_explainers.py
python3 plot_experiment_results.py

The scripts require Python with matplotlib, numpy, and pandas available.

The archived explainer scripts regenerate:

tau_eff_intuition.png
stale_rate_probability_model.png
block_time_and_stale_expectations.png
reach_proxy_tau_eff.png

The experiment plotting script regenerates:

experiment_block_propagation.png
experiment_block_times.png
experiment_competing_blocks_per_height.png
experiment_stale_height_rates.png
experiment_stale_height_rate_proxy.csv
experiment_results_summary.json

plot_experiment_results.py defaults to experiment data at /home/evan/src/zcash/experiments/valar-1/data. To use a different checkout or exported data directory (such as the downloaded data linked above), set POW_MODELING_DATA_DIR:

POW_MODELING_DATA_DIR=/path/to/valar-1/data python3 plot_experiment_results.py

zerodartz · May 5, 2026, 5:57pm

i wonder how much it might affect performance in reality, where not all nodes have as fast a connection (though network speed is probably the least of the bottlenecks); some nodes might have fewer CPU cores and some less RAM.

for comparison, zebrad's official recommended specs:

1. 4 CPU cores
2. 16 GB RAM
3. 300 GB available disk space
4. 100 Mbps network connection

maybe it's not a problem even with some nodes having worse specs, or in the worst case they might have to upgrade their node hardware to stay up to speed with the others.

evan-forbes · May 5, 2026, 11:51pm

the bandwidth here definitely helps. Fortunately, the network has some load-balancing properties inherent in gossip: a node cannot gossip a valid block until it downloads it, and if the node doesn't respond in time, the timeout gets hit.

this results in slower nodes being less likely to contribute to the network. There are a lot of ways to improve this in the future as well.

ofc, eventually if zcash scales without some sort of meta succinctness proof or miracle software/cryptography then inherently to verify the chain, node resources will have to go up.

bitcoincashautist · May 22, 2026, 5:44pm

Hi, I'm working on a CHIP for BCH ("CHIP-2025-03 Faster Blocks for Bitcoin Cash", can't post links here) to reduce target block time from 600s to 60s, and someone pointed me here, so I thought I'd exchange some notes. We haven't gotten to the testing stage yet like you have, but I made a thorough analytical assessment in that CHIP, and our criterion is a 2% stale rate in the worst case. There's a section on why we chose 2% (impact on margin advantage of biggest pool vs smallest pool).

One question, did Zcash node happen to remove the cs_main propagation bottleneck (see “Async Block Relaying With Compact Block Relay (BIP-152)” on Bitcoin stackexchange)? We inherited this from Satoshi’s node, and it slows down propagation for validation-heavy blocks. It should be possible to just verify PoW and Merkle root and propagate it ASAP, did you already implement this optimization?

Back to orphan rate, in our CHIP we model orphan loss rate from the point of view of individual pools and this is what I get for Zcash with 25s target time, your 1.48s propagation, and current pool distribution:

Pool	Hash Rate	Hash Rate, %	Expected time for others to find a block	Average Propagation Time	Probability of Orphan Race Happening	Probability of Orphan Race Happening and Losing it
ViaBTC	4.99	43.93%	44.58	1.48	3.27%	1.83%
f2pool	2.19	19.28%	30.97	1.48	4.67%	3.77%
antpool	1.68	14.79%	29.34	1.48	4.92%	4.19%
2miners	0.99	8.72%	27.39	1.48	5.26%	4.80%
luxor	0.49	4.30%	26.12	1.48	5.51%	5.27%
poolin	0.35	3.12%	25.80	1.48	5.57%	5.40%
binance	0.30	2.62%	25.67	1.48	5.60%	5.45%
kryptex	0.16	1.41%	25.36	1.48	5.67%	5.59%
zhash	0.09	0.81%	25.20	1.48	5.70%	5.66%
mining-dutch	0.08	0.66%	25.17	1.48	5.71%	5.67%
others	0.04	0.37%	25.09	1.48	5.73%	5.71%

				Network orphan rate:	4.26%
				Top pool advantage:	3.88%

Last I checked (while ViaBTC stats still showed orphan rate on their stats page), ViaBTC had Zcash orphan rate at only 0.03%. With 75s target time, that implies propagation time of under 100ms, so this ~5% is some worst case and you can probably expect much lower. JPT Lovejoy (“An empirical analysis of chain reorganizations and double-spend attacks on proof-of-work cryptocurrencies”, MIT, 2020) showed 0.06% for Zcash (I had to calculate the % from paper’s tables).

For reference, Nervos (CKB) has 10s target time and only ~3.3% orphan rate.

Appendix: I had Claude read both this post and the CHIP section (“Mining Centralization Risk”), to summarize the differences.

Methodological Differences

Zcash: Empirical-First

The Zcash analysis builds a theoretical model ($\phi = \tau_{eff} / T$) and then validates it experimentally by running a 100-node geographically distributed devnet across 19 regions. They have direct observability into every stale block because they control all nodes.

BCH: Analytical-First

The BCH analysis is almost entirely analytical, derived from first principles and calibrated against published BTC mainnet statistics (KIT Institute, ViaBTC). They don’t run their own large-scale devnet; instead they decompose propagation time into components:

$$t_{prop} = (0.5 + \text{isMissingTXs}) \cdot RTT + \frac{n \cdot \text{block\_size} \cdot (0.03 + f_{missing})}{BW} + (n-1) \cdot d_{internal}$$

Different Orphan/Stale Rate Models

Zcash Model

Uses the work-weighted integral approach:

$$\tau_{eff} = \int_0^\infty u(t)\, dt, \quad \phi = \frac{\tau_{eff}}{T}$$

With Poisson approximation: $P[S \geq 1] = 1 - e^{-\phi}$

BCH Model

Uses a simpler exponential with hash-rate weighting:

$$p_{orphan} = 1 - e^{-t_{prop}/T}$$

And critically introduces a per-pool centralization analysis:

$$p_{orphan\_loss} = (1 - h) \cdot (1 - e^{(1-h) \cdot (-t_{prop}/T)})$$

where h is the pool’s hash rate share. This is something the Zcash analysis doesn’t explicitly model.
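As a sketch of how the per-pool expression behaves, the snippet below evaluates it for three hash-rate shares taken from the pool table earlier in this reply, with T = 25s and the 1.48s propagation figure. The function names are illustrative, and because the table inputs are rounded, results should only be expected to match the table to within rounding:

```python
import math


def orphan_race_probability(h, t_prop, target):
    """Chance that the rest of the network (share 1 - h) finds a block
    during the propagation window t_prop."""
    return 1 - math.exp(-(1 - h) * t_prop / target)


def orphan_loss_probability(h, t_prop, target):
    """Race probability scaled by (1 - h), per the expression above."""
    return (1 - h) * orphan_race_probability(h, t_prop, target)


T = 25.0        # target block time in seconds
t_prop = 1.48   # propagation time in seconds
pools = {"ViaBTC": 0.4393, "f2pool": 0.1928, "others": 0.0037}

for name, share in pools.items():
    race = orphan_race_probability(share, t_prop, T)
    loss = orphan_loss_probability(share, t_prop, T)
    print(f"{name:<8} share={share:.2%}  race={race:.2%}  race+loss={loss:.2%}")
```

The larger the pool, the smaller both numbers, which is the asymmetry the centralization argument is built on.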

The Centralization Framing — A Major Difference

The BCH paper places mining centralization risk front and center. Its core argument:

Larger pools have lower orphan rates → higher margins
This creates a “centralization spiral”
Therefore: pick a threshold below which the margin asymmetry is within normal market noise (they argue 2%)

The Zcash paper barely touches this. It focuses on:

Stale rates
Reorg/branch-switch rates
Comparison to Ethereum PoW at 12.5s (which had similar rates)

The BCH approach is arguably more rigorous about why a given threshold matters, grounding it in real mining-company margin data from CoinShares.

Threshold Selection

BCH: 2% orphan rate

Justified by comparing to Bitcoin’s daily price volatility (>2%), arguing miners already absorb shocks of that magnitude regularly.

Zcash: ~5% stale rate

Justified by comparison to Ethereum PoW mainnet at 12.5s blocks, which operated successfully at similar rates.

The Zcash threshold is 2.5x more permissive. This makes sense given Zcash already had 75s blocks and is making a 3x reduction, while BCH is making a 10x reduction from a much larger base.

ValarDragon · May 24, 2026, 1:38pm

Ooh thanks for the detailed post on this! Exciting that Nervos is 10s as well, I had mentally pegged Ethereum as the fastest non-DAG PoW chain.

Zcash has never done compact blocks; historically, I'm unclear on the reasons why. But as you note, this doesn't help at some block size. I suspect it's probably still helpful at 25s, but my goal is to upgrade to even faster block times.

So the reach_90 block prop = 1.48s in this decentralized testnet. We’re pretty confident that mainnet must be lower. (this same testnet at 75s blocktimes, had 10x higher stale rate)

This per-pool analysis is cool! Thankfully in Zcash, miners absorb shocks of far more than 5% per day (and actually aren’t near Opex cost yet, until far more hashpower comes online) Combined with a mainnet gossip parameter less than 1.48s, I think this advantage factor should drop. I believe that pool distribution is pre-foundry splitting out as well
