Migrate to new `rand_core` utility functions by newpavlov · Pull Request #1686 · rust-random/rand

newpavlov · 2025-11-19T16:28:17Z

Cargo.toml

src/rngs/reseeding.rs

newpavlov · 2025-11-19T17:49:35Z

I migrated rand_chacha and got the following benchmark results for reseeding_bytes (the difference is relative to master):

reseeding_bytes/chacha20_4k
                        time:   [430.50 µs 430.77 µs 431.28 µs]
                        thrpt:  [2.2643 GiB/s 2.2670 GiB/s 2.2684 GiB/s]
                 change:
                        time:   [-5.1885% -5.0626% -4.9034%] (p = 0.00 < 0.05)
                        thrpt:  [+5.1562% +5.3326% +5.4724%]
                        Performance has improved.
Found 9 outliers among 100 measurements (9.00%)
  4 (4.00%) high mild
  5 (5.00%) high severe
Benchmarking reseeding_bytes/chacha20_16k: Collecting 100 samples in estimated 6reseeding_bytes/chacha20_16k
                        time:   [430.45 µs 430.50 µs 430.55 µs]
                        thrpt:  [2.2682 GiB/s 2.2684 GiB/s 2.2687 GiB/s]
                 change:
                        time:   [-1.2013% -1.1700% -1.1420%] (p = 0.00 < 0.05)
                        thrpt:  [+1.1552% +1.1839% +1.2159%]
                        Performance has improved.
Found 4 outliers among 100 measurements (4.00%)
  1 (1.00%) high mild
  3 (3.00%) high severe
Benchmarking reseeding_bytes/chacha20_32k: Collecting 100 samples in estimated 6reseeding_bytes/chacha20_32k
                        time:   [430.43 µs 430.47 µs 430.52 µs]
                        thrpt:  [2.2683 GiB/s 2.2686 GiB/s 2.2688 GiB/s]
                 change:
                        time:   [-0.5035% -0.4768% -0.4502%] (p = 0.00 < 0.05)
                        thrpt:  [+0.4523% +0.4791% +0.5060%]
                        Change within noise threshold.
Found 9 outliers among 100 measurements (9.00%)
  3 (3.00%) low mild
  6 (6.00%) high mild
Benchmarking reseeding_bytes/chacha20_64k: Collecting 100 samples in estimated 6reseeding_bytes/chacha20_64k
                        time:   [430.33 µs 430.37 µs 430.41 µs]
                        thrpt:  [2.2689 GiB/s 2.2691 GiB/s 2.2693 GiB/s]
                 change:
                        time:   [-0.3146% -0.1478% +0.0631%] (p = 0.10 > 0.05)
                        thrpt:  [-0.0631% +0.1481% +0.3156%]
                        No change in performance detected.
Found 5 outliers among 100 measurements (5.00%)
  3 (3.00%) high mild
  2 (2.00%) high severe
Benchmarking reseeding_bytes/chacha20_256k: Collecting 100 samples in estimated reseeding_bytes/chacha20_256k
                        time:   [430.47 µs 430.52 µs 430.57 µs]
                        thrpt:  [2.2681 GiB/s 2.2683 GiB/s 2.2686 GiB/s]
                 change:
                        time:   [+0.0455% +0.1558% +0.2295%] (p = 0.00 < 0.05)
                        thrpt:  [-0.2289% -0.1556% -0.0455%]
                        Change within noise threshold.
Found 8 outliers among 100 measurements (8.00%)
  1 (1.00%) low mild
  4 (4.00%) high mild
  3 (3.00%) high severe
Benchmarking reseeding_bytes/chacha20_1024k: Collecting 100 samples in estimatedreseeding_bytes/chacha20_1024k
                        time:   [430.47 µs 430.51 µs 430.56 µs]
                        thrpt:  [2.2681 GiB/s 2.2684 GiB/s 2.2686 GiB/s]
                 change:
                        time:   [-0.1499% +0.0899% +0.2590%] (p = 0.47 > 0.05)
                        thrpt:  [-0.2583% -0.0898% +0.1501%]
                        No change in performance detected.
Found 4 outliers among 100 measurements (4.00%)
  4 (4.00%) high severe

As expected, the difference looks mostly within the noise threshold. The smallest block outlier is probably just a consequence of the sloppy measurement setup (x86-64 laptop without disabled frequency scaling).

dhardy · 2025-11-19T17:53:38Z

Thanks. This needs to be compared using the following diff against master:

$ jd -r wq --git
diff --git a/Cargo.toml b/Cargo.toml
index 13aea84ed7..0064cb9a56 100644
--- a/Cargo.toml
+++ b/Cargo.toml
@@ -76,7 +76,8 @@
 rand_core = { version = "0.10.0-rc-2", default-features = false }
 log = { version = "0.4.4", optional = true }
 serde = { version = "1.0.103", features = ["derive"], optional = true }
-chacha20 = { version = "=0.10.0-rc.5", default-features = false, features = ["rng"], optional = true }
+# chacha20 = { version = "=0.10.0-rc.5", default-features = false, features = ["rng"], optional = true }
+chacha20 = { path = "rand_chacha", optional = true, package = "rand_chacha" }
 getrandom = { version = "0.3.0", optional = true }
 
 [dev-dependencies]

Running benches now. BTW you only included fill_bytes benchmarks. All StdRng benchmarks should apply.

dhardy · 2025-11-19T17:59:54Z

Some more results:

random_bytes/std        time:   [2.8826 µs 2.8979 µs 2.9156 µs]
                        thrpt:  [334.94 MiB/s 336.98 MiB/s 338.77 MiB/s]
                 change:
                        time:   [-2.1829% -1.7829% -1.2474%] (p = 0.00 < 0.05)
                        thrpt:  [+1.2632% +1.8153% +2.2317%]
                        Performance has improved.
Found 17 outliers among 100 measurements (17.00%)
  2 (2.00%) low severe
  1 (1.00%) low mild
  3 (3.00%) high mild
  11 (11.00%) high severe
random_bytes/thread     time:   [2.9700 µs 2.9945 µs 3.0219 µs]
                        thrpt:  [323.16 MiB/s 326.12 MiB/s 328.81 MiB/s]
                 change:
                        time:   [-1.2714% -0.2641% +0.6526%] (p = 0.60 > 0.05)
                        thrpt:  [-0.6483% +0.2648% +1.2878%]
                        No change in performance detected.
Found 8 outliers among 100 measurements (8.00%)
  4 (4.00%) high mild
  4 (4.00%) high severe
random_u32/std          time:   [11.678 ns 11.695 ns 11.715 ns]
                        thrpt:  [325.62 MiB/s 326.17 MiB/s 326.66 MiB/s]
                 change:
                        time:   [-17.300% -17.028% -16.772%] (p = 0.00 < 0.05)
                        thrpt:  [+20.151% +20.522% +20.920%]
                        Performance has improved.
Found 78 outliers among 1000 measurements (7.80%)
  2 (0.20%) low mild
  33 (3.30%) high mild
  43 (4.30%) high severe
random_u32/thread       time:   [17.863 ns 17.904 ns 17.965 ns]
                        thrpt:  [212.34 MiB/s 213.06 MiB/s 213.55 MiB/s]
                 change:
                        time:   [+19.568% +20.963% +22.167%] (p = 0.00 < 0.05)
                        thrpt:  [-18.145% -17.330% -16.366%]
                        Performance has regressed.
Found 85 outliers among 1000 measurements (8.50%)
  1 (0.10%) low mild
  41 (4.10%) high mild
  43 (4.30%) high severe
random_u64/std          time:   [19.242 ns 19.320 ns 19.407 ns]
                        thrpt:  [393.12 MiB/s 394.90 MiB/s 396.49 MiB/s]
                 change:
                        time:   [-5.5397% -5.1726% -4.7543%] (p = 0.00 < 0.05)
                        thrpt:  [+4.9916% +5.4547% +5.8646%]
                        Performance has improved.
Found 111 outliers among 1000 measurements (11.10%)
  45 (4.50%) high mild
  66 (6.60%) high severe
random_u64/thread       time:   [24.032 ns 24.044 ns 24.056 ns]
                        thrpt:  [317.15 MiB/s 317.31 MiB/s 317.47 MiB/s]
                 change:
                        time:   [+13.591% +14.264% +14.821%] (p = 0.00 < 0.05)
                        thrpt:  [-12.908% -12.483% -11.965%]
                        Performance has regressed.
Found 55 outliers among 1000 measurements (5.50%)
  1 (0.10%) low mild
  30 (3.00%) high mild
  24 (2.40%) high severe

Edit: updated. This was run with CPU frequency pinned to 577 MHz but still shows bad variance, so take with a big pinch of salt.

newpavlov · 2025-11-19T18:02:23Z

This needs to be compared using the following diff against master:

Done. I updated the results in my previous comment. Interestingly, the result for chacha20_4k seems to be reproducible.

dhardy · 2025-11-20T08:55:20Z

Updated, but variance is still high. You might want to run these benches yourself.

newpavlov · 2025-11-21T10:58:25Z

Results for Ryzen 7 2700x:

Benchmarking reseeding_bytes/chacha20_4k: Collecting 100 samples in estimated 6.reseeding_bytes/chacha20_4k
                        time:   [655.25 µs 655.92 µs 656.73 µs]
                        thrpt:  [1.4870 GiB/s 1.4888 GiB/s 1.4904 GiB/s]
                 change:
                        time:   [-7.7435% -7.6262% -7.5005%] (p = 0.00 < 0.05)
                        thrpt:  [+8.1087% +8.2558% +8.3935%]
                        Performance has improved.
Found 14 outliers among 100 measurements (14.00%)
  14 (14.00%) high mild
Benchmarking reseeding_bytes/chacha20_16k: Collecting 100 samples in estimated 6reseeding_bytes/chacha20_16k
                        time:   [654.47 µs 654.89 µs 655.44 µs]
                        thrpt:  [1.4899 GiB/s 1.4912 GiB/s 1.4922 GiB/s]
                 change:
                        time:   [-0.5124% -0.2880% -0.0777%] (p = 0.01 < 0.05)
                        thrpt:  [+0.0777% +0.2889% +0.5150%]
                        Change within noise threshold.
Found 8 outliers among 100 measurements (8.00%)
  2 (2.00%) high mild
  6 (6.00%) high severe
Benchmarking reseeding_bytes/chacha20_32k: Collecting 100 samples in estimated 6reseeding_bytes/chacha20_32k
                        time:   [646.76 µs 646.93 µs 647.10 µs]
                        thrpt:  [1.5091 GiB/s 1.5095 GiB/s 1.5099 GiB/s]
                 change:
                        time:   [+0.2419% +0.3457% +0.4417%] (p = 0.00 < 0.05)
                        thrpt:  [-0.4398% -0.3446% -0.2413%]
                        Change within noise threshold.
Found 11 outliers among 100 measurements (11.00%)
  8 (8.00%) high mild
  3 (3.00%) high severe
Benchmarking reseeding_bytes/chacha20_64k: Collecting 100 samples in estimated 6reseeding_bytes/chacha20_64k
                        time:   [647.24 µs 647.90 µs 648.65 µs]
                        thrpt:  [1.5055 GiB/s 1.5073 GiB/s 1.5088 GiB/s]
                 change:
                        time:   [+1.0239% +1.1825% +1.3324%] (p = 0.00 < 0.05)
                        thrpt:  [-1.3149% -1.1687% -1.0135%]
                        Performance has regressed.
Found 7 outliers among 100 measurements (7.00%)
  4 (4.00%) high mild
  3 (3.00%) high severe
Benchmarking reseeding_bytes/chacha20_256k: Collecting 100 samples in estimated reseeding_bytes/chacha20_256k
                        time:   [646.76 µs 647.04 µs 647.34 µs]
                        thrpt:  [1.5086 GiB/s 1.5093 GiB/s 1.5099 GiB/s]
                 change:
                        time:   [+1.6612% +1.7655% +1.8890%] (p = 0.00 < 0.05)
                        thrpt:  [-1.8540% -1.7349% -1.6341%]
                        Performance has regressed.
Found 6 outliers among 100 measurements (6.00%)
  2 (2.00%) high mild
  4 (4.00%) high severe
Benchmarking reseeding_bytes/chacha20_1024k: Collecting 100 samples in estimatedreseeding_bytes/chacha20_1024k
                        time:   [658.56 µs 659.74 µs 661.02 µs]
                        thrpt:  [1.4774 GiB/s 1.4802 GiB/s 1.4829 GiB/s]
                 change:
                        time:   [+2.0450% +2.1795% +2.3088%] (p = 0.00 < 0.05)
                        thrpt:  [-2.2567% -2.1330% -2.0040%]
                        Performance has regressed.
Found 11 outliers among 100 measurements (11.00%)
  9 (9.00%) high mild
  2 (2.00%) high severe

newpavlov · 2025-11-21T11:02:47Z

Results for M4 Mac:

Benchmarking reseeding_bytes/chacha20_4k: Collecting 100 samples in estimated 6.reseeding_bytes/chacha20_4k
                        time:   [1.2357 ms 1.2358 ms 1.2359 ms]
                        thrpt:  [809.12 MiB/s 809.19 MiB/s 809.26 MiB/s]
                 change:
                        time:   [-12.536% -12.496% -12.453%] (p = 0.00 < 0.05)
                        thrpt:  [+14.225% +14.281% +14.332%]
                        Performance has improved.
Found 14 outliers among 100 measurements (14.00%)
  1 (1.00%) low severe
  2 (2.00%) low mild
  7 (7.00%) high mild
  4 (4.00%) high severe
Benchmarking reseeding_bytes/chacha20_16k: Warming up for 500.00 ms
Warning: Unable to complete 100 samples in 5.0s. You may wish to increase target time to 6.4s, enable flat sampling, or reduce sample count to 60.
Benchmarking reseeding_bytes/chacha20_16k: Collecting 100 samples in estimated 6reseeding_bytes/chacha20_16k
                        time:   [1.2346 ms 1.2347 ms 1.2348 ms]
                        thrpt:  [809.82 MiB/s 809.91 MiB/s 809.97 MiB/s]
                 change:
                        time:   [-3.1229% -3.0897% -3.0547%] (p = 0.00 < 0.05)
                        thrpt:  [+3.1509% +3.1882% +3.2235%]
                        Performance has improved.
Found 9 outliers among 100 measurements (9.00%)
  3 (3.00%) high mild
  6 (6.00%) high severe
Benchmarking reseeding_bytes/chacha20_32k: Warming up for 500.00 ms
Warning: Unable to complete 100 samples in 5.0s. You may wish to increase target time to 6.4s, enable flat sampling, or reduce sample count to 60.
Benchmarking reseeding_bytes/chacha20_32k: Collecting 100 samples in estimated 6reseeding_bytes/chacha20_32k
                        time:   [1.2355 ms 1.2356 ms 1.2357 ms]
                        thrpt:  [809.23 MiB/s 809.33 MiB/s 809.40 MiB/s]
                 change:
                        time:   [-1.3590% -1.3166% -1.2751%] (p = 0.00 < 0.05)
                        thrpt:  [+1.2916% +1.3342% +1.3777%]
                        Performance has improved.
Found 10 outliers among 100 measurements (10.00%)
  1 (1.00%) low mild
  4 (4.00%) high mild
  5 (5.00%) high severe
Benchmarking reseeding_bytes/chacha20_64k: Warming up for 500.00 ms
Warning: Unable to complete 100 samples in 5.0s. You may wish to increase target time to 6.4s, enable flat sampling, or reduce sample count to 60.
Benchmarking reseeding_bytes/chacha20_64k: Collecting 100 samples in estimated 6reseeding_bytes/chacha20_64k
                        time:   [1.2345 ms 1.2347 ms 1.2350 ms]
                        thrpt:  [809.71 MiB/s 809.94 MiB/s 810.08 MiB/s]
                 change:
                        time:   [-0.5510% -0.4807% -0.4293%] (p = 0.00 < 0.05)
                        thrpt:  [+0.4311% +0.4830% +0.5541%]
                        Change within noise threshold.
Found 17 outliers among 100 measurements (17.00%)
  2 (2.00%) low mild
  7 (7.00%) high mild
  8 (8.00%) high severe
Benchmarking reseeding_bytes/chacha20_256k: Warming up for 500.00 ms
Warning: Unable to complete 100 samples in 5.0s. You may wish to increase target time to 6.3s, enable flat sampling, or reduce sample count to 60.
Benchmarking reseeding_bytes/chacha20_256k: Collecting 100 samples in estimated reseeding_bytes/chacha20_256k
                        time:   [1.2360 ms 1.2380 ms 1.2407 ms]
                        thrpt:  [805.99 MiB/s 807.73 MiB/s 809.04 MiB/s]
                 change:
                        time:   [+0.2114% +0.3427% +0.4928%] (p = 0.00 < 0.05)
                        thrpt:  [-0.4904% -0.3415% -0.2110%]
                        Change within noise threshold.
Found 9 outliers among 100 measurements (9.00%)
  1 (1.00%) high mild
  8 (8.00%) high severe
Benchmarking reseeding_bytes/chacha20_1024k: Warming up for 500.00 ms
Warning: Unable to complete 100 samples in 5.0s. You may wish to increase target time to 6.3s, enable flat sampling, or reduce sample count to 60.
Benchmarking reseeding_bytes/chacha20_1024k: Collecting 100 samples in estimated reseeding_bytes/chacha20_1024k
                        time:   [1.2358 ms 1.2359 ms 1.2360 ms]
                        thrpt:  [809.06 MiB/s 809.14 MiB/s 809.21 MiB/s]
                 change:
                        time:   [+0.4471% +0.4792% +0.5138%] (p = 0.00 < 0.05)
                        thrpt:  [-0.5112% -0.4769% -0.4451%]
                        Change within noise threshold.
Found 9 outliers among 100 measurements (9.00%)
  5 (5.00%) high mild
  4 (4.00%) high severe

dhardy · 2025-11-21T11:06:00Z

I've been attempting to get more consistent results: https://gist.github.com/dhardy/514742f635df81053a4c2b95c32004da

Can you benchmark random_u{32,64}/{std,thread}?

dhardy · 2025-11-21T11:12:21Z

My last run (this PR vs master over rand_chacha):

Benchmark results


$ taskset -c 4 cargo bench --bench generators -- --baseline master
    Finished `bench` profile [optimized] target(s) in 0.03s
     Running benches/generators.rs (/home/dhardy-extra/.cache/cargo-build/release/deps/generators-05bf2418c7dd064a)
random_bytes/pcg32      time:   [311.19 ns 311.52 ns 311.86 ns]
                        thrpt:  [3.0580 GiB/s 3.0613 GiB/s 3.0646 GiB/s]
                 change:
                        time:   [-11.839% -11.727% -11.625%] (p = 0.00 < 0.05)
                        thrpt:  [+13.155% +13.285% +13.429%]
                        Performance has improved.
random_bytes/pcg64      time:   [253.03 ns 253.26 ns 253.54 ns]
                        thrpt:  [3.7615 GiB/s 3.7656 GiB/s 3.7690 GiB/s]
                 change:
                        time:   [-0.1131% -0.0081% +0.1020%] (p = 0.88 > 0.05)
                        thrpt:  [-0.1019% +0.0081% +0.1132%]
                        No change in performance detected.
random_bytes/pcg64mcg   time:   [211.88 ns 212.15 ns 212.45 ns]
                        thrpt:  [4.4890 GiB/s 4.4953 GiB/s 4.5010 GiB/s]
                 change:
                        time:   [-2.2738% -2.1487% -2.0277%] (p = 0.00 < 0.05)
                        thrpt:  [+2.0696% +2.1958% +2.3267%]
                        Performance has improved.
random_bytes/pcg64dxsm  time:   [256.14 ns 256.43 ns 256.73 ns]
                        thrpt:  [3.7148 GiB/s 3.7191 GiB/s 3.7232 GiB/s]
                 change:
                        time:   [-0.0220% +0.0890% +0.1942%] (p = 0.10 > 0.05)
                        thrpt:  [-0.1938% -0.0889% +0.0220%]
                        No change in performance detected.
random_bytes/chacha8    time:   [251.83 ns 252.00 ns 252.20 ns]
                        thrpt:  [3.7814 GiB/s 3.7844 GiB/s 3.7869 GiB/s]
                 change:
                        time:   [+0.5484% +0.6414% +0.7327%] (p = 0.00 < 0.05)
                        thrpt:  [-0.7274% -0.6374% -0.5454%]
                        Change within noise threshold.
Found 8 outliers among 100 measurements (8.00%)
  8 (8.00%) high mild
random_bytes/chacha12   time:   [320.78 ns 321.33 ns 321.86 ns]
                        thrpt:  [2.9630 GiB/s 2.9679 GiB/s 2.9730 GiB/s]
                 change:
                        time:   [-1.4311% -1.2958% -1.1478%] (p = 0.00 < 0.05)
                        thrpt:  [+1.1611% +1.3128% +1.4519%]
                        Performance has improved.
random_bytes/chacha20   time:   [456.78 ns 457.09 ns 457.41 ns]
                        thrpt:  [2.0849 GiB/s 2.0864 GiB/s 2.0878 GiB/s]
                 change:
                        time:   [-1.3588% -1.2410% -1.1258%] (p = 0.00 < 0.05)
                        thrpt:  [+1.1386% +1.2566% +1.3775%]
                        Performance has improved.
Found 7 outliers among 100 measurements (7.00%)
  5 (5.00%) high mild
  2 (2.00%) high severe
random_bytes/std        time:   [319.57 ns 319.75 ns 319.95 ns]
                        thrpt:  [2.9807 GiB/s 2.9826 GiB/s 2.9842 GiB/s]
                 change:
                        time:   [+2.9038% +3.0131% +3.1260%] (p = 0.00 < 0.05)
                        thrpt:  [-3.0313% -2.9249% -2.8219%]
                        Performance has regressed.
Found 12 outliers among 100 measurements (12.00%)
  11 (11.00%) high mild
  1 (1.00%) high severe
random_bytes/small      time:   [170.21 ns 170.28 ns 170.36 ns]
                        thrpt:  [5.5979 GiB/s 5.6007 GiB/s 5.6030 GiB/s]
                 change:
                        time:   [-0.7668% -0.6414% -0.5145%] (p = 0.00 < 0.05)
                        thrpt:  [+0.5171% +0.6455% +0.7728%]
                        Change within noise threshold.
Found 13 outliers among 100 measurements (13.00%)
  6 (6.00%) high mild
  7 (7.00%) high severe
random_bytes/os         time:   [1.4123 µs 1.4127 µs 1.4131 µs]
                        thrpt:  [691.08 MiB/s 691.28 MiB/s 691.46 MiB/s]
                 change:
                        time:   [-0.4199% -0.3606% -0.3013%] (p = 0.00 < 0.05)
                        thrpt:  [+0.3022% +0.3619% +0.4217%]
                        Change within noise threshold.
Found 4 outliers among 100 measurements (4.00%)
  4 (4.00%) high mild
random_bytes/thread     time:   [323.04 ns 323.34 ns 323.68 ns]
                        thrpt:  [2.9464 GiB/s 2.9495 GiB/s 2.9522 GiB/s]
                 change:
                        time:   [+0.2355% +0.3743% +0.5049%] (p = 0.00 < 0.05)
                        thrpt:  [-0.5023% -0.3730% -0.2350%]
                        Change within noise threshold.
Found 19 outliers among 100 measurements (19.00%)
  4 (4.00%) low mild
  7 (7.00%) high mild
  8 (8.00%) high severe
random_u32/pcg32        time:   [1.0420 ns 1.0424 ns 1.0428 ns]

thrpt:  [3.5723 GiB/s 3.5737 GiB/s 3.5750 GiB/s]

change:

time:   [+4.0874% +4.1614% +4.2425%] (p = 0.00 < 0.05)

thrpt:  [-4.0698% -3.9952% -3.9269%]

Performance has regressed.

random_u32/pcg64        time:   [1.3486 ns 1.3508 ns 1.3528 ns]

thrpt:  [2.7537 GiB/s 2.7578 GiB/s 2.7624 GiB/s]

change:

time:   [-0.1667% +0.0050% +0.1878%] (p = 0.96 > 0.05)

thrpt:  [-0.1875% -0.0050% +0.1670%]

No change in performance detected.

Found 406 outliers among 1000 measurements (40.60%)

220 (22.00%) low severe

17 (1.70%) low mild

5 (0.50%) high mild

164 (16.40%) high severe

random_u32/pcg64mcg     time:   [942.39 ps 942.68 ps 942.98 ps]

thrpt:  [3.9505 GiB/s 3.9518 GiB/s 3.9530 GiB/s]

change:

time:   [-0.5956% -0.5539% -0.5153%] (p = 0.00 < 0.05)

thrpt:  [+0.5179% +0.5570% +0.5992%]

Change within noise threshold.

Found 39 outliers among 1000 measurements (3.90%)

39 (3.90%) high mild

random_u32/pcg64dxsm    time:   [1.4300 ns 1.4304 ns 1.4308 ns]

thrpt:  [2.6037 GiB/s 2.6044 GiB/s 2.6052 GiB/s]

change:

time:   [-1.1536% -1.0395% -0.9396%] (p = 0.00 < 0.05)

thrpt:  [+0.9485% +1.0504% +1.1670%]

Change within noise threshold.

Found 1 outliers among 1000 measurements (0.10%)

1 (0.10%) high mild

random_u32/chacha8      time:   [981.29 ps 981.63 ps 981.97 ps]

thrpt:  [3.7937 GiB/s 3.7950 GiB/s 3.7963 GiB/s]

change:

time:   [+2.3021% +2.3477% +2.3941%] (p = 0.00 < 0.05)

thrpt:  [-2.3381% -2.2938% -2.2503%]

Performance has regressed.

Found 63 outliers among 1000 measurements (6.30%)

63 (6.30%) high mild

random_u32/chacha12     time:   [1.2546 ns 1.2550 ns 1.2554 ns]

thrpt:  [2.9674 GiB/s 2.9684 GiB/s 2.9693 GiB/s]

change:

time:   [+2.2369% +2.2778% +2.3211%] (p = 0.00 < 0.05)

thrpt:  [-2.2685% -2.2271% -2.1880%]

Performance has regressed.

Found 49 outliers among 1000 measurements (4.90%)

48 (4.80%) high mild

1 (0.10%) high severe

random_u32/chacha20     time:   [1.7742 ns 1.7749 ns 1.7757 ns]

thrpt:  [2.0980 GiB/s 2.0988 GiB/s 2.0997 GiB/s]

change:

time:   [-1.3972% -1.3506% -1.3037%] (p = 0.00 < 0.05)

thrpt:  [+1.3210% +1.3691% +1.4170%]

Performance has improved.

Found 1 outliers among 1000 measurements (0.10%)

1 (0.10%) high mild

random_u32/std          time:   [1.2572 ns 1.2577 ns 1.2582 ns]

thrpt:  [2.9609 GiB/s 2.9621 GiB/s 2.9632 GiB/s]

change:

time:   [-0.0821% -0.0394% +0.0066%] (p = 0.07 > 0.05)

thrpt:  [-0.0066% +0.0395% +0.0822%]

No change in performance detected.

Found 32 outliers among 1000 measurements (3.20%)

1 (0.10%) low mild

31 (3.10%) high mild

random_u32/small        time:   [638.58 ps 638.83 ps 639.08 ps]

thrpt:  [5.8291 GiB/s 5.8315 GiB/s 5.8337 GiB/s]

change:

time:   [-0.3715% -0.3186% -0.2696%] (p = 0.00 < 0.05)

thrpt:  [+0.2703% +0.3196% +0.3729%]

Change within noise threshold.

random_u32/os           time:   [14.440 ns 14.444 ns 14.448 ns]

thrpt:  [264.04 MiB/s 264.10 MiB/s 264.17 MiB/s]

change:

time:   [-2.3424% -2.3049% -2.2665%] (p = 0.00 < 0.05)

thrpt:  [+2.3190% +2.3593% +2.3986%]

Performance has improved.

Found 176 outliers among 1000 measurements (17.60%)

2 (0.20%) low severe

21 (2.10%) low mild

100 (10.00%) high mild

53 (5.30%) high severe

random_u32/thread       time:   [1.4727 ns 1.4733 ns 1.4739 ns]

thrpt:  [2.5275 GiB/s 2.5285 GiB/s 2.5295 GiB/s]

change:

time:   [+15.577% +15.636% +15.698%] (p = 0.00 < 0.05)

thrpt:  [-13.568% -13.522% -13.477%]

Performance has regressed.

Found 1 outliers among 1000 measurements (0.10%)

1 (0.10%) high mild
random_u64/pcg32        time:   [2.0784 ns 2.0790 ns 2.0795 ns]

thrpt:  [3.5828 GiB/s 3.5838 GiB/s 3.5847 GiB/s]

change:

time:   [-1.2166% -1.1738% -1.1293%] (p = 0.00 < 0.05)

thrpt:  [+1.1422% +1.1877% +1.2316%]

Performance has improved.

Found 223 outliers among 1000 measurements (22.30%)

1 (0.10%) low severe

11 (1.10%) high mild

211 (21.10%) high severe

random_u64/pcg64        time:   [1.3189 ns 1.3195 ns 1.3200 ns]

thrpt:  [5.6444 GiB/s 5.6466 GiB/s 5.6493 GiB/s]

change:

time:   [-4.2114% -4.1155% -4.0255%] (p = 0.00 < 0.05)

thrpt:  [+4.1943% +4.2922% +4.3965%]

Performance has improved.

Found 74 outliers among 1000 measurements (7.40%)

4 (0.40%) low severe

70 (7.00%) high mild

random_u64/pcg64mcg     time:   [942.99 ps 943.33 ps 943.68 ps]

thrpt:  [7.8952 GiB/s 7.8982 GiB/s 7.9010 GiB/s]

change:

time:   [-1.1264% -1.0812% -1.0349%] (p = 0.00 < 0.05)

thrpt:  [+1.0457% +1.0930% +1.1392%]

Performance has improved.

Found 78 outliers among 1000 measurements (7.80%)

78 (7.80%) high mild

random_u64/pcg64dxsm    time:   [1.2413 ns 1.2419 ns 1.2426 ns]

thrpt:  [5.9961 GiB/s 5.9993 GiB/s 6.0025 GiB/s]

change:

time:   [-14.280% -14.235% -14.192%] (p = 0.00 < 0.05)

thrpt:  [+16.539% +16.598% +16.659%]

Performance has improved.

random_u64/chacha8      time:   [1.5400 ns 1.5407 ns 1.5413 ns]

thrpt:  [4.8339 GiB/s 4.8359 GiB/s 4.8380 GiB/s]

change:

time:   [+7.4912% +7.5654% +7.6415%] (p = 0.00 < 0.05)

thrpt:  [-7.0991% -7.0333% -6.9691%]

Performance has regressed.

Found 3 outliers among 1000 measurements (0.30%)

2 (0.20%) low mild

1 (0.10%) high mild

random_u64/chacha12     time:   [2.0764 ns 2.0774 ns 2.0783 ns]

thrpt:  [3.5849 GiB/s 3.5865 GiB/s 3.5882 GiB/s]

change:

time:   [+3.9570% +4.0275% +4.0919%] (p = 0.00 < 0.05)

thrpt:  [-3.9310% -3.8716% -3.8064%]

Performance has regressed.

Found 1 outliers among 1000 measurements (0.10%)

1 (0.10%) high mild

random_u64/chacha20     time:   [3.2758 ns 3.2788 ns 3.2817 ns]

thrpt:  [2.2704 GiB/s 2.2724 GiB/s 2.2744 GiB/s]

change:

time:   [+7.5646% +7.6660% +7.7529%] (p = 0.00 < 0.05)

thrpt:  [-7.1951% -7.1202% -7.0326%]

Performance has regressed.

Found 88 outliers among 1000 measurements (8.80%)

88 (8.80%) low mild

random_u64/std          time:   [2.0755 ns 2.0764 ns 2.0773 ns]

thrpt:  [3.5867 GiB/s 3.5883 GiB/s 3.5898 GiB/s]

change:

time:   [+3.3175% +3.3705% +3.4253%] (p = 0.00 < 0.05)

thrpt:  [-3.3119% -3.2606% -3.2110%]

Performance has regressed.

Found 1 outliers among 1000 measurements (0.10%)

1 (0.10%) high severe

random_u64/small        time:   [649.93 ps 650.16 ps 650.40 ps]

thrpt:  [11.455 GiB/s 11.460 GiB/s 11.464 GiB/s]

change:

time:   [-0.5487% -0.4995% -0.4523%] (p = 0.00 < 0.05)

thrpt:  [+0.4543% +0.5020% +0.5517%]

Change within noise threshold.

Found 79 outliers among 1000 measurements (7.90%)

78 (7.80%) high mild

1 (0.10%) high severe

random_u64/os           time:   [21.233 ns 21.244 ns 21.255 ns]

thrpt:  [358.94 MiB/s 359.13 MiB/s 359.32 MiB/s]

change:

time:   [-0.4470% -0.3909% -0.3412%] (p = 0.00 < 0.05)

thrpt:  [+0.3423% +0.3924% +0.4490%]

Change within noise threshold.

Found 3 outliers among 1000 measurements (0.30%)

2 (0.20%) high mild

1 (0.10%) high severe

random_u64/thread       time:   [2.4415 ns 2.4425 ns 2.4435 ns]

thrpt:  [3.0491 GiB/s 3.0504 GiB/s 3.0516 GiB/s]

change:

time:   [+20.037% +20.088% +20.142%] (p = 0.00 < 0.05)

thrpt:  [-16.765% -16.728% -16.692%]

Performance has regressed.
init_gen/pcg32          time:   [6.4907 ns 6.4987 ns 6.5061 ns]

change: [-26.347% -26.251% -26.159%] (p = 0.00 < 0.05)

Performance has improved.

Found 17 outliers among 100 measurements (17.00%)

10 (10.00%) low severe

7 (7.00%) low mild

init_gen/pcg64          time:   [11.313 ns 11.325 ns 11.337 ns]

change: [+2.0110% +2.1472% +2.2829%] (p = 0.00 < 0.05)

Performance has regressed.

Found 20 outliers among 100 measurements (20.00%)

9 (9.00%) low severe

7 (7.00%) low mild

3 (3.00%) high mild

1 (1.00%) high severe

init_gen/pcg64mcg       time:   [5.2545 ns 5.2600 ns 5.2650 ns]

change: [-30.416% -30.345% -30.281%] (p = 0.00 < 0.05)

Performance has improved.

Found 13 outliers among 100 measurements (13.00%)

8 (8.00%) low severe

4 (4.00%) low mild

1 (1.00%) high mild

init_gen/pcg64dxsm      time:   [10.731 ns 10.744 ns 10.758 ns]

change: [+0.3016% +0.4397% +0.5751%] (p = 0.00 < 0.05)

Change within noise threshold.

init_gen/chacha8        time:   [18.228 ns 18.265 ns 18.301 ns]

change: [-43.180% -43.078% -42.981%] (p = 0.00 < 0.05)

Performance has improved.

init_gen/chacha12       time:   [18.211 ns 18.231 ns 18.248 ns]

change: [-44.115% -44.024% -43.938%] (p = 0.00 < 0.05)

Performance has improved.

init_gen/chacha20       time:   [18.418 ns 18.453 ns 18.486 ns]

change: [-43.315% -43.156% -42.992%] (p = 0.00 < 0.05)

Performance has improved.

Found 3 outliers among 100 measurements (3.00%)

1 (1.00%) low mild

2 (2.00%) high mild

init_gen/std            time:   [29.005 ns 29.050 ns 29.095 ns]

change: [-11.674% -11.537% -11.395%] (p = 0.00 < 0.05)

Performance has improved.

init_gen/small          time:   [7.5812 ns 7.5932 ns 7.6061 ns]

change: [-10.714% -10.493% -10.307%] (p = 0.00 < 0.05)

Performance has improved.
init_from_u64/pcg32     time:   [6.4354 ns 6.4433 ns 6.4518 ns]

change: [-0.5989% -0.4299% -0.2640%] (p = 0.00 < 0.05)

Change within noise threshold.

init_from_u64/pcg64     time:   [9.5029 ns 9.5091 ns 9.5168 ns]

change: [-0.3128% -0.2001% -0.0739%] (p = 0.00 < 0.05)

Change within noise threshold.

Found 21 outliers among 100 measurements (21.00%)

4 (4.00%) high mild

17 (17.00%) high severe

init_from_u64/pcg64mcg  time:   [5.3457 ns 5.3538 ns 5.3613 ns]

change: [+0.8078% +0.9571% +1.0996%] (p = 0.00 < 0.05)

Change within noise threshold.

Found 11 outliers among 100 measurements (11.00%)

11 (11.00%) low mild

init_from_u64/pcg64dxsm time:   [8.8702 ns 8.8811 ns 8.8925 ns]

change: [+2.2172% +2.3486% +2.4811%] (p = 0.00 < 0.05)

Performance has regressed.

Found 8 outliers among 100 measurements (8.00%)

8 (8.00%) high mild

init_from_u64/chacha8   time:   [16.883 ns 16.896 ns 16.912 ns]

change: [-39.452% -39.384% -39.308%] (p = 0.00 < 0.05)

Performance has improved.

Found 20 outliers among 100 measurements (20.00%)

20 (20.00%) high mild

init_from_u64/chacha12  time:   [17.065 ns 17.078 ns 17.092 ns]

change: [-38.976% -38.872% -38.765%] (p = 0.00 < 0.05)

Performance has improved.

init_from_u64/chacha20  time:   [17.040 ns 17.061 ns 17.081 ns]

change: [-39.798% -39.721% -39.642%] (p = 0.00 < 0.05)

Performance has improved.

Found 18 outliers among 100 measurements (18.00%)

5 (5.00%) low severe

3 (3.00%) high mild

10 (10.00%) high severe

init_from_u64/std       time:   [28.460 ns 28.509 ns 28.553 ns]

change: [+1.7528% +1.8934% +2.0544%] (p = 0.00 < 0.05)

Performance has regressed.

init_from_u64/small     time:   [3.5562 ns 3.5602 ns 3.5639 ns]

change: [-0.4964% -0.3506% -0.1897%] (p = 0.00 < 0.05)

Change within noise threshold.

Found 1 outliers among 100 measurements (1.00%)

1 (1.00%) high severe
init_from_seed/pcg32    time:   [7.1359 ns 7.1483 ns 7.1613 ns]

change: [-2.5424% -2.2865% -2.0217%] (p = 0.00 < 0.05)

Performance has improved.

init_from_seed/pcg64    time:   [12.281 ns 12.297 ns 12.313 ns]

change: [-0.7089% -0.5746% -0.4367%] (p = 0.00 < 0.05)

Change within noise threshold.

Found 10 outliers among 100 measurements (10.00%)

9 (9.00%) high mild

1 (1.00%) high severe

init_from_seed/pcg64mcg time:   [6.1753 ns 6.1813 ns 6.1863 ns]

change: [+0.5807% +0.7076% +0.8480%] (p = 0.00 < 0.05)

Change within noise threshold.

Found 21 outliers among 100 measurements (21.00%)

2 (2.00%) low severe

1 (1.00%) low mild

2 (2.00%) high mild

16 (16.00%) high severe

init_from_seed/pcg64dxsm

time:   [12.438 ns 12.445 ns 12.454 ns]

change: [+0.6345% +0.7606% +0.8906%] (p = 0.00 < 0.05)

Change within noise threshold.

Found 12 outliers among 100 measurements (12.00%)

10 (10.00%) high mild

2 (2.00%) high severe

init_from_seed/chacha8  time:   [23.638 ns 23.651 ns 23.661 ns]

change: [-22.624% -22.567% -22.516%] (p = 0.00 < 0.05)

Performance has improved.

Found 15 outliers among 100 measurements (15.00%)

4 (4.00%) low severe

8 (8.00%) low mild

3 (3.00%) high mild

init_from_seed/chacha12 time:   [23.731 ns 23.748 ns 23.769 ns]

change: [-20.661% -20.589% -20.509%] (p = 0.00 < 0.05)

Performance has improved.

Found 10 outliers among 100 measurements (10.00%)

10 (10.00%) high mild

init_from_seed/chacha20 time:   [23.391 ns 23.428 ns 23.463 ns]

change: [-24.045% -23.938% -23.820%] (p = 0.00 < 0.05)

Performance has improved.

init_from_seed/std      time:   [32.974 ns 33.022 ns 33.071 ns]

change: [+9.8802% +10.022% +10.168%] (p = 0.00 < 0.05)

Performance has regressed.

Found 17 outliers among 100 measurements (17.00%)

3 (3.00%) high mild

14 (14.00%) high severe

init_from_seed/small    time:   [8.2168 ns 8.2229 ns 8.2302 ns]

change: [+0.3328% +0.4250% +0.5177%] (p = 0.00 < 0.05)

Change within noise threshold.

Found 21 outliers among 100 measurements (21.00%)

4 (4.00%) low mild

3 (3.00%) high mild

14 (14.00%) high severe
reseeding_bytes/chacha20_4k

time:   [361.22 µs 361.52 µs 361.80 µs]

thrpt:  [2.6992 GiB/s 2.7012 GiB/s 2.7035 GiB/s]

change:

time:   [-6.9830% -6.8503% -6.7178%] (p = 0.00 < 0.05)

thrpt:  [+7.2016% +7.3541% +7.5073%]

Performance has improved.

Found 3 outliers among 100 measurements (3.00%)

2 (2.00%) low severe

1 (1.00%) high severe

reseeding_bytes/chacha20_16k

time:   [357.40 µs 357.87 µs 358.39 µs]

thrpt:  [2.7249 GiB/s 2.7288 GiB/s 2.7324 GiB/s]

change:

time:   [-4.0674% -3.9165% -3.7616%] (p = 0.00 < 0.05)

thrpt:  [+3.9086% +4.0762% +4.2398%]

Performance has improved.

reseeding_bytes/chacha20_32k

time:   [358.21 µs 358.76 µs 359.30 µs]

thrpt:  [2.7179 GiB/s 2.7220 GiB/s 2.7262 GiB/s]

change:

time:   [-3.1127% -2.9629% -2.8104%] (p = 0.00 < 0.05)

thrpt:  [+2.8917% +3.0534% +3.2127%]

Performance has improved.

reseeding_bytes/chacha20_64k

time:   [358.22 µs 358.83 µs 359.42 µs]

thrpt:  [2.7171 GiB/s 2.7215 GiB/s 2.7261 GiB/s]

change:

time:   [-2.7345% -2.5840% -2.4238%] (p = 0.00 < 0.05)

thrpt:  [+2.4841% +2.6525% +2.8114%]

Performance has improved.

reseeding_bytes/chacha20_256k

time:   [357.83 µs 358.28 µs 358.75 µs]

thrpt:  [2.7221 GiB/s 2.7257 GiB/s 2.7291 GiB/s]

change:

time:   [-2.1898% -2.0414% -1.8872%] (p = 0.00 < 0.05)

thrpt:  [+1.9235% +2.0839% +2.2388%]

Performance has improved.

reseeding_bytes/chacha20_1024k

time:   [357.76 µs 358.15 µs 358.56 µs]

thrpt:  [2.7236 GiB/s 2.7267 GiB/s 2.7297 GiB/s]

change:

time:   [-2.4086% -2.2623% -2.1183%] (p = 0.00 < 0.05)

thrpt:  [+2.1641% +2.3146% +2.4681%]

Performance has improved.

Summary:

Many ~2% variations; some significantly larger
random_u32/std has -0.0394% deviation but random_u32/thread has +15.636% time which potentially indicates an issue with reseeding
random_u64/std has +3.3705% deviation but random_u64/thread has +20.088% time (concurs)
init_gen benches for ChaCha are approx -40% time (much faster); low importance
reseeding_bytes benchers are 2-4% more throughput (slightly faster)

newpavlov · 2025-11-21T13:38:00Z

I see a 50% slower performance for random_u32/thread. It's a pretty weird result which does not depend on the reseeding threshold. We probably have a problem with inlining somewhere. I need to inspect generated assembly to say more.

I don't think we should bother with sub-3% differences. Without a careful setup I get such difference for different benchmark runs on the same code.

dhardy · 2025-11-21T13:52:32Z

which does not depend on the reseeding threshold

Unsurprising; it's the cost of an extra check for each output. The question is whether the CPU can run this with minimal overhead; it appears that Zen 3 has lower overhead than M4 but neither is able to eliminate it.

Given how few cycles are typically required to generate a word, an extra check can be significant. This is why I'm interested in rust-random/rand_core#26.

I don't think we should bother with sub-3% differences.

No, I consider that insignificant (or barely significant if many benches show results skewed the same way).

newpavlov · 2025-11-21T14:19:56Z

Unsurprising; it's the cost of an extra check for each output.

Ah, I see. The old code performs the reseeding check only when cached block is exhausted. While in this PR we pay the (easily predictable) branch and decrement cost on every next_u* call.

I am still hesitant to keep the block traits just for ReseedingRng. I will play with some alternative ideas first, they probably will be a bit less clean than the block trait, but considering that the primary use of ReseedingRng is ThreadRng, it may be fine.

dhardy

Some comments on the impact of rust-random/rand_core#24 on block RNG implementations.

Summary: not enormously significant, but overall slightly negative in my (subjective) opinion (depending on one's view of the Results: Default requirement).

dhardy · 2025-11-24T09:15:59Z

rand_chacha/src/chacha.rs

 const BLOCK_WORDS: u8 = 16;

-#[repr(transparent)]
-pub struct Array64<T>([T; 64]);


This was needed because of the bound Results: Default and [_; 64] not implementing this.

It's a nice but ultimately unimportant improvement of the new utility fns.

dhardy · 2025-11-24T09:18:13Z

rand_chacha/src/chacha.rs

+        impl $ChaChaXRng {
+            fn buffer_index(&self) -> u32 {
+                self.buffer[0]
+            }
+
+            fn generate_and_set(&mut self, index: usize) {
+                assert!(index < self.buffer.len());
+                self.buffer[0] = if index != 0 {
+                    self.core.next_block(&mut self.buffer);
+                    index as u32
+                } else {
+                    self.buffer.len() as u32
+                }
+            }
        }


These fns are needed to support get/set word-pos fns. They were provided by BlockRng.

These feel like low-level details. They very much depend on using buffer[0] as an index, which isn't something properly documented anywhere.

We could fix this with a Buffer type, but this is undesirable. We could perhaps fix this with a Buffer trait, though it wouldn't be well documented.

dhardy · 2025-11-24T09:19:00Z

rand_chacha/src/chacha.rs

            fn from_seed(seed: Self::Seed) -> Self {
-                let core = $ChaChaXCore::from_seed(seed);
                Self {
-                    rng: BlockRng::new(core),
+                    core: $ChaChaXCore::from_seed(seed),
+                    buffer: le::new_buffer(),
                }
            }
        }

        impl RngCore for $ChaChaXRng {
            #[inline]
            fn next_u32(&mut self) -> u32 {
-                self.rng.next_u32()
+                let Self { core, buffer } = self;
+                le::next_word_via_gen_block(buffer, |block| core.next_block(block))
            }

            #[inline]
            fn next_u64(&mut self) -> u64 {
-                self.rng.next_u64()
+                let Self { core, buffer } = self;
+                le::next_u64_via_gen_block(buffer, |block| core.next_block(block))
            }

            #[inline]
-            fn fill_bytes(&mut self, bytes: &mut [u8]) {
-                self.rng.fill_bytes(bytes)
+            fn fill_bytes(&mut self, dst: &mut [u8]) {
+                let Self { core, buffer } = self;
+                le::fill_bytes_via_gen_block(dst, buffer, |block| core.next_block(block));
            }


This stuff all requires marginally lower-level implementations; not very significant.

newpavlov · 2025-12-08T13:08:50Z

With the updated implementation of ThreadRng the difference between next_u32/thread and next_u32/std is around 2% (1.45 ns vs 1.42 ns).

There is still a small performance difference when compared against master, but it's much smaller now. I will need to take a look at generated assembly to see what causes this difference.

dhardy · 2025-12-09T05:25:22Z

Why have you updated ReseedingRng yet not used it in ThreadRng? Is this a result of the PR evolving over time?

newpavlov · 2025-12-09T10:14:40Z

Because ReseedingRng is s part of the rand public API. If we are to (re)move it, I think it's better to do it in a separate PR.

IMO we should discourage it's use. Users arguably should just use a good PRNGs seeded just once.

newpavlov · 2025-12-15T10:44:38Z

Closing based on discussion in rust-random/rand_core#31.

dhardy · 2026-02-02T08:54:28Z

src/rngs/thread.rs

-// Number of generated bytes after which to reseed `ThreadRng`.
-// According to benchmarks, reseeding has a noticeable impact with thresholds
-// of 32 kB and less. We choose 64 kB to avoid significant overhead.
-const THREAD_RNG_RESEED_THRESHOLD: u64 = 1024 * 64;
+/// Number of generated ChaCha blocks (64 bytes) after which we reseed `ThreadRng`.
+///
+/// According to benchmarks, reseeding has a noticeable impact with thresholds
+/// of 32 KiB and less. We choose 64 KiB (64 * 1024) to avoid significant overhead.
+const THREAD_RNG_RESEED_THRESHOLD: u64 = 1024;


I believe this is wrong: four 16-word blocks (64 words total) are generated at once, but block_pos still refers to the 16-word block counter, hence this should be 4096.

No, ChaCha generates 64 byte blocks. The RNG implementation generates [u32; 64], i.e. 4 blocks or 256 bytes.

I know that. But the block counter gets incremented by 4 every time blocks are generated: https://github.com/RustCrypto/stream-ciphers/blob/master/chacha20/src/backends/soft.rs#L51

You use THREAD_RNG_RESEED_THRESHOLD to compare against value returned by block_pos, which tracks number of generated stream cipher blocks, not how many times the Generate::generate method was called.

block_pos still refers to the 16-word block counter, hence this should be 4096

We declare that reseeding happens after 64 KiB, not after 64K words.

You use THREAD_RNG_RESEED_THRESHOLD to compare against value returned by block_pos, which tracks number of generated stream cipher blocks, not how many times the Generate::generate method was called.

But it's still this counter, right?

We declare that reseeding happens after 64 KiB, not after 64K words.

Aha; 4-byte-words is the factor I was missing.

But it's still this counter, right?

What do you mean by "this counter"? Both THREAD_RNG_RESEED_THRESHOLD and get_block_pos work with the same "unit", i.e. number of generated stream cipher blocks.

Exactly that.

Migrate to new rand_core utility functions

e9f4e84

newpavlov mentioned this pull request Nov 19, 2025

Simplify trait BlockRngCore and rename to Generator rust-random/rand_core#26

Merged

1 task

newpavlov commented Nov 19, 2025

View reviewed changes

Cargo.toml Show resolved Hide resolved

newpavlov mentioned this pull request Nov 19, 2025

Remove rand_chacha #1687

Closed

dhardy reviewed Nov 19, 2025

View reviewed changes

src/rngs/reseeding.rs Show resolved Hide resolved

newpavlov added 5 commits November 19, 2025 20:25

port rand_chacha

f5cce31

patch rand_core for benches

90b9fee

fix some warnings

0baca57

use rand_chacha instead of chacha20

9ba4cad

fix benches

569262a

fix warning

3f4c4c8

fix doc

4767661

newpavlov added 3 commits November 19, 2025 21:07

uncomment default features

60baaba

fix rand_chacha docs

2154e9f

add inline attributes

9110387

remove criterion-cycles-per-byte dependency

e338cd2

dhardy mentioned this pull request Nov 24, 2025

New rand_core utility fns + Generator trait #1688

Closed

dhardy reviewed Nov 24, 2025

View reviewed changes

This was referenced Nov 30, 2025

Replace block module with helper functions rust-random/rand_core#24

Closed

Use new rand_core::block::Generator trait #1690

Closed

Block generation & buffering API design rust-random/rand_core#31

Closed

newpavlov added 5 commits December 8, 2025 14:42

Merge branch 'master' into rand_core/block_utils

28c538e

Update to BlockBuffer

de706e4

new ThreadRng impl

0045925

Add #[inline] to ThreadRng methods

8827d15

Add #[inline] to rand_chacha

f75c6d7

newpavlov added 2 commits December 8, 2025 16:32

tweak code

a6d5133

remove ThreadRngInner

363d488

newpavlov marked this pull request as ready for review December 9, 2025 16:21

newpavlov closed this Dec 15, 2025

newpavlov deleted the rand_core/block_utils branch December 19, 2025 09:46

newpavlov mentioned this pull request Jan 28, 2026

Use chacha20::ChaChaCore directly #1720

Closed

dhardy mentioned this pull request Jan 28, 2026

Remove ReseedingRng #1722

Merged

1 task

dhardy reviewed Feb 2, 2026

View reviewed changes

This was referenced Feb 2, 2026

Make {get,set}_block_pos inherent methods pub RustCrypto/stream-ciphers#516

Merged

Update chacha20: use ChaChaCore directly; remove bytes_until_reseed field #1726

Merged

Uh oh!

Conversation

newpavlov commented Nov 19, 2025

Uh oh!

Uh oh!

Uh oh!

newpavlov commented Nov 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dhardy commented Nov 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dhardy commented Nov 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

newpavlov commented Nov 19, 2025

Uh oh!

dhardy commented Nov 20, 2025

Uh oh!

newpavlov commented Nov 21, 2025

Uh oh!

newpavlov commented Nov 21, 2025

Uh oh!

dhardy commented Nov 21, 2025

Uh oh!

dhardy commented Nov 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

newpavlov commented Nov 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dhardy commented Nov 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

newpavlov commented Nov 21, 2025

Uh oh!

dhardy left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

newpavlov commented Dec 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dhardy commented Dec 9, 2025

Uh oh!

newpavlov commented Dec 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

newpavlov commented Dec 15, 2025

Uh oh!

dhardy Feb 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

newpavlov Feb 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

newpavlov Feb 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

newpavlov commented Nov 19, 2025 •

edited

Loading

dhardy commented Nov 19, 2025 •

edited

Loading

dhardy commented Nov 19, 2025 •

edited

Loading

dhardy commented Nov 21, 2025 •

edited

Loading

newpavlov commented Nov 21, 2025 •

edited

Loading

dhardy commented Nov 21, 2025 •

edited

Loading

newpavlov commented Dec 8, 2025 •

edited

Loading

newpavlov commented Dec 9, 2025 •

edited

Loading

dhardy Feb 2, 2026 •

edited

Loading

newpavlov Feb 2, 2026 •

edited

Loading

newpavlov Feb 2, 2026 •

edited

Loading