新增或更改内容#

NumPy 1.17.0 引入了 Generator 作为对旧版 RandomState 的改进替代品。以下是两种实现的快速比较。

特性	旧版等效项	注释
`Generator`	`RandomState`	`Generator` 需要一个流源，称为 `BitGenerator`。提供了许多这样的源。`RandomState` 默认使用梅森旋转算法 `MT19937`，但也可以用任何 BitGenerator 实例化。
`random`	`random_sample`，`rand`	访问 BitGenerator 中的值，将它们转换为区间 `[0.0., 1.0)` 中的 `float64`。除了 `size` 关键字参数外，现在还支持 `dtype='d'` 或 `dtype='f'`，以及一个 `out` 关键字参数来填充用户提供的数组。还支持许多其他分布。
`integers`	`randint`，`random_integers`	使用 `endpoint` 关键字参数来调整区间 `high` 端点的包含或排除。

正态、指数和伽马生成器使用 256 步 Ziggurat 方法，比 NumPy 中的默认实现快 2-10 倍 standard_normal、standard_exponential 或 standard_gamma。由于算法的改变，无法使用 Generator 来重现这些分布或任何依赖它们的分布方法的精确随机值。

In [1]: import numpy.random

In [2]: rng = np.random.default_rng()

In [3]: %timeit -n 1 rng.standard_normal(100000)
   ...: %timeit -n 1 numpy.random.standard_normal(100000)
   ...: 
936 us +- 4.33 us per loop (mean +- std. dev. of 7 runs, 1 loop each)
1.72 ms +- 11.5 us per loop (mean +- std. dev. of 7 runs, 1 loop each)

In [4]: %timeit -n 1 rng.standard_exponential(100000)
   ...: %timeit -n 1 numpy.random.standard_exponential(100000)
   ...: 
464 us +- 3.84 us per loop (mean +- std. dev. of 7 runs, 1 loop each)
1.23 ms +- 7.66 us per loop (mean +- std. dev. of 7 runs, 1 loop each)

In [5]: %timeit -n 1 rng.standard_gamma(3.0, 100000)
   ...: %timeit -n 1 numpy.random.standard_gamma(3.0, 100000)
   ...: 
1.75 ms +- 9.05 us per loop (mean +- std. dev. of 7 runs, 1 loop each)
3.48 ms +- 4.1 us per loop (mean +- std. dev. of 7 runs, 1 loop each)

integers 现在是从离散均匀分布生成整数随机数的规范方法。这同时取代了 randint 和已弃用的 random_integers。
rand 和 randn 方法只能通过旧版 RandomState 使用。
Generator.random 现在是生成浮点随机数的规范方法，它取代了 RandomState.random_sample、sample 和 ranf，所有这些都是别名。这与 Python 的 random.random 保持一致。
所有比特生成器都可以通过 CTypes (ctypes) 和 CFFI (cffi) 生成双精度浮点数、uint64 和 uint32。这允许这些比特生成器在 numba 中使用。
下游项目可以通过 Cython 使用比特生成器。
所有比特生成器都使用 SeedSequence 来将种子整数转换为初始化状态。
可选的 dtype 参数，接受 np.float32 或 np.float64 来为选择的分布生成单精度或双精度均匀随机变量。integers 接受具有任何有符号或无符号整数 dtype 的 dtype 参数。
- 均匀分布 (random 和 integers)
- 正态分布 (standard_normal)
- 标准伽马分布 (standard_gamma)
- 标准指数分布 (standard_exponential)

In [6]: rng = np.random.default_rng()

In [7]: rng.random(3, dtype=np.float64)
Out[7]: array([0.09583911, 0.93160588, 0.71947891])

In [8]: rng.random(3, dtype=np.float32)
Out[8]: array([0.50844425, 0.20221537, 0.7923881 ], dtype=float32)

In [9]: rng.integers(0, 256, size=3, dtype=np.uint8)
Out[9]: array([  4, 201, 126], dtype=uint8)

可选的 out 参数，允许为选择的分布填充现有数组
- 均匀分布 (random)
- 正态分布 (standard_normal)
- 标准伽马分布 (standard_gamma)
- 标准指数分布 (standard_exponential)
这允许使用合适的 BitGenerators 并行地将大型数组分块填充，从而实现多线程。

In [10]: rng = np.random.default_rng()

In [11]: existing = np.zeros(4)

In [12]: rng.random(out=existing[:2])
Out[12]: array([0.42493599, 0.03707944])

In [13]: print(existing)
[0.42493599 0.03707944 0.         0.        ]

对于诸如 choice、permutation 和 shuffle 之类的函数，添加了可选的 axis 参数，用于控制对多维数组执行操作的轴。

In [14]: rng = np.random.default_rng()

In [15]: a = np.arange(12).reshape((3, 4))

In [16]: a
Out[16]: 
array([[ 0,  1,  2,  3],
       [ 4,  5,  6,  7],
       [ 8,  9, 10, 11]])

In [17]: rng.choice(a, axis=1, size=5)
Out[17]: 
array([[ 1,  2,  2,  2,  0],
       [ 5,  6,  6,  6,  4],
       [ 9, 10, 10, 10,  8]])

In [18]: rng.shuffle(a, axis=1)        # Shuffle in-place

In [19]: a
Out[19]: 
array([[ 1,  3,  2,  0],
       [ 5,  7,  6,  4],
       [ 9, 11, 10,  8]])

添加了一个从复数正态分布采样的方法 (complex_normal)