Theano

Theano
原作者	蒙特利尔大学的蒙特利尔学习算法研究所（英语：Montreal Institute for Learning Algorithms）（MILA）
開發者	PyMC开发团队
首次发布	2007年，17年前
当前版本	2.25.5（2024年10月3日；穩定版本）;
源代码库	github.com/pymc-devs/pytensor;
编程语言	Python, CUDA
平台	Linux, macOS, Windows
类型	机器学习, 函式库
许可协议	3条款BSD许可证
网站	pytensor.readthedocs.io/en/latest/

Theano及其分叉PyTensor，是一个Python库和优化的编译器，用来操纵和求值数学表达式特别是矩阵值表达式^[2]。在其中，计算使用NumPy风格的语法来表达并被编译，用来在CPU或者GPU架构上高效的运行。

历史

Theano是开源项目^[3]，主要由蒙特利尔大学的蒙特利尔学习算法研究所（英语：Montreal Institute for Learning Algorithms）（MILA）开发^[4]。软件名字取自古代哲学家Theano（英语：Theano (philosopher)）。在2017年9月28日，Pascal Lamblin发布了来自约书亚·本希奥的一则信息，MILA负责人说：由于更强大的工业参与者的竞争，主要的开发在1.0发行之后将会停止^[5]。Theano 1.0.0随后在2017年11月15日发行^[6]。

在2018年5月17日，Chris Fonnesbeck代表PyMC开发团队写道：PyMC开发者将在他们退场后取得对Theano维护的控制权^[7]。在2021年1月绝大部份的Theano代码基被重新建造，并增加了通过JAX和Numba的编译，修订后的这个计算后端以新名字Aesara发行。2022年11月28日，PyMC团队宣布采用从Aesara计划分叉出PyTensor^[8]。

样例代码

下列代码以PyTensor用作介绍的例子：

import pytensor
from pytensor import tensor as pt

# 声明2个符号浮点标量
a = pt.dscalar("a")
b = pt.dscalar("b")

# 建立一个简单的表达式
c = a + b

# 将这个表达式转换成一个可调用对象，
# 它接收'(a, b)'值作为输入并计算出一个值给'c'
f_c = pytensor.function([a, b], c)

assert f_c(1.5, 2.5) == 4.0

# 计算样例表达式关于'a'的梯度
dc = pytensor.grad(c, a)

f_dc = pytensor.function([a, b], dc)

assert f_dc(1.5, 2.5) == 1.0

>>> import pytensor
>>> from pytensor import tensor as pt
>>>
>>> # 通过'pytensor.function'编译函数还能优化表达式图
>>> # 它会移除不必要的运算并将特定运算替代为更有效的运算
>>> 
>>> v = pt.vector("v")
>>> M = pt.matrix("M")
>>> 
>>> d = a/a + (M + a).dot(v)
>>> 
>>> pytensor.dprint(d)
Add [id A]
 ├─ ExpandDims{axis=0} [id B]
 │  └─ True_div [id C]
 │     ├─ a [id D]
 │     └─ a [id D]
 └─ dot [id E]
    ├─ Add [id F]
    │  ├─ M [id G]
    │  └─ ExpandDims{axes=[0, 1]} [id H]
    │     └─ a [id D]
    └─ v [id I]
<_io.TextIOWrapper name='<stdout>' mode='w' encoding='utf-8'>
>>> 
>>> f_d = pytensor.function([a, v, M], d)
>>> 
>>> # 'a/a' -> '1'而点积被替代为BLAS函数(i.e. CGemv)
>>> pytensor.dprint(f_d)
Add [id A] 5
 ├─ [1.] [id B]
 └─ CGemv{inplace} [id C] 4
    ├─ AllocEmpty{dtype='float64'} [id D] 3
    │  └─ Shape_i{0} [id E] 2
    │     └─ M [id F]
    ├─ 1.0 [id G]
    ├─ Add [id H] 1
    │  ├─ M [id F]
    │  └─ ExpandDims{axes=[0, 1]} [id I] 0
    │     └─ a [id J]
    ├─ v [id K]
    └─ 0.0 [id L]
<_io.TextIOWrapper name='<stdout>' mode='w' encoding='utf-8'>

参见

深度学习软件比较（英语：Comparison of deep learning software）
可微分编程

引用

^ Release 2.25.5. 2024年10月3日 [2024年10月27日].
^ Bergstra, J.; O. Breuleux; F. Bastien; P. Lamblin; R. Pascanu; G. Desjardins; J. Turian; D. Warde-Farley; Y. Bengio. Theano: A CPU and GPU Math Expression Compiler (PDF). Proceedings of the Python for Scientific Computing Conference (SciPy) 2010. 30 June 2010 [2020-11-06]. （原始内容存档 (PDF)于2020-11-01）.
^ Github Repository. [2020-11-06]. （原始内容存档于2020-11-16）.
^ deeplearning.net. [2020-11-06]. （原始内容存档于2017-12-13）.
^ Lamblin, Pascal. MILA and the future of Theano. theano-users (邮件列表). 28 September 2017 [28 September 2017]. （原始内容存档于2011-01-22）.
^ Release Notes – Theano 1.0.0 documentation. [2020-11-06]. （原始内容存档于2020-09-14）.
^ Developers, PyMC. Theano, TensorFlow and the Future of PyMC. Medium. 2019-06-01 [2019-08-27]. （原始内容存档于2020-08-06）（英语）.
^ PyMC forked Aesara to PyTensor. [2023-08-17]. （原始内容存档于2023-07-18）.

外部链接

官方网站 (GitHub)
Theano（页面存档备份，存于互联网档案馆） at Deep Learning, Université de Montréal

[wikidata-752cd249a213494535ad3e11a6383d8ff9d704f3-v3-1] Release 2.25.5. 2024年10月3日 [2024年10月27日].

[2] Bergstra, J.; O. Breuleux; F. Bastien; P. Lamblin; R. Pascanu; G. Desjardins; J. Turian; D. Warde-Farley; Y. Bengio. Theano: A CPU and GPU Math Expression Compiler (PDF). Proceedings of the Python for Scientific Computing Conference (SciPy) 2010. 30 June 2010 [2020-11-06]. （原始内容存档 (PDF)于2020-11-01）.

[3] Github Repository. [2020-11-06]. （原始内容存档于2020-11-16）.

[4] rning.net. [2020-11-06]. （原始内容存档于2017-12-13）.

[5] Lamblin, Pascal. MILA and the future of Theano. theano-users (邮件列表). 28 September 2017 [28 September 2017]. （原始内容存档于2011-01-22）.

[6] Release Notes – Theano 1.0.0 documentation. [2020-11-06]. （原始内容存档于2020-09-14）.

[7] Developers, PyMC. Theano, TensorFlow and the Future of PyMC. Medium. 2019-06-01 [2019-08-27]. （原始内容存档于2020-08-06）（英语）.

[8] PyMC forked Aesara to PyTensor. [2023-08-17]. （原始内容存档于2023-07-18）.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

查论编深度学习软件（英语：Comparison of deep learning software）
开源软件	Apache Singa（英语：Apache Singa） Blocks（英语：Blocks） Caffe Deeplearning4j Dlib（英语：Dlib） Microsoft Cognitive Toolkit MXNet OpenNN PyTorch scikit-learn LangChain Gradio RETURNN（英语：RETURNN） TensorFlow Keras Theano Torch（英语：Torch (machine learning)）
专有	Neural Designer（英语：Neural Designer） Wolfram Mathematica
分类比较