An optical neural network is a physical implementation of an artificial neural network with optical components. Early optical neural networks used a photorefractive volume hologram to interconnect arrays of input neurons to arrays of output neurons, with synaptic weights in proportion to the strength of the multiplexed hologram.[2] Volume holograms were further multiplexed using spectral hole burning, adding a wavelength dimension to the spatial ones to achieve four-dimensional interconnects between two-dimensional arrays of neural inputs and outputs.[3] This work prompted extensive research on alternative methods of using the strength of an optical interconnect to implement neuronal communication.[4]
Some artificial neural networks that have been implemented as optical neural networks include the Hopfield neural network[5] and the Kohonen self-organizing map with liquid crystal spatial light modulators.[6] Optical neural networks can also be based on the principles of neuromorphic engineering, creating neuromorphic photonic systems. Typically, these systems encode information in the network using spikes, mimicking the functionality of spiking neural networks in optical and photonic hardware. Photonic devices that have demonstrated neuromorphic functionalities include (among others) vertical-cavity surface-emitting lasers,[7][8] integrated photonic modulators,[9] optoelectronic systems based on superconducting Josephson junctions,[10] and systems based on resonant tunnelling diodes.[11]
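For context, the sketch below gives a minimal digital version of the Hopfield update rule that such optical implementations realize physically: the weight matrix-vector product corresponds to the optical interconnect, and the sign threshold to a nonlinear detection or feedback stage. The patterns, array sizes, and function names are illustrative and are not taken from the cited implementations.

```python
import numpy as np

def hopfield_weights(patterns: np.ndarray) -> np.ndarray:
    """Hebbian weight matrix from bipolar (+1/-1) patterns, with zero diagonal."""
    n = patterns.shape[1]                  # number of neurons
    w = patterns.T @ patterns / n
    np.fill_diagonal(w, 0.0)
    return w

def recall(w: np.ndarray, state: np.ndarray, steps: int = 10) -> np.ndarray:
    """Synchronous Hopfield updates: state <- sign(W @ state)."""
    for _ in range(steps):
        state = np.where(w @ state >= 0, 1, -1)
    return state

# Two stored patterns; an optical system would store these as interconnect weights.
patterns = np.array([[1, -1, 1, -1, 1, -1],
                     [1, 1, -1, -1, 1, 1]])
w = hopfield_weights(patterns)
noisy = np.array([1, -1, 1, -1, -1, -1])   # corrupted copy of the first pattern
print(recall(w, noisy))                     # recovers [ 1 -1  1 -1  1 -1]
```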
Biological neural networks function on an electrochemical basis, while optical neural networks use electromagnetic waves. Optical interfaces to biological neural networks can be created with optogenetics, but this is not the same as an optical neural network. In biological neural networks there exist many different mechanisms for dynamically changing the state of the neurons, including short-term and long-term synaptic plasticity. Synaptic plasticity is among the electrophysiological phenomena used to control the efficiency of synaptic transmission: long-term plasticity for learning and memory, and short-term plasticity for brief, transient changes in transmission efficiency. Implementing this with optical components is difficult and ideally requires advanced photonic materials. Properties that might be desirable in photonic materials for optical neural networks include the ability to change the efficiency with which they transmit light depending on the intensity of the incoming light.
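As a rough illustration of that last property, the sketch below models a hypothetical intensity-dependent transmittance acting as an optical activation function. The saturable, monotonic form and all parameter values are assumptions for illustration only, not the measured response of any specific photonic material.

```python
import numpy as np

def optical_activation(intensity: np.ndarray, i_sat: float = 1.0,
                       t_min: float = 0.1, t_max: float = 0.9) -> np.ndarray:
    """Output intensity through a hypothetical medium whose transmittance rises
    from t_min toward t_max as the incident intensity saturates it."""
    transmittance = t_min + (t_max - t_min) * intensity / (intensity + i_sat)
    return transmittance * intensity

# Illustrative use: weak signals are attenuated strongly, strong signals weakly,
# providing the nonlinearity needed between the linear (interconnect) layers.
inputs = np.linspace(0.0, 5.0, 6)          # incident intensities, arbitrary units
print(np.round(optical_activation(inputs), 3))
```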
With the increasing significance of computer vision in various domains, the computational cost of these tasks has grown, making it more important to develop new approaches to accelerate their processing. Optical computing has emerged as a potential alternative to GPU acceleration for modern neural networks, particularly as Moore's Law approaches its limits. Consequently, optical neural networks have garnered increased attention in the research community. Presently, two primary methods of optical neural computing are under research: silicon photonics and free-space optics. Each approach has its benefits and drawbacks: while silicon photonics may offer superior speed, it lacks the massive parallelism that free-space optics can deliver.

Given the substantial parallelism of free-space optics, researchers have focused on taking advantage of it. One implementation, proposed by Lin et al.,[12] involves the training and fabrication of phase masks for a handwritten-digit classifier. By stacking 3D-printed phase masks, light passing through the fabricated network can be read by an array of ten photodetectors, each representing one of the ten digit classes. Although this network can perform classification in the terahertz range, it lacks flexibility, as the phase masks are fabricated for a specific task and cannot be retrained. An alternative method for classification in free-space optics, introduced by Chang et al.,[13] employs a 4F system that relies on the convolution theorem to perform convolution operations. This system uses two lenses to execute the Fourier transforms of the convolution operation, enabling passive conversion into the Fourier domain without power consumption or latency. However, the convolution kernels in this implementation are also fabricated phase masks, limiting the device's functionality to specific convolutional layers of the network. In contrast, Li et al.[14] proposed a kernel-tiling technique that exploits the parallelism of the 4F system while using a digital micromirror device (DMD) instead of a phase mask. This approach allows users to upload various kernels into the 4F system and execute the entire network's inference on a single device. Unfortunately, modern neural networks are not designed for 4F systems, as they were primarily developed during the CPU/GPU era and tend to use low-resolution feature maps with a high number of channels.
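The 4F approach rests on the convolution theorem: convolution in the image plane is equivalent to multiplication in the Fourier plane, which the first lens reaches optically and the second lens inverts. The sketch below emulates this principle digitally with FFTs; it is only an illustration, and the array sizes and names are placeholders rather than parameters from the cited systems.

```python
import numpy as np

def conv2d_via_fourier(image: np.ndarray, kernel: np.ndarray) -> np.ndarray:
    """Circular 2D convolution computed as IFFT(FFT(image) * FFT(kernel)),
    the operation a 4F system performs with two lenses and a Fourier-plane mask."""
    padded_kernel = np.zeros_like(image, dtype=float)
    kh, kw = kernel.shape
    padded_kernel[:kh, :kw] = kernel       # kernel embedded at the image size
    spectrum = np.fft.fft2(image) * np.fft.fft2(padded_kernel)
    return np.real(np.fft.ifft2(spectrum))

# Illustrative usage with random data standing in for an input scene and a kernel.
rng = np.random.default_rng(0)
image = rng.random((28, 28))
kernel = rng.random((3, 3))
feature_map = conv2d_via_fourier(image, kernel)
print(feature_map.shape)                   # (28, 28): one channel of a conv layer
```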
As of 2007, one model of optical neural network was the Programmable Optical Array/Analogic Computer (POAC). It was implemented in 2000, based on a modified joint Fourier transform correlator (JTC) with bacteriorhodopsin (BR) as a holographic optical memory. Full parallelism, large array size, and the speed of light are the three promises offered by POAC for implementing an optical CNN. These were investigated in the following years, together with their practical limitations and considerations, yielding the design of the first portable POAC version.
The practical details – hardware (optical setups) and software (optical templates) – were published. However, POAC is a general-purpose, programmable array computer with a wide range of applications, including:
Taichi, from Tsinghua University in Beijing, is a hybrid ONN that combines the power efficiency and parallelism of optical diffraction with the configurability of optical interference. Taichi has 13.96 million parameters. It avoids the high error rates that afflict deep (multi-layer) networks by combining clusters of shallower diffractive units with arrays of interferometers for reconfigurable computation. Its encoding protocol divides large network models into sub-models that can be distributed across multiple chiplets in parallel.[15]
Taichi achieved 91.89% accuracy in tests with the Omniglot database. It was also used to generate music in the style of Bach and images in the styles of Van Gogh and Munch.[15]
The developers claimed an energy efficiency of up to 160 trillion operations per second per watt and an area efficiency of 880 trillion multiply-accumulate operations per square millimetre, making it roughly 1,000 times more energy efficient than the NVIDIA H100, and 100 times more energy efficient and 10 times more area efficient than previous ONNs.[15]
The time dimension has recently been introduced into diffractive neural networks by femtosecond (fs) laser lithography of perovskite hydration. The temporal behaviour of a neuron can be modulated by the fs laser at the nanoscale, enabling a programmable holographic neural network with temporal evolution functionality, i.e., the functionality can change with time under hydration stimuli. An in-memory temporal inference functionality was demonstrated that mimics the functional evolution of the human brain, i.e., the functionality can change over time from simple digit image classification to more complicated digit and clothing-product image classification. This is the first time the time dimension has been introduced into an optical neural network, laying a foundation for future brain-like photonic chip development.[16]