Xingyuan Xu1, Mengxi Tan1, Bill Corcoran2, Jiayang Wu1, Andreas Boes3, Thach G. Nguyen3, Sai T. Chu4, Brent E. Little5, Damien G. Hicks1,6, Roberto Morandotti7,8, Arnan Mitchell3, and and David J. Moss1
Corresponding Author: ${correspondingAuthorString}
Abstract: Convolutional neural networks (CNNs), inspired by biological visual cortex systems, are a powerful category of artificial neural networks that can extract the hierarchical features of raw data to greatly reduce the network parametric complexity and enhance the predicting accuracy. They are of significant interest for machine learning tasks such as computer vision, speech recognition, playing board games and medical diagnosis [1-7]. Optical neural networks offer the promise of dramatically accelerating computing speed to overcome the inherent bandwidth bottleneck of electronics. Here, we demonstrate a universal optical vector convolutional accelerator operating beyond 10 Tera-FLOPS (floating point operations per second), generating convolutions of images of 250,000 pixels with 8-bit resolution for 10 kernels simultaneously — enough for facial image recognition. We then use the same hardware to sequentially form a deep optical CNN with ten output neurons, achieving successful recognition of full 10 digits with 900 pixel handwritten digit images with 88% accuracy. Our results are based on simultaneously interleaving temporal, wavelength and spatial dimensions enabled by an integrated microcomb source. This approach is scalable and trainable to much more complex networks for demanding applications such as unmanned vehicle and real-time video recognition.
Key words: Microcombs; Neural Networks; Convolutional Accelerator; Photonics; Deep Optical Neural Networks; Ultrahigh speed
Cite as: JOSarXiv.202102.0002
Version History
Article views: 135 Times PDF downloads: 6 Times
Manuscript received: 24 February 2021
Manuscript published: 24 February 2021
JOSarXiv © 2019 All Rights Reserved