TY - JOUR
T1 - Irrelevant Pixels are Everywhere: Find and Exclude Them for More Efficient Computer Vision
AU - Tung, Caleb
AU - Goel, Abhinav
AU - Hu, Xiao
AU - Eliopoulos, Nick
AU - Amobi, Emmanuel
AU - Thiruvathukal, George K
AU - Chaudhary, Vipin
AU - Lu, Yung-Hisang
N1 - Caleb Tung, Abhinav Goel, Xiao Hu, Nicholas Eliopoulos, Emmanuel Amobi, George Thiruvathukal, Vipin Chaudhary and Yung-Hsiang Lu, "Irrelevant Pixels are Everywhere: Find and Exclude Them for More Efficient Computer Vision", Proceedings of 2022 IEEE 4th International Conference on Artificial Intelligence Circuits and Systems (AICAS) , 2022.
PY - 2022/7/1
Y1 - 2022/7/1
N2 - Computer vision is often performed using Convolutional Neural Networks (CNNs). CNNs are compute-intensive and challenging to deploy on power-constrained systems such as mobile and Internet-of-Things (IoT) devices. CNNs are compute-intensive because they indiscriminately compute many features on all pixels of the input image. We observe that, given a computer vision task, images often contain pixels that are irrelevant to the task. For example, if the task is looking for cars, pixels in the sky are not very useful. Therefore, we propose that a CNN be modified to only operate on relevant pixels to save computation and energy. We propose a method to study three popular computer vision datasets, finding that 48% of pixels are irrelevant. We also propose the focused convolution to modify a CNN’s convolutional layers to reject the pixels that are marked irrelevant. On an embedded device, we observe no loss in accuracy, while inference latency, energy consumption, and multiply-add count are all reduced by about 45%.
AB - Computer vision is often performed using Convolutional Neural Networks (CNNs). CNNs are compute-intensive and challenging to deploy on power-constrained systems such as mobile and Internet-of-Things (IoT) devices. CNNs are compute-intensive because they indiscriminately compute many features on all pixels of the input image. We observe that, given a computer vision task, images often contain pixels that are irrelevant to the task. For example, if the task is looking for cars, pixels in the sky are not very useful. Therefore, we propose that a CNN be modified to only operate on relevant pixels to save computation and energy. We propose a method to study three popular computer vision datasets, finding that 48% of pixels are irrelevant. We also propose the focused convolution to modify a CNN’s convolutional layers to reject the pixels that are marked irrelevant. On an embedded device, we observe no loss in accuracy, while inference latency, energy consumption, and multiply-add count are all reduced by about 45%.
KW - computer vision
KW - low-power devices
KW - embedded systems
KW - datasets
UR - https://ecommons.luc.edu/cs_facpubs/293
M3 - Article
JO - Computer Science: Faculty Publications and Other Works
JF - Computer Science: Faculty Publications and Other Works
ER -