TY - JOUR
T1 - Reusing Deep Learning Models: Challenges and Directions in Software Engineering
AU - Davis, James C
AU - Jajal, Purvish
AU - Jiang, Wenxin
AU - Schorlemmer, Taylor R.
AU - Synovic, Nicholas
AU - Thiruvathukal, George K.
N1 - Davis, James C.; Jajal, Purvish; Jiang, Wenxin; Schorlemmer, Taylor R; Synovic, Nicholas; Thiruvathukal, George K., Reusing Deep Learning Models: Challenges and Directions in Software Engineering. IEEE JVA Symposium on Modern Computing at IEEE Services 2023, https://doi.org/10.6084/m9.figshare.23317556.v1
PY - 2023/7/1
Y1 - 2023/7/1
N2 - Deep neural networks (DNNs) achieve state-of-the-art performance in many areas, including computer vision, system configuration, and question-answering. However, DNNs are expensive to develop, both in intellectual effort (e.g., devising new architectures) and computational costs (e.g., training). Re-using DNNs is a promising direction to amortize costs within a company and across the computing industry. As with any new technology, however, there are many challenges in re-using DNNs. These challenges include both missing technical capabilities and missing engineering practices. This vision paper describes challenges in current approaches to DNN re-use. We summarize studies of re-use failures across the spectrum of re-use techniques, including conceptual (e.g., re-using based on a research paper), adaptation (e.g., re-using by building on an existing implementation), and deployment (e.g., direct re-use on a new device). We outline possible advances that would improve each kind of re-use.
AB - Deep neural networks (DNNs) achieve state-of-the-art performance in many areas, including computer vision, system configuration, and question-answering. However, DNNs are expensive to develop, both in intellectual effort (e.g., devising new architectures) and computational costs (e.g., training). Re-using DNNs is a promising direction to amortize costs within a company and across the computing industry. As with any new technology, however, there are many challenges in re-using DNNs. These challenges include both missing technical capabilities and missing engineering practices. This vision paper describes challenges in current approaches to DNN re-use. We summarize studies of re-use failures across the spectrum of re-use techniques, including conceptual (e.g., re-using based on a research paper), adaptation (e.g., re-using by building on an existing implementation), and deployment (e.g., direct re-use on a new device). We outline possible advances that would improve each kind of re-use.
KW - machine learning
KW - deep learning
KW - pre-trained models
KW - re-use
KW - empirical software engineering
KW - position paper
KW - vision paper
UR - https://ecommons.luc.edu/cs_facpubs/325
M3 - Article
JO - Computer Science: Faculty Publications and Other Works
JF - Computer Science: Faculty Publications and Other Works
ER -