Reusing Deep Learning Models: Challenges and Directions in Software Engineering

James C Davis, Purvish Jajal, Wenxin Jiang, Taylor R. Schorlemmer, Nicholas Synovic, George K. Thiruvathukal

Research output: Contribution to journalArticlepeer-review

Abstract

Deep neural networks (DNNs) achieve state-of-the-art performance in many areas, including computer vision, system configuration, and question-answering. However, DNNs are expensive to develop, both in intellectual effort (e.g., devising new architectures) and computational costs (e.g., training). Re-using DNNs is a promising direction to amortize costs within a company and across the computing industry. As with any new technology, however, there are many challenges in re-using DNNs. These challenges include both missing technical capabilities and missing engineering practices. This vision paper describes challenges in current approaches to DNN re-use. We summarize studies of re-use failures across the spectrum of re-use techniques, including conceptual (e.g., re-using based on a research paper), adaptation (e.g., re-using by building on an existing implementation), and deployment (e.g., direct re-use on a new device). We outline possible advances that would improve each kind of re-use.

Original languageAmerican English
JournalComputer Science: Faculty Publications and Other Works
StatePublished - Jul 1 2023

Keywords

  • machine learning
  • deep learning
  • pre-trained models
  • re-use
  • empirical software engineering
  • position paper
  • vision paper

Disciplines

  • Artificial Intelligence and Robotics
  • Computer Sciences
  • Software Engineering

Cite this