Talks
Summer 2022
![](https://old.simons.berkeley.edu/sites/default/files/styles/workshop_main/public/deep_learning_theory_1.png?itok=_xAuSzS8)
Tutorial: Training & Deploying Multi-Modal Models
Thursday, August 4th, 2022, 11:15 am–12:30 pm
Speaker:
Brandon McKinzie (Apple)
Location:
Calvin Lab Auditorium
Abstract: Modern multi-modal models such as CLIP require significant engineering efforts to efficiently train, evaluate, and deploy. Furthermore, such models typically serve as a backbone feature extractor for many downstream tasks. This talk will provide an overview of how we’ve accomplished this at Apple, where CLIP now powers a large number of user experiences on iOS. We’ll cover concepts such as multi-node multi-gpu distributed training on billions of examples, transfer learning for downstream tasks, model pruning, efficient on-device inference with transformers, and more.