Deep Learning and Optimization Seminar
Organizing Committee:
- Dr. Tao LIN, Assistant Professor @ Westlake University
- Dr. Chen LIU, Assistant Professor @ City University of Hong Kong
- Dr. Kun YUAN, Assistant Professor @ Peking University
Contact & Discussion: You are welcome to join our Google group and our Slack.
Coming Up
Date (yyyy-mm-dd hh:mm) | Presenter | Topic or Paper | Materials |
---|---|---|---|
2023-?-? 16:30-17:30 (BJT) | Anastasia Koloskova | ? | paper1, paper2. |
Past Events
Date | Presenter | Topic or Paper | Materials |
---|---|---|---|
2023-07-20 10:00-11:00 (BJT) | Libin Zhu, UCSD | Spikes in the training loss of SGD, catapults and feature learning | abstract, slides, video. |
2023-07-17 16:00-17:00 | Fanghui Liu, EPFL | The role of over-parameterization in machine learning - a function space perspective | abstract, slides, video. |
2023-07-11 19:00-20:00 | Tongtian Zhu, ZJU | Decentralize to Generalize? On the Asymptotic Equivalence of Decentralized SGD and Average-direction SAM | abstract, slides, video, paper. |
2023-06-28 16:00-17:00 | Francesco Croce, University of Tübingen | How to Quickly Obtain Models Robust to Multiple and Different Threats, and Their Advantages | abstract, video. |
2023-06-19 15:00-16:00 | Binhang Yuan, HKUST | Accommodating LLM Training over Decentralized Computational Resources | abstract, slides, video. |
2023-06-07 19:00-20:00 | Bohang Zhang, Peking University | Rethinking the expressive power of GNNs via graph biconnectivity | abstract, slides, video, paper. |
2023-05-31 15:30-16:30 | Jiaxin Shi, Google DeepMind | Sequence Modeling with Multiresolution Convolutional Memory | abstract, paper. |
2023-04-28 09:00-10:00 | Ziming Liu, MIT | Physics of deep learning: Understanding grokking via the lens of physics | abstract, slides. |
2023-04-24 19:30-20:30 | Dingfan Chen, CISPA | Privacy-preserving Generative Modeling | abstract, slides. |
2023-04-21 16:00-17:00 | Jialin Liu, DAMO Academy (US) | Towards Constituting Mathematical Structures for Learning to Optimize | abstract, slides, paper. |
2023-04-12 17:00-18:00 | Maksym Andriushchenko, EPFL | SGD with large step sizes learns sparse features | abstract, video, paper. |
2022-12-01 16:00-17:30 | Ligeng Zhu, MIT | Algorithm-System Co-Design for TinyML | abstract, paper, slides, website, demo, code. |