WebJan 19, 2024 · We propose a form of gradient surgery that projects a task's gradient onto the normal plane of the gradient of any other task that has a conflicting gradient. On a series of challenging multi-task supervised and multi-task RL problems, this approach leads to substantial gains in efficiency and performance. WebJan 5, 2024 · The objective of multi-task learning (MTL) [ 3, 26] is to develop methods that can tackle a large variety of tasks within a single model. MTL has multiple practical benefits. First, learning shared parameters across multiple tasks leads to representations that can be more data-efficient to train and also generalize better to unseen data.
Gradient Surgery for Multi-Task Learning DeepAI
WebWe propose a form of gradient surgery that projects a task's gradient onto the normal plane of the gradient of any other task that has a conflicting gradient. On a series of … WebMDL-NAS: A Joint Multi-domain Learning framework for Vision Transformer Shiguang Wang · TAO XIE · Jian Cheng · Xingcheng ZHANG · Haijun Liu Independent Component Alignment for Multi-Task Learning Dmitry Senushkin · Nikolay Patakin · Arsenii Kuznetsov · Anton Konushin Revisiting Prototypical Network for Cross Domain Few-Shot Learning ioannis theofilakis
Gradient Surgery for Multi-Task Learning OpenReview
WebAbstract: Multi-task learning technique is widely utilized in machine learning modeling where commonalities and differences across multiple tasks are exploited. However, multiple conflicting objectives often occur in multi-task learning. ... Moreover, the gradient surgery for the multi-gradient descent algorithm is proposed to obtain a stable ... WebJan 19, 2024 · We propose a form of gradient surgery that projects a task's gradient onto the normal plane of the gradient of any other task that has a conflicting gradient. WebWe propose a form of gradient surgery that projects a task's gradient onto the normal plane of the gradient of any other task that has a gradient. On a series of challenging … on set death