Supervised Contrastive Learning as Multi-Objective Optimization for Fine-Tuning Large Pre-trained Language Models

Youness Moukafih; Mounir Ghogho; Kamel Smaïli

doi:10.48550/arXiv.2209.14161

Pré-Publication, Document De Travail Année : 2022

Supervised Contrastive Learning as Multi-Objective Optimization for Fine-Tuning Large Pre-trained Language Models

(1, 2) , (1) , (2)

1
2

Youness Moukafih

Fonction : Auteur
PersonId : 1112452

Université Internationale de Rabat

Statistical Machine Translation and Speech Modelization and Text

Mounir Ghogho

Fonction : Auteur

Université Internationale de Rabat

Kamel Smaïli

Fonction : Auteur
PersonId : 2521
IdHAL : kamel-smaili
IdRef : 034429700

Statistical Machine Translation and Speech Modelization and Text

Résumé

Recently, Supervised Contrastive Learning (SCL) has been shown to achieve excellent performance in most classification tasks. In SCL, a neural network is trained to optimize two objectives: pull an anchor and positive samples together in the embedding space, and push the anchor apart from the negatives. However, these two different objectives may conflict, requiring trade-offs between them during optimization. In this work, we formulate the SCL problem as a Multi-Objective Optimization problem for the fine-tuning phase of RoBERTa language model. Two methods are utilized to solve the optimization problem: (i) the linear scalarization (LS) method, which minimizes a weighted linear combination of pertask losses; and (ii) the Exact Pareto Optimal (EPO) method which finds the intersection of the Pareto front with a given preference vector. We evaluate our approach on several GLUE benchmark tasks, without using data augmentations, memory banks, or generating adversarial examples. The empirical results show that the proposed learning strategy significantly outperforms a strong competitive contrastive learning baseline.

Mots clés

Few-shot Learning Multi-opjective Optimization Text Classification

Domaines

Informatique et langage [cs.CL]

PreprintXarch.pdf (651.69 Ko)

Kamel Smaïli : Connectez-vous pour contacter le contributeur

https://hal.science/hal-03792475

Soumis le : vendredi 30 septembre 2022-10:47:53

Dernière modification le : lundi 11 septembre 2023-17:41:19

Dates et versions

hal-03792475 , version 1 (30-09-2022)

Identifiants

HAL Id : hal-03792475 , version 1
DOI : 10.48550/arXiv.2209.14161

Citer

Youness Moukafih, Mounir Ghogho, Kamel Smaïli. Supervised Contrastive Learning as Multi-Objective Optimization for Fine-Tuning Large Pre-trained Language Models. 2022. ⟨hal-03792475⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA UNIV-LORRAINE LORIA LORIA-NLPKD

48 Consultations

16 Téléchargements

Supervised Contrastive Learning as Multi-Objective Optimization for Fine-Tuning Large Pre-trained Language Models

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager