What does KnowBert-UMLS forget?
Abstract
Integrating a source of structured prior knowledge, such as a knowledge graph, into transformer-based language models is an increasingly popular method for improving data efficiency and adapting these models to a target domain. However, most methods for integrating structured knowledge into language models require additional training to adapt the model to the non-textual modality. This process typically causes some amount of catastrophic forgetting in the general domain. KnowBert is one such knowledge integration method, capable of incorporating information from a variety of knowledge graphs to enhance the capabilities of transformer-based language models such as BERT. We conduct a qualitative analysis of the results of KnowBert-UMLS, a biomedically specialized KnowBert model, on a variety of linguistic tasks. Our results reveal that its increased understanding of biomedical concepts comes specifically at the cost of general common-sense knowledge and understanding of casual speech.
Main file
AICCSA_2023_Paper_IEEE_Guilhem_Piat_NoteIEEE.pdf (223.35 KB)
Origin: Files produced by the author(s)