Investigating Large Vision Model Training Challenges on Satellite Datasets

Hitesh Jain; Sagar Verma; Siddharth Gupta

Communication Dans Un Congrès Année : 2023

Investigating Large Vision Model Training Challenges on Satellite Datasets

(1, 2) , (3, 4, 2) , (2)

1
2
3
4

Hitesh Jain

Fonction : Auteur
PersonId : 1291045

Indian Institute of Technology [Gandhinagar]

Granular.ai

Sagar Verma

Fonction : Auteur
PersonId : 743250
IdHAL : versag

OPtimisation Imagerie et Santé

Centre de vision numérique

Granular.ai

Siddharth Gupta

Fonction : Auteur

Granular.ai

Résumé

Contrastive learning methods that bridge textual descriptions and images, such as Contrastive Language-Image Pre-training (CLIP), have demonstrated remarkable advancements. These foundational models have shown exceptional performance in tasks related to zero-shot image classification, as evidenced by their substantial enhancement of zero-shot ImageNet accuracy from the prior state-of-the-art of 12\% to an impressive 76\%. However, the exposure of these models to satellite images during training has been limited, resulting in suboptimal performance when dealing with geospatial data. This limitation raises a pivotal question: Can these foundational models, which have demonstrated potential across multiple domains, be trained on geospatial imagery out-of-box? To answer this question, we perform a study on training CLIP on diverse geospatial datasets. Within our research, we delve into unique challenges in this context and discuss the strategies we employ to address these challenges effectively. We demonstrate that handling resolution is crucial when training CLIP like models on a large multi-resolution dataset.

Mots clés

remote sensing neural networks robustness

Domaines

Informatique [cs] Intelligence artificielle [cs.AI] Traitement des images [eess.IV]

Fichier principal

Investigating_LVM_on_GeoSpatial_Data.pdf (7.86 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Sagar Verma : Connectez-vous pour contacter le contributeur

https://hal.science/hal-04231035

Soumis le : vendredi 6 octobre 2023-13:44:51

Dernière modification le : vendredi 17 novembre 2023-16:05:57

Archivage à long terme le : dimanche 7 janvier 2024-18:43:02

Dates et versions

hal-04231035 , version 1 (06-10-2023)

Identifiants

HAL Id : hal-04231035 , version 1

Citer

Hitesh Jain, Sagar Verma, Siddharth Gupta. Investigating Large Vision Model Training Challenges on Satellite Datasets. InGARSS 2023 - India Geoscience and Remote Sensing Symposium, IEEE, Dec 2023, Bengaluru, India. ⟨hal-04231035⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INRIA CVN CENTRALESUPELEC INRIA2 UNIV-PARIS-SACLAY GS-COMPUTER-SCIENCE HUB-IA

125 Consultations

24 Téléchargements

Investigating Large Vision Model Training Challenges on Satellite Datasets

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager