Fisher Information-based Efficient Curriculum Federated Learning with Large Language Models

Ji Liu; Jiaxiang Ren; Ruoming Jin; Zijie Zhang; Yang Zhou; Patrick Valduriez; Dejing Dou

Communication Dans Un Congrès Année : 2024

Fisher Information-based Efficient Curriculum Federated Learning with Large Language Models

(1) , (2) , (3) , (4) , (2) , (5, 6) , (7)

1
2
3
4
5
6
7

Ji Liu

Fonction : Auteur
PersonId : 1426848

Hithink RoyalFlush Information Network Co

Jiaxiang Ren

Fonction : Auteur

Auburn University

Ruoming Jin

Fonction : Auteur
PersonId : 1390581
ORCID : 0000-0003-1895-4243

Kent State University

Zijie Zhang

Fonction : Auteur

The University of Texas at San Antonio

Yang Zhou

Fonction : Auteur
PersonId : 1390557
ORCID : 0000-0001-7839-4933

Auburn University

Patrick Valduriez

Fonction : Auteur
PersonId : 172604
IdHAL : patrick-valduriez
ORCID : 0000-0001-6506-7538
IdRef : 028314417

Scientific Data Management

Laboratorio Nacional de Computação Cientifica [Rio de Janeiro]

Dejing Dou

Fonction : Auteur
PersonId : 1220583
ORCID : 0000-0003-2949-6874

Fudan University [Shanghai]

Résumé

As a promising paradigm to collaboratively train models with decentralized data, Federated Learning (FL) can be exploited to fine-tune Large Language Models (LLMs). While LLMs correspond to huge size, the scale of the training data significantly increases, which leads to tremendous amounts of computation and communication costs. The training data is generally non-Independent and Identically Distributed (non-IID), which requires adaptive data pro- cessing within each device. Although Low-Rank Adaptation (LoRA) can significantly reduce the scale of parameters to update in the fine-tuning process, it still takes unaffordable time to transfer the low-rank parameters of all the layers in LLMs. In this paper, we propose a Fisher Information-based Efficient Curriculum Federated Learning framework (FibecFed) with two novel methods, i.e., adaptive federated curriculum learning and efficient sparse parameter update. First, we propose a fisher information- based method to adaptively sample data within each device to improve the effectiveness of the FL fine-tuning process. Second, we dynamically select the proper layers for global aggregation and sparse parameters for local update with LoRA so as to improve the efficiency of the FL fine-tuning process. Extensive experimental results based on 10 datasets demonstrate that FibecFed yields excellent performance (up to 45.35% in terms of accuracy) and superb fine-tuning speed (up to 98.61% faster) com- pared with 17 baseline approaches). Our code will be publicly available.

Mots clés

Large Language Model Federated learning Fisher Information

Domaines

Informatique [cs]

Fichier principal

EMNLP2024.pdf (916.57 Ko)

Origine	Fichiers produits par l'(les) auteur(s)
Licence	Paternité - Pas d'utilisation commerciale - Partage selon les Conditions Initiales

Patrick Valduriez : Connectez-vous pour contacter le contributeur

https://hal-lirmm.ccsd.cnrs.fr/lirmm-04734309

Soumis le : dimanche 13 octobre 2024-23:31:10

Dernière modification le : mercredi 13 novembre 2024-21:49:25

Dates et versions

lirmm-04734309 , version 1 (13-10-2024)

Licence

Paternité - Pas d'utilisation commerciale - Partage selon les Conditions Initiales

Identifiants

HAL Id : lirmm-04734309 , version 1

Citer

Ji Liu, Jiaxiang Ren, Ruoming Jin, Zijie Zhang, Yang Zhou, et al.. Fisher Information-based Efficient Curriculum Federated Learning with Large Language Models. EMNLP 2024 - Conference on Empirical Methods in Natural Language Processing, ACL SIGDAT, Nov 2024, Miami, Fl, United States. pp.1-27. ⟨lirmm-04734309⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA ZENITH LIRMM INRIA2 UNIV-MONTPELLIER INRIA-BRASIL

42 Consultations

23 Téléchargements

Fisher Information-based Efficient Curriculum Federated Learning with Large Language Models

Résumé

Mots clés

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Partager