DeepSpeed with Hugging Face: testing the optimizer and lr_scheduler arguments

In Trainer

First, how the DeepSpeed schedulers map to the Hugging Face options:
WarmupDecayLR = --lr_scheduler_type linear
WarmupLR = --lr_scheduler_type constant_with_warmup
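
To make the two shapes concrete, here is a minimal pure-Python sketch of the learning-rate multipliers behind `linear` and `constant_with_warmup`, modeled on the Hugging Face schedules. The function names and the step counts in the usage lines (15 warmup steps, 100 total steps) are illustrative assumptions. DeepSpeed's own warmup curve can differ, for example depending on its `warmup_type` setting.

```python
def linear_with_warmup(step: int, warmup_steps: int, total_steps: int) -> float:
    """Multiplier on the base lr: linear ramp-up, then linear decay to 0."""
    if step < warmup_steps:
        return step / max(1, warmup_steps)
    return max(0.0, (total_steps - step) / max(1, total_steps - warmup_steps))


def constant_with_warmup(step: int, warmup_steps: int) -> float:
    """Multiplier on the base lr: linear ramp-up, then held at 1.0."""
    if step < warmup_steps:
        return step / max(1, warmup_steps)
    return 1.0


# Illustrative values only (assumed, not taken from the experiments).
linear_curve = [linear_with_warmup(s, 15, 100) for s in range(101)]
constant_curve = [constant_with_warmup(s, 15) for s in range(101)]
```

Multiplying each value by the base learning rate gives the per-step learning rate that the plots in the cases below would show.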

1

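TrainingArguments: lr_scheduler_type and optim are not passed; warmup_steps=15.
The ds config file is defined as follows:
[Image: ds config]
Note: if warmup_steps is not passed through TrainingArguments and is set only in the ds config, an error is raised, so the two need to be used together.

The resulting learning rate:
[Image: learning rate curve]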
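
Since the config itself is only available as an image, the following is just a sketch of what a setup like the ones tested here might look like: a DeepSpeed config that defines both an optimizer and a `WarmupDecayLR` scheduler, with `"auto"` placeholders that the Hugging Face integration fills in from `TrainingArguments`. The chosen keys, the ZeRO stage, and the file name `ds_config.json` are assumptions, not the author's actual config.

```python
import json

# Assumed example config; "auto" lets the Hugging Face integration fill the
# value from TrainingArguments (e.g. warmup_num_steps from warmup_steps).
ds_config = {
    "optimizer": {
        "type": "AdamW",
        "params": {"lr": "auto", "betas": "auto", "eps": "auto", "weight_decay": "auto"},
    },
    "scheduler": {
        "type": "WarmupDecayLR",
        "params": {
            "warmup_min_lr": "auto",
            "warmup_max_lr": "auto",
            "warmup_num_steps": "auto",
            "total_num_steps": "auto",
        },
    },
    "zero_optimization": {"stage": 2},  # assumed stage
    "train_micro_batch_size_per_gpu": "auto",
    "gradient_accumulation_steps": "auto",
}

# Serialized form, ready to be saved as a (hypothetical) ds_config.json.
config_text = json.dumps(ds_config, indent=2)

# Matching training flags: warmup is set on the Hugging Face side.
cli_args = ["--deepspeed", "ds_config.json", "--warmup_steps", "15"]
```

Keeping the warmup value in `TrainingArguments` and leaving the DeepSpeed side as `"auto"` is one way to follow the note above about using the two together.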

2

TrainingArguments: lr_scheduler_type and optim are not passed; warmup_steps=15.
The ds config file is defined as follows:
[Image: ds config]
The resulting learning rate:

3

TrainingArguments: optim is not passed; warmup_steps=15 and lr_scheduler_type=constant_with_warmup.
The ds config file is defined as follows:
[Image: ds config]
The resulting learning rate:

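Conclusion: the optimizer and learning-rate scheduler defined in the DeepSpeed config do take precedence. When both sides define them, the DeepSpeed ones are used.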
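
The precedence observed in these tests can be summarized in a small helper. This only illustrates the observed behavior and is not how transformers implements it; `resolve_sources` is a hypothetical name.

```python
def resolve_sources(ds_config: dict) -> dict:
    """Report which side supplies the optimizer and the lr scheduler.

    Mirrors the observation above: a block present in the DeepSpeed config
    wins; otherwise the Hugging Face setting is used.
    """
    return {
        "optimizer": "deepspeed" if "optimizer" in ds_config else "huggingface",
        "lr_scheduler": "deepspeed" if "scheduler" in ds_config else "huggingface",
    }


# Both blocks defined in the ds config: DeepSpeed supplies both.
both = resolve_sources({"optimizer": {"type": "AdamW"}, "scheduler": {"type": "WarmupLR"}})

# Only the optimizer defined: the scheduler comes from --lr_scheduler_type.
optimizer_only = resolve_sources({"optimizer": {"type": "AdamW"}})
```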

Recommended usage: take the optimizer from DeepSpeed and the lr_scheduler from Hugging Face (cosine).
The cosine learning-rate curve:
[Image: cosine learning rate curve]
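
A sketch of the recommended combination, under stated assumptions: the DeepSpeed config keeps an optimizer block but has no scheduler block, and the scheduler is chosen on the Hugging Face side with `--lr_scheduler_type cosine`. The `cosine_with_warmup` function is a hypothetical stand-in modeled on the Hugging Face cosine schedule with its default half-cycle, and the step counts are illustrative.

```python
import math

# Assumed config: optimizer from DeepSpeed, no "scheduler" block, so the
# scheduler comes from the Hugging Face side.
recommended_ds_config = {
    "optimizer": {
        "type": "AdamW",
        "params": {"lr": "auto", "betas": "auto", "eps": "auto", "weight_decay": "auto"},
    },
}
cli_args = ["--lr_scheduler_type", "cosine", "--warmup_steps", "15"]


def cosine_with_warmup(step: int, warmup_steps: int, total_steps: int) -> float:
    """Multiplier on the base lr: linear warmup, then half-cosine decay to 0."""
    if step < warmup_steps:
        return step / max(1, warmup_steps)
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return max(0.0, 0.5 * (1.0 + math.cos(math.pi * progress)))


# Illustrative curve: 15 warmup steps out of 115 total (assumed values).
cosine_curve = [cosine_with_warmup(s, 15, 115) for s in range(116)]
```

The curve rises linearly during warmup, peaks at the base learning rate, and then falls smoothly toward zero, which is the shape shown in the plot above.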

PPOTrainer, RLOOTrainer

The corresponding trainers in trl do not support configuring the optimizer and lr_scheduler through DeepSpeed; use the ones provided by Hugging Face instead.
