LoRA: Low-Rank Adaptation of Large Language Models

HPC_C · Published 2025-08-30 15:21:49

Motivation

Large language models have an enormous number of parameters; GPT-3, for example, has 175B. If we adapt such a model to multiple downstream tasks with full fine-tuning, we have to retrain all of its parameters for every task, which wastes a great deal of computing resources. The paper therefore proposes Low-Rank Adaptation (LoRA), which freezes the pretrained model weights and injects trainable rank-decomposition matrices into each layer of the Transformer architecture.

Aren't the existing methods good enough?

There are currently two mainstream approaches. The first adds adapter layers to the model, which introduces inference latency. The other optimizes the prompt itself, which reserves part of the input sequence for adaptation and so reduces the length available for the actual task prompt.

Key contribution

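The key formula is the following:

$$h = W_0 x + \Delta W x = W_0 x + BAx$$

Here $W_0 \in \mathbb{R}^{d \times k}$ is the pretrained weight matrix, which stays frozen, and the weight update $\Delta W$ is decomposed into the product of two low-rank matrices, $B \in \mathbb{R}^{d \times r}$ and $A \in \mathbb{R}^{r \times k}$, with rank $r \ll \min(d, k)$. Only $A$ and $B$ are trained.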
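To make the decomposition concrete, here is a minimal sketch of a LoRA forward pass, h = W0 x + B(A x), in dependency-free Python. The sizes `d`, `k`, `r`, the helper names `matvec` and `lora_forward`, and the standard deviation used to initialize `A` are assumptions chosen for illustration. The scaling factor applied to the update in the paper is omitted for brevity, and a real implementation would use a tensor library rather than nested lists.

```python
import random

def matvec(M, v):
    """Multiply a matrix (list of rows) by a vector (list of numbers)."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]

def lora_forward(W0, A, B, x):
    """Compute h = W0 x + B (A x). W0 is frozen; only A and B would be trained."""
    base = matvec(W0, x)              # frozen pretrained path
    update = matvec(B, matvec(A, x))  # low-rank path: project down to r, then back up to d
    return [b + u for b, u in zip(base, update)]

random.seed(0)
d, k, r = 8, 8, 2  # hypothetical sizes, chosen only for illustration

W0 = [[random.gauss(0, 1) for _ in range(k)] for _ in range(d)]    # d x k, frozen
A = [[random.gauss(0, 0.01) for _ in range(k)] for _ in range(r)]  # r x k, random Gaussian init (std is an assumption)
B = [[0.0] * r for _ in range(d)]                                  # d x r, zero init, so BA = 0 at the start
x = [random.gauss(0, 1) for _ in range(k)]

h = lora_forward(W0, A, B, x)

full_params = d * k        # parameters updated by full fine-tuning of this matrix
lora_params = r * (d + k)  # parameters trained by LoRA for the same matrix
```

Because `B` starts at zero, `h` initially equals `W0 x`, so training begins from the pretrained model's behavior. The saving in trainable parameters grows as `r` becomes small relative to `d` and `k`.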

Benefits

1. Training is more efficient because it uses fewer computing resources.
2. It is easy to switch between tasks by swapping the BA part while keeping the pretrained weights frozen.
3. It introduces no inference latency, because BA can be merged into the pretrained weights before deployment.
4. LoRA is orthogonal to many other techniques and can be combined with them, such as prefix-tuning.
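Benefits 2 and 3 both follow from the fact that BA has the same shape as the pretrained weight matrix. The sketch below illustrates this with small integer matrices: merging BA into the weights gives the same output with a single matrix product, and switching tasks amounts to subtracting one BA and adding another. The helper names (`merge`, `unmerge`) and all the values are hypothetical, picked only so the arithmetic is easy to follow.

```python
def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*Y)] for row in X]

def matvec(M, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]

def merge(W, A, B):
    """Return W + BA, a single matrix usable at inference time."""
    BA = matmul(B, A)
    return [[w + u for w, u in zip(w_row, u_row)] for w_row, u_row in zip(W, BA)]

def unmerge(W, A, B):
    """Return W - BA, recovering the weights from before merge(W, A, B)."""
    BA = matmul(B, A)
    return [[w - u for w, u in zip(w_row, u_row)] for w_row, u_row in zip(W, BA)]

# Hypothetical frozen weights (4 x 3) and two rank-1 task adapters.
W0 = [[1, 0, 2], [0, 1, 0], [3, 0, 1], [0, 2, 0]]
B1, A1 = [[1], [0], [2], [1]], [[1, 2, 0]]  # task 1: B is 4 x 1, A is 1 x 3
B2, A2 = [[0], [1], [0], [1]], [[0, 1, 1]]  # task 2
x = [1, 1, 1]

# Unmerged: two paths, W0 x + B (A x).
h_task1 = [b + u for b, u in zip(matvec(W0, x), matvec(B1, matvec(A1, x)))]

# Merged: one matrix product, so no extra work at inference time.
W_task1 = merge(W0, A1, B1)
h_task1_merged = matvec(W_task1, x)

# Switch to task 2 by removing task 1's BA and adding task 2's.
W_task2 = merge(unmerge(W_task1, A1, B1), A2, B2)
h_task2_merged = matvec(W_task2, x)
h_task2 = [b + u for b, u in zip(matvec(W0, x), matvec(B2, matvec(A2, x)))]
```

With floating-point weights the merged and unmerged outputs would agree only up to rounding error; integers are used here so the two match exactly.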
