1. Install the package

conda create -n bge-m3 python=3.12
conda activate bge-m3
pip install -U FlagEmbedding
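
An optional sanity check that the package imports correctly (run inside the activated environment):

# Optional: verify the install by importing the model class used below.
from FlagEmbedding import BGEM3FlagModel
print("FlagEmbedding import OK")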

2. Dense embeddings

from FlagEmbedding import BGEM3FlagModel

model = BGEM3FlagModel('BAAI/bge-m3', use_fp16=True)  # fp16 speeds up encoding with a slight accuracy cost

sentences_1 = ["What is BGE M3?", "Defination of BM25"]
sentences_2 = ["BGE M3 is an embedding model supporting dense retrieval, lexical matching and multi-vector interaction.", 
               "BM25 is a bag-of-words retrieval function that ranks a set of documents based on the query terms appearing in each document"]

embeddings_1 = model.encode(sentences_1, batch_size=12, max_length=8192)['dense_vecs']
# encode() defaults: batch_size=256, max_length=512
embeddings_2 = model.encode(sentences_2)['dense_vecs']

# Compute sentence-pair similarity; the vectors are normalized, so this is cosine similarity
similarity = embeddings_1 @ embeddings_2.T
print(similarity)
# [[0.6259035  0.34749585]
#  [0.349868   0.6782462 ]]
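
The dense vectors come back L2-normalized, which is why the plain matrix product above already equals cosine similarity. A small sketch to verify the norms and rank the candidates per query (the ranking loop is illustrative, not part of the FlagEmbedding API):

import numpy as np

# Norms should all be ~1.0, confirming the vectors are L2-normalized.
print(np.linalg.norm(embeddings_1, axis=1))

# Illustrative ranking: pick the best-scoring candidate for each query.
for i, query in enumerate(sentences_1):
    best = int(similarity[i].argmax())
    print(f"{query!r} -> {sentences_2[best][:40]!r} (score={similarity[i, best]:.4f})")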

3. Sparse embeddings (lexical weights)

from FlagEmbedding import BGEM3FlagModel

model = BGEM3FlagModel('BAAI/bge-m3', use_fp16=True)

sentences_1 = ["What is BGE M3?", "Defination of BM25"]
#sentences_1 = ["What is embedding model BGE M3?", "Defination of BM25"]
sentences_2 = ["BGE M3 is an embedding model supporting dense retrieval, lexical matching and multi-vector interaction.", 
               "BM25 is a bag-of-words retrieval function that ranks a set of documents based on the query terms appearing in each document"]

# return_dense=True          return dense vectors
# return_sparse=True         return sparse lexical weights
# return_colbert_vecs=False  disable multi-vector (ColBERT) output
# encode() defaults: batch_size=256, max_length=512
output_1 = model.encode(sentences_1, return_dense=True, return_sparse=True, return_colbert_vecs=False)
output_2 = model.encode(sentences_2, return_dense=True, return_sparse=True, return_colbert_vecs=False)

# Per-token lexical weights (token -> weight)
print(model.convert_id_to_token(output_1['lexical_weights']))
# [{'What': 0.08356, 'is': 0.0814, 'B': 0.1296, 'GE': 0.252, 'M': 0.1702, '3': 0.2695, '?': 0.04092}, 
#  {'De': 0.05005, 'fin': 0.1368, 'ation': 0.04498, 'of': 0.0633, 'BM': 0.2515, '25': 0.3335}]

# Lexical matching score over tokens shared by the two texts
lexical_scores = model.compute_lexical_matching_score(output_1['lexical_weights'][0], output_2['lexical_weights'][0])
print(lexical_scores)
# 0.19554451  (with the original sentences_1)
# 0.2668      (with the alternate sentences_1 commented out above)
print(model.compute_lexical_matching_score(output_1['lexical_weights'][0], output_1['lexical_weights'][1]))
# 0.0
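
Per the BGE-M3 paper, the lexical matching score is simply the sum, over tokens appearing in both texts, of the products of their weights. A minimal reimplementation for illustration (lexical_score is a hypothetical helper, not a FlagEmbedding API):

# Hypothetical helper mirroring the paper's definition of the lexical score:
# sum of weight products over tokens shared by both texts.
def lexical_score(w1, w2):
    return sum(float(w1[t]) * float(w2[t]) for t in w1.keys() & w2.keys())

print(lexical_score(output_1['lexical_weights'][0], output_2['lexical_weights'][0]))
# should be close to the compute_lexical_matching_score value above (~0.1955)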

4. Multi-vector (ColBERT) embeddings

from FlagEmbedding import BGEM3FlagModel

model = BGEM3FlagModel('BAAI/bge-m3', use_fp16=True)

sentences_1 = ["What is BGE M3?", "Defination of BM25"]
sentences_2 = ["BGE M3 is an embedding model supporting dense retrieval, lexical matching and multi-vector interaction.", 
               "BM25 is a bag-of-words retrieval function that ranks a set of documents based on the query terms appearing in each document"]

output_1 = model.encode(sentences_1, return_dense=True, return_sparse=True, return_colbert_vecs=True)
output_2 = model.encode(sentences_2, return_dense=True, return_sparse=True, return_colbert_vecs=True)

# Semantic relevance via ColBERT late interaction (MaxSim)
print(model.colbert_score(output_1['colbert_vecs'][0], output_2['colbert_vecs'][0]))
print(model.colbert_score(output_1['colbert_vecs'][0], output_2['colbert_vecs'][1]))
# 0.7796
# 0.4621
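
colbert_score implements ColBERT-style late interaction (MaxSim): each query token vector is matched against its best-scoring passage token vector, and the maxima are averaged over the query tokens. A numpy sketch of that computation (maxsim is a hypothetical helper shown for illustration):

import numpy as np

# Hypothetical MaxSim helper: max dot product over passage tokens for each
# query token, averaged over the query tokens.
def maxsim(q_vecs, p_vecs):
    sim = q_vecs @ p_vecs.T               # (num_q_tokens, num_p_tokens)
    return float(sim.max(axis=1).mean())  # best passage match per query token

print(maxsim(output_1['colbert_vecs'][0], output_2['colbert_vecs'][0]))
# should be close to model.colbert_score above (~0.7796)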

5. Hybrid scoring with weighted modes

from FlagEmbedding import BGEM3FlagModel

model = BGEM3FlagModel('BAAI/bge-m3', use_fp16=False)  # fp32 here; scores differ slightly from the fp16 runs above

# Original query from the earlier sections (gives the first result dict below):
# sentences_1 = ["What is BGE M3?", "Defination of BM25"]
sentences_1 = ["What is the embedding BGE M3?", "Defination of BM25"]
sentences_2 = ["BGE M3 is an embedding model supporting dense retrieval, lexical matching and multi-vector interaction.", 
               "BM25 is a bag-of-words retrieval function that ranks a set of documents based on the query terms appearing in each document"]

sentence_pairs = [[i, j] for i in sentences_1 for j in sentences_2]  # all four query-passage pairs

print(model.compute_score(sentence_pairs,
                          max_passage_length=128,  # a smaller max length lowers latency
                          weights_for_different_modes=[0.4, 0.2, 0.4]))
# weights_for_different_modes (w) gives the weighted sum:
# w[0]*dense_score + w[1]*sparse_score + w[2]*colbert_score

# Output for the original sentences_1 (matches the fp16 runs in the sections above):
# {
#   'colbert': [0.7796499729156494, 0.4621465802192688, 0.4523794651031494, 0.7898575067520142],
#   'sparse': [0.195556640625, 0.00879669189453125, 0.0, 0.1802978515625],
#   'dense': [0.6259765625, 0.347412109375, 0.349853515625, 0.67822265625],
#   'sparse+dense': [0.482503205537796, 0.23454029858112335, 0.2332356721162796, 0.5122477412223816],
#   'colbert+sparse+dense': [0.6013619303703308, 0.3255828022956848, 0.32089319825172424, 0.6232916116714478]
# }

# Output for the alternate sentences_1 as written above, with use_fp16=False:
# {
#   'colbert': [0.7796695828437805, 0.4162615239620209, 0.45241814851760864, 0.7898669838905334],
#   'sparse': [0.22662657499313354, 0.0025286339223384857, 0.0, 0.18041232228279114],
#   'dense': [0.6880049109458923, 0.3501098155975342, 0.3498679995536804, 0.678246259689331],
#   'sparse+dense': [0.5342121124267578, 0.23424942791461945, 0.2332453429698944, 0.5123016238212585],
#   'colbert+sparse+dense': [0.6323951482772827, 0.3070542812347412, 0.32091444730758667, 0.6233277320861816]
# }
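
The combined entries in the dict can be reproduced from the per-mode lists. Judging by the 'sparse+dense' values above, compute_score normalizes by the sum of the weights used, so with weights summing to 1.0 a plain weighted sum matches 'colbert+sparse+dense' (a small sketch reusing the same call):

# Recompute the hybrid score by hand from the per-mode scores.
scores = model.compute_score(sentence_pairs,
                             max_passage_length=128,
                             weights_for_different_modes=[0.4, 0.2, 0.4])
w = [0.4, 0.2, 0.4]  # weights sum to 1.0, so no extra normalization is needed
hybrid = [w[0] * d + w[1] * s + w[2] * c
          for d, s, c in zip(scores['dense'], scores['sparse'], scores['colbert'])]
print(hybrid)  # should match scores['colbert+sparse+dense']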
