融合两个模型的权重 - 权重平均

model_list = ['./ckpt/rexnetv2_live_labelsmooth_plus65_eh3_94.77.pt.tar','./ckpt/rexnetv2_live_labelsmooth_plus65_eh5_96.36.pt.tar',]# 融合两个模型，模型一的backbone + 模型二的全连接层def integration2(model_list, fl_

梦坠凡尘

4135人浏览 · 2021-08-31 17:43:14

梦坠凡尘 · 2021-08-31 17:43:14 发布

model_list = [
                  './xxx.pt.tar',
                  './yyy.pt.tar',
                  ]
# 融合两个模型，模型一的backbone + 模型二的全连接层
def integration2(model_list, fl_model):

    worker_state_dict=[torch.load(x, map_location='cpu') if x.endswith('pt') else torch.load(x, map_location='cpu')['state_dict'] for x in model_list]
    print(worker_state_dict[0].keys())
    weight_keys=list(worker_state_dict[0].keys()) # ['features.0.weight', 'features.1.weight', 'features.1.bias'.....'output.1.weight', 'output.1.bias']
    fed_state_dict=collections.OrderedDict()
    for key in weight_keys:
        print('key is {}'.format(key))
        key_sum = 0
        for i in range(len(model_list)):
            key_sum += worker_state_dict[i][key]
        fed_state_dict[key] = key_sum /float(len(model_list))
    fl_model.load_state_dict(fed_state_dict)  # 融合后的模型
    return fl_model

2048 AI社区

有“AI”的1024 = 2048，欢迎大家加入2048 AI社区

更多推荐

大模型赋能：模型字段别名的智能生成之旅

这不仅提高了数据分析的效率，还降低了用户使用数据分析平台的门槛，使得更多的业务人员能够参与到数据分析中来，为企业的决策提供更广泛的支持。而且，大模型生成的别名具有高度的一致性，避免了人工生成别名时可能出现的不一致问题，确保了数据仓库中数据的规范性和统一性。这不仅可以大大提高别名生成的效率，减轻数据开发人员的工作负担，还能有效提升别名的一致性和准确性，为数据治理工作注入新的活力。大模型技术的出现，为