计算机科学与探索 ›› 2023, Vol. 17 ›› Issue (9): 2252-2264.DOI: 10.3778/j.issn.1673-9418.2207106

• 大数据技术 • 上一篇    

MaSS:基于单位数据贡献的模型定价框架

张小伟,江东,袁野,安丽霞,王国仁   

  1. 1. 东北大学 计算机科学与工程学院,沈阳 110819
    2. 北京理工大学 计算机学院,北京 100081
  • 出版日期:2023-09-01 发布日期:2023-09-01

MaSS: Model Pricing Marketplace Based on Unit Data Contribution

ZHANG Xiaowei, JIANG Dong, YUAN Ye, AN Lixia, WANG Guoren   

  1. 1. School of Computer Science and Engineering, Northeastern University, Shenyang 110819, China
    2. School of Computer Science & Technology, Beijing Institute of Technology, Beijing 100081, China
  • Online:2023-09-01 Published:2023-09-01

摘要: 数据驱动型的机器学习模型已经是大势所趋,但是对于如何促进机器学习模型市场发展的研究还为之甚少。现有的研究主要分为两方面:一方面是数据拥有者和中间商之间的交互,即数据所有者的补偿问题;另一方面是模型买家与中间商之间的交互,即模型的定价问题。但是对于模型交易市场而言,这两个问题是密不可分的。因此针对这个问题,提出了新的模型定价框架MaSS。MaSS采用了新的效用评估指标Shapley值,然后基于该指标提出了三阶段的Stackelberg博弈的模型交易框架,其中包含了三方参与人:模型买家、中间人、数据拥有者。数据拥有者提供带有噪音的数据给中间人。中间人进一步将数据加工成模型买家需要的模型,向模型买家提供模型从而获利。他们之间相互作用形成一个Stackelberg博弈,以使得参与交易的所有人实现收益最大化,并从理论上证明了均衡定价策略存在的唯一性。最后,通过对真实数据大量模拟,证明了其显著的性能。

关键词: 数据定价, 模型定价, Shapley值, Stackelberg博弈

Abstract: Data-driven machine learning models have become ubiquitous. However, there is still very little research on how to promote the market development of the machine learning model. The existing research is mainly divided into two aspects, one is the interaction between the data owners and the brokers, that is, the compensation of the data owners. Another is the interaction between model buyers and brokers, that is, the expense of the model buyers. But for the model market, these issues are indivisible. Therefore, this paper takes a formal data marketplace perspective and proposes the novel model marketplace based on three-stage hierarchical Stackelberg game and Shapley value (MaSS). MaSS adopts a new utility evaluation index, Shapley value. And then this paper proposes a model trading framework of three-stage Stackelberg game based on Shapley value, including three-party parti-cipants: model buyers, brokers and data owners. The data owners provide the broker with private data. The brokers will further process the data into the models needed by the model buyers, and provide the model for the model buyer for profit. They interact with each other to form a Stackelberg game to maximize the profits of all involved in the transaction. And the uniqueness of the existence of equilibrium pricing strategy is proven theoretically. Finally, its remarkable performance is demonstrated by extensive simulations on real data.

Key words: data pricing, model pricing, Shapley value, Stackelberg game