Computer Integrated Manufacturing Systems ›› 2018, Vol. 24 ›› Issue (7): 1806-1815. DOI: 10.13196/j.cims.2018.07.022

Similarity computation method of user behavior supporting activity semantic measurement

林泽东1, 曾庆田1, 段华2+, 鲁法明1, 邹杰3

  1. College of Computer Science and Engineering, Shandong University of Science and Technology
  2. College of Mathematics and Systems Science, Shandong University of Science and Technology
  3. Research Institute of Highway, Ministry of Transport
  • Online: 2018-07-31 Published: 2018-07-31
  • Supported by:
    Project supported by the National Natural Science Foundation, China (No. 71704096, 61602278, 61602279, 61472229, 31671588), the Science & Technology Development Fund of Shandong Province, China (No. 2014GGX101035, 2016ZDJS02A11), the Shandong Provincial Natural Science Foundation, China (No. BS2014DX013, ZR2015FM013, ZR2017MF027), the Fund of Oceanic Telemetry Engineering and Technology Research Center, State Oceanic Administration, China (No. 2018002), the 2016 Key Projects of Institute of Highway Science of Ministry of Transportation, China (No. 2015-9024, 2016-9027), the Shandong Provincial Postdoctoral Innovation Project, China (No. 201603056), the SDUST Research Fund, China (No. 2015TDJH102), and the Humanities and Social Science Foundation of MOE, China (No. 16YJCZH012).

Abstract: Existing user behavior similarity measures based on activity sequences do not take the semantic similarity of activities into account. To address this, a user behavior similarity computation method that supports activity semantic measurement is proposed. First, the similarity between activities is calculated by combining their adjacency relations with the text semantics of their labels. Second, an activity edit weight function and an activity sequence distance are defined. Finally, user behavior is modeled as a multiset of activity sequences, and the similarity of user behavior is calculated with the Earth Mover's Distance (EMD). The feasibility and effectiveness of the proposed method are verified by comparison with current mainstream algorithms in terms of the satisfaction of metric properties and experimental evaluation on real-world data sets.

Key words: user behavior similarity, text semantic similarity, similarity measure, earth mover's distance
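
The pipeline outlined in the abstract (activity similarity, a weighted edit distance between activity sequences, and EMD between multisets of sequences) can be illustrated with a minimal Python sketch. It is not the authors' implementation, and several parts are assumptions made only for illustration: the similarity table `TOY_SIM` stands in for the adjacency-plus-label-semantics similarity, the edit weights (cost 1 for insert/delete, `1 - similarity` for substitution) and the normalization by the longer sequence are guesses at a plausible edit weight function, and the EMD is solved with a small successive-shortest-path min-cost flow rather than whatever solver the paper uses.

```python
from collections import Counter
from math import inf

# Hypothetical similarity table (assumption). The paper derives activity
# similarity from adjacency relations and label text semantics instead.
TOY_SIM = {frozenset(("pay by card", "pay by cash")): 0.5}


def activity_sim(a, b):
    """Toy activity similarity in [0, 1]."""
    if a == b:
        return 1.0
    return TOY_SIM.get(frozenset((a, b)), 0.0)


def seq_distance(s, t):
    """Weighted edit distance, normalized by the longer sequence (assumed weights).

    Insert/delete cost 1; substituting a by b costs 1 - activity_sim(a, b).
    """
    n, m = len(s), len(t)
    if max(n, m) == 0:
        return 0.0
    d = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        d[i][0] = float(i)
    for j in range(1, m + 1):
        d[0][j] = float(j)
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = d[i - 1][j - 1] + 1.0 - activity_sim(s[i - 1], t[j - 1])
            d[i][j] = min(d[i - 1][j] + 1.0, d[i][j - 1] + 1.0, sub)
    return d[n][m] / max(n, m)


def emd(p, q):
    """EMD between two non-empty multisets (Counters) of activity sequences."""
    xs, ys = list(p), list(q)
    n, m = sum(p.values()), sum(q.values())
    src, dst = 0, len(xs) + len(ys) + 1
    graph = [[] for _ in range(dst + 1)]  # residual graph: [to, cap, cost, rev]

    def add_edge(u, v, cap, cost):
        graph[u].append([v, cap, cost, len(graph[v])])
        graph[v].append([u, 0, -cost, len(graph[u]) - 1])

    # Scale both sides to integer total mass n * m so each user weighs the same.
    for i, x in enumerate(xs):
        add_edge(src, 1 + i, p[x] * m, 0.0)
        for j, y in enumerate(ys):
            add_edge(1 + i, 1 + len(xs) + j, n * m, seq_distance(x, y))
    for j, y in enumerate(ys):
        add_edge(1 + len(xs) + j, dst, q[y] * n, 0.0)

    total = 0.0
    while True:
        # Bellman-Ford shortest path in the residual graph.
        dist = [inf] * len(graph)
        prev = [None] * len(graph)
        dist[src] = 0.0
        for _ in range(len(graph) - 1):
            changed = False
            for u, edges in enumerate(graph):
                for k, (v, cap, cost, _rev) in enumerate(edges):
                    if cap > 0 and dist[u] + cost < dist[v] - 1e-12:
                        dist[v], prev[v], changed = dist[u] + cost, (u, k), True
            if not changed:
                break
        if prev[dst] is None:  # no augmenting path left: all mass is moved
            return total / (n * m)
        flow, v = inf, dst
        while v != src:  # bottleneck capacity along the path
            u, k = prev[v]
            flow, v = min(flow, graph[u][k][1]), u
        v = dst
        while v != src:  # push flow and update reverse edges
            u, k = prev[v]
            graph[u][k][1] -= flow
            graph[v][graph[u][k][3]][1] += flow
            v = u
        total += flow * dist[dst]


def behavior_similarity(p, q):
    """Similarity of two users, each a multiset of activity sequences."""
    return 1.0 - emd(p, q)
```

Each user is represented as a `Counter` mapping activity sequences (tuples of activity labels) to their frequencies, for example `Counter({("login", "search", "pay by card"): 2, ("login", "search"): 1})`. Because the sequence distance in this sketch is normalized to [0, 1], the resulting EMD and similarity also fall in [0, 1].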
