Page 197 - 《软件学报》2024年第4期

P. 197

管泽礼等: 基于强化联邦 GNN 的个性化公共安全突发事件检测 1775

model through data centralization. Moreover, due to different focuses and collection times of the data from various parties, there is a
Non-IID (independent and identically distributed) problem among the data. Traditional methods that assume a global model can
accommodate all clients are challenging to solve such issues. Therefore, this study proposes personalized public safety event detection
(PPSED) method based on a reinforcement federated graph neural network. In this method, each client trains a personalized and more
robust model through multi-party collaboration to solve local event detection tasks. A local training and gradient quantization module is
designed for the federated public safety emergency event detection model and trained GraphSage through a minibatch mechanism based
on graph sampling to construct a local model for public safety event detection. This approach reduces the impact of Non-IID data and
supports the gradient quantization method to lower the consumption of gradient communication. A client state awareness module is also
designed based on random graph embedding, which better retains the valuable information of the client model while protecting privacy.
Furthermore, a personalized gradient aggregation and quantization strategy are designed for the federated graph neural network. Deep
deterministic policy gradient (DDPG) is used to fit a personalized federated learning gradient aggregation weighting strategy, and it is
determined whether the gradient can be quantized based on the weight, balancing the model's performance, and communication pressure.
This study demonstrated the effectiveness of the method through extensive experiments on a public safety dataset collected from the
Weibo platform and three public graph datasets.
Key words: federated learning; graph neural network (GNN); public safety; event detection

公共安全突发事件是指在一段时间内与公共安全相关的事件爆发并迅速传播, 引起公众广泛关注的现
象.对这类事件进行及时检测和响应具有重要的意义, 可以帮助决策者更好地管理危机和做出决策 [1,2] . 近年
来,随着社交媒体的兴起, 公共安全突发事件的检测和演化发现已成为社交媒体挖掘的研究热点, 受到了学术
[3]
界和工业界的广泛关注 . 相比传统的文本挖掘或社会网络挖掘, 公共安全突发事件检测任务更具挑战性,
因为它涉及社交网络和文本流的复杂交互. 在社交媒体平台如微博、Twitter 上, 公共安全事件通常以短文本
的形式描述, 并通过时空共现、主题、发布信息、转发关系和标签信息等多个维度进行关联构建. 将公共安
全数据转化为图的形式, 进一步进行事件检测和演化发现, 已成为主流方法 [4−6] .
图神经网络(GNN)已经成为机器学习领域的热门研究方向之一. 与传统的神经网络不同, 图神经网络是
专门处理图数据的神经网络模型, 可以同时利用数据的特征信息与结构信息 [7−9] . 图数据中每个节点代表一
个实体, 每个边代表两个实体之间的关系, 如社交网络中的用户之间的关联、药物分子中的分子结构等 [10,11] .
与只关注结构信息传统的图模型相比 [12,13] , 图神经网络具有更好的表示能力和泛化性能. 通过特征的映射抽
取有用的特征, 并根据图结构聚合节点的邻域特征信息, 构造节点表示并应用于下游任务的训练和推理 [7,14] .
图神经网络已被广泛应用于社交网络分析 [6,15] 、交通预测 [16] 、药物结构预测 [17] 、推荐系统 [18,19] 、查询检索 [20,21]
等. 随着大数据的发展, 每天都会产生大量的原始公共安全数据, 高质量的数据可以提高模型的有效性. 但
是, 大量数据缺少标注信息, 而人工标注成本高、时间长、效率低, 需要训练模型对图数据进行自动分析 [22,23] .
由于高质量的数据通常归属于政府、公司和组织, 由于隐私、法规和利益的原因, 这些数据不能在各方之间
自由流动 [24,25] , 很难通过数据集中的方式学习到有效的事件检测模型.
联邦学习是一种分布式机器学习方法, 它可以在数据不出本地的情况下, 采用多方协作的方式共同训练
模型. 联邦学习不需要集中数据, 在一定程度上保护了数据的隐私, 还可以减少数据传输和存储的成本 [26] .
在实际应用中, 拥有公共安全数据的各方由于关注的主题与任务不同, 导致各方数据是非独立同分布(Non-
IID)的, 每个客户端中的数据都只有部分的标签. 与传统的数据 Non-IID 体现在标签分布不均匀不同, 这种
Non-IID 会同时体现在标签与图结构上. 然而, 在联邦设置中, 训练图神经网络仍然存在联邦图神经网络在
Non-IID 设置中表现不佳的问题 [27−29] , 其原因是因为错误地假设一个全局模型可以适合所有客户端 [30] . 为了
让各客户端可以在数据不出本地的条件下, 利用各方数据学习适用于本地任务的模型, 研究人员提出了个性
化联邦学习, 允许各客户端采用差异化聚合策略, 将其他客户端的模型参数或梯度聚合到本地, 构建个性化
的模型完成本地任务. 现有方法主要基于注意力机制 [28,30] 、微调全局模型 [31] 和正则化 [32] , 用于突发事件检测
的个性化联邦图神经网络的研究存在通信量大、性能与通信压力难以平衡的问题.
强化学习可以根据环境与状态学习一个最优的动作策略, 已经广泛应用到机器人控制 [33] 、图神经网络节
点选择 [34] 、自然语言处理 [35] 等任务中. 在强化学习中, 智能体通过与环境进行交互来学习动作策略, 通过尝

192 193 194 195 196 197 198 199 200 201 202