import os
import copy
import torch
import deepsnap
import numpy as np
import torch.nn as nn
import torch.nn.functional as F
import torch_geometric.nn as pyg_nn
from sklearn.metrics import f1_score
from deepsnap.hetero_gnn import forward_op
from deepsnap.hetero_graph import HeteroGraph
from torch_sparse import SparseTensor, matmul
class HeteroGNNConv(pyg_nn.MessagePassing):
def __init__(self, in_channels_src, in_channels_dst, out_channels):
super(HeteroGNNConv, self).__init__(aggr="mean")
        self.in_channels_src = in_channels_src  # two input feature sizes (src and dst node types) map to a single output size, matching the two node-type inputs described above
self.in_channels_dst = in_channels_dst
self.out_channels = out_channels
        # To simplify the implementation, initialize out_features of both
        # self.lin_dst and self.lin_src to out_channels.
############# Your code here #############
## (~3 lines of code)
## Note:
## 1. Initialize the 3 linear layers.
## 2. Think through the connection between the mathematical
## definition of the update rule and torch linear layers!
self.lin_dst = nn.Linear(in_channels_dst, out_channels) # W_d^{(l)[m]}
self.lin_src = nn.Linear(in_channels_src, out_channels) # W_s^{(l)[m]}
        self.lin_update = nn.Linear(out_channels * 2, out_channels)  # W^{(l)[m]}; no convolution is involved, only linear layers
##########################################
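        # For reference, the three layers line up one-to-one with a standard
        # per-message-type update rule (notation assumed from the accompanying
        # write-up, not fixed by this code):
        #     h_v^{(l)[m]} = W^{(l)[m]} CONCAT(
        #         W_d^{(l)[m]} h_v^{(l-1)},                      # self.lin_dst
        #         W_s^{(l)[m]} AGG({h_u^{(l-1)} : u in N_m(v)})  # self.lin_src
        #     )                                                  # self.lin_update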
def forward(
self,
node_feature_src,
node_feature_dst,
edge_index,
size=None
):
############# Your code here #############
## (~1 line of code)
## Note:
## 1. Unlike Colabs 3 and 4, we just need to call self.propagate with
## proper/custom arguments.
        return self.propagate(edge_index, size=size,  # start message passing
                              node_feature_src=node_feature_src,
                              node_feature_dst=node_feature_dst)
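        # Note (PyG behavior): propagate routes extra keyword arguments by
        # parameter name, so node_feature_src reaches message_and_aggregate and
        # node_feature_dst reaches update; no per-edge x_i/x_j tensors are built.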
    def message_and_aggregate(self, edge_index, node_feature_src):
        ############# Your code here #############
        ## (~1 line of code)
        ## Note:
        ## 1. Different from what we implemented in Colabs 3 and 4, we use message_and_aggregate
        ##    to combine the previously separate message and aggregate functions.
        ##    The benefit is that we can avoid materializing x_i and x_j
        ##    to make the implementation more efficient.
        ## 2. To implement efficiently, refer to the PyG documentation for message_and_aggregate
        ##    and sparse-matrix multiplication:
        ##    https://pytorch-geometric.readthedocs.io/en/latest/notes/sparse_tensor.html
        ## 3. Here edge_index is a torch_sparse SparseTensor. Although interesting, you
        ##    do not need to deeply understand SparseTensor representations!
        ## 4. Conceptually, think through how the message passing and aggregation
        ##    expressed mathematically can be expressed through matrix multiplication.
        out = matmul(edge_index, node_feature_src, reduce=self.aggr)
        ##########################################
        return out
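    # A minimal shape sketch (sizes here are assumed for illustration): if
    # edge_index is a SparseTensor adjacency of shape [num_dst, num_src] and
    # node_feature_src is [num_src, in_channels_src], then
    #     matmul(edge_index, node_feature_src, reduce="mean")
    # yields a [num_dst, in_channels_src] tensor whose row v is the mean of the
    # src features of v's in-neighbors, i.e. message passing plus mean
    # aggregation performed as a single sparse-dense product.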
def update(self, aggr_out, node_feature_dst):
############# Your code here #############
## (~4 lines of code)
## Note:
## 1. The update function is called after message_and_aggregate
## 2. Think through the one-one connection between the mathematical update
## rule and the 3 linear layers defined in the constructor.
aggr_out = self.lin_src(aggr_out)
node_feature_dst = self.lin_dst(node_feature_dst)
        concat_features = torch.cat((node_feature_dst, aggr_out), dim=-1)
        # dim=-1 is the feature dimension (dim 1 for these 2-D tensors)
        aggr_out = self.lin_update(concat_features)
return aggr_out
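    # Shape walk-through (hidden size assumed purely for illustration): with
    # out_channels = 64, lin_src(aggr_out) and lin_dst(node_feature_dst) are
    # each [num_dst, 64]; concatenating on dim=-1 gives [num_dst, 128], which
    # lin_update maps back to [num_dst, 64].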
class HeteroGNNWrapperConv(deepsnap.hetero_gnn.HeteroConv):
def __init__(self, convs, args, aggr="mean"):
super(HeteroGNNWrapperConv, self).__init__(convs, None)
self.aggr = aggr
# Map the index and message type
self.mapping = {}
# A numpy array that stores the final attention probability
self.alpha = None
self.attn_proj = None
if self.aggr == "attn":
############# Your code here #############
## (~1 line of code)
## Note:
## 1. Initialize self.attn_proj, where self.attn_proj should include
## two linear layers. Note, make sure you understand
## which part of the equation self.attn_proj captures.
## 2. You should use nn.Sequential for self.attn_proj
## 3. nn.Linear and nn.Tanh are useful.
## 4. You can model a weight vector (rather than matrix) by using:
## nn.Linear(some_size, 1, bias=False).
## 5. The first linear layer should have out_features as args['attn_size']
## 6. You can assume we only have one "head" for the attention.
            ## 7. We recommend implementing the mean aggregation first. Once
            ##    mean aggregation works well in training, you can
            ##    implement this part.
self.attn_proj = nn.Sequential(
nn.Linear(args['hidden_size'], args['attn_size']),
nn.Tanh(), #https://pytorch.org/docs/stable/generated/torch.nn.Tanh.html#torch.nn.Tanh
nn.Linear(args['attn_size'], 1, bias=False),
)
##########################################
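            # For reference, this module computes semantic-level attention
            # scores over message types in the style of HAN (the exact equation
            # is assumed from the accompanying write-up):
            #     e_m = mean_v  q^T tanh(W h_v^{[m]} + b)
            #     alpha_m = softmax(e_m)  over message types m
            # nn.Linear(hidden_size, attn_size) + nn.Tanh computes tanh(W h + b),
            # and the bias-free nn.Linear(attn_size, 1) plays the role of q.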
    def reset_parameters(self):
        super(HeteroGNNWrapperConv, self).reset_parameters()
        if self.aggr == "attn":
            for layer in self.attn_proj.children():
                if hasattr(layer, "reset_parameters"):
                    layer.reset_parameters()  # skip nn.Tanh, which has no parameters
def forward(self, node_features, edge_indices):
message_type_emb = {}
for message_key, message_type in edge_indices.items():
src_type, edge_type, dst_type = message_key
node_feature_src = node_features[src_type]
node_feature_dst = node_features[dst_type]
edge_index = edge_indices[message_key]
message_type_emb[message_key] = (
                self.convs[message_key](  # one HeteroGNNConv per message type, e.g. {('paper', 'author', 'paper'): HeteroGNNConv(...), ('paper', 'subject', 'paper'): HeteroGNNConv(...)}
node_feature_src,
node_feature_dst,
edge_index,
)
)
node_emb = {dst: [] for _, _, dst in message_type_emb.keys()}
mapping = {}
for (src, edge_type, dst), item in message_type_emb.items():
mapping[len(node_emb[dst])] = (src, edge_type, dst)
node_emb[dst].append(item)
self.mapping = mapping
for node_type, embs in node_emb.items():
if len(embs) == 1:
node_emb[node_type] = embs[0]
else:
node_emb[node_type] = self.aggregate(embs)
return node_emb
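    # Sketch of expected usage (message types and sizes are illustrative, not
    # fixed by this class):
    #     convs = {
    #         ('paper', 'author', 'paper'): HeteroGNNConv(64, 64, 64),
    #         ('paper', 'subject', 'paper'): HeteroGNNConv(64, 64, 64),
    #     }
    #     wrapper = HeteroGNNWrapperConv(convs, args={'hidden_size': 64, 'attn_size': 32}, aggr="attn")
    #     node_emb = wrapper(node_features, edge_indices)
    # where node_features and edge_indices are dicts keyed by node type and
    # message type respectively, as iterated over in forward above.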
def aggregate(self, xs):
# TODO: Implement this function that aggregates all message type results.
# Here, xs is a list of tensors (embeddings) with respect to message
# type aggregation results.
        ## Note:
        ## 1. Explore the function parameter `xs`!
        if self.aggr == "mean":
            # torch.stack joins same-shaped tensors along a new trailing
            # dimension (e.g. torch.Size([3025, 64, 2])); averaging over that
            # dimension means over message types.
            x = torch.stack(xs, dim=-1)
            return x.mean(dim=-1)
elif self.aggr == "attn":
N = xs[0].shape[0] # Number of nodes for that node type
M = len(xs) # Number of message types for that node type