Code for WWW 2023 paper "Graph Neural Networks with Diverse Spectral Filtering" (arXiv & video & ppt & speech)
Figure 1: (a)-(c) Diverse filters learned from real-world networks, where five representative curves are plotted for illustration. On each graph, these filters display similar overall shapes but different local details in function curves, showing the capability of our DSF in capturing both the global graph structure and locally varied linking patterns. (d) Visualization of node-specific filter weights on Cornell dataset, where alike color indicates similar filter weights between nodes. Overall, nodes can be differentiated based on their disjoint underlying regions as circled by the blue and green dashed lines, and far-reaching nodes can still learn similar filter weights due to their akin local structures. E.g., vertices on the graph border are mostly ingrained in a line subgraph such as • − • − •, and some unusual cases can be handled (see details in Section 5.4). These results justify the enhanced model interpretability by learning diverse spectral filters on the micro level.
- pytorch==1.8.0
- pytorch_geometric==2.0.1
- dgl==0.7.1
- networkx==2.5.1
- seaborn==0.11.2
- sklearn==0.24.2
- scipy==1.5.3
- numpy==1.19.5
- optuna==2.9.1
We use the following 11 benchmark datasets in our experiments.
- Six heterophilic graphs: Chameleon and Squirrel are two wikipedia networks where web pages are connected by mutual links. Each web page has some keywords as features and is classified into five categories; Wisconsin, Cornell, and Texas are three webpage datasets collected by Carnegie Mellon University, where nodes are web pages classified into five classes, and edges correspond to hyperlinks. The bag-of-word representations of web pages are taken as node features; Twtich-DE is a social network where nodes, edges, and labels respectively represent twitch users, mutual friendship, and whether a streamer uses explicit language or not. Node features encode users’ information in streaming habits, game preference, and location.
- Five homophilic graphs: Cora, Citeseer, and Pubmed are three widely used citation networks with strong homophily, where nodes are scientific papers, edges denote undirected citations, and each node is assigned with one topic as well as bag-of-word features; Computers and Photo are two Amazon co-purchase graphs. Nodes are goods connected by an edge if they are frequently bought together. The product reviews are encoded into the bag-of-words to be node features, and the product category corresponds to the class label.
As extensive experiments with different base models over various datasets need be conducted, we tune our hyper-parameters using Optuna for 200 trails with a broad searching space defined as
- learning rate
$\sim$ [1e-4, 1e-1] - weight decay
$\sim$ [5e-8, 1e-2] - dropout
$\sim$ {0, 0.1, ..., 0.8} by 0.1 - iterative optimization coefficients
$\eta_1, \eta_2 \sim$ {0.1, 0.2, ..., 1.0} by 0.1 - orthogonal regularization parameter
$\lambda_\text{orth} \sim$ [1e-2, 1] - the number of raw positional features
$f_p \sim$ {2, 4, ..., 32} by 2 - the initializing methods for node positional embeddings
$\sim$ {LapPE, RWPE}.
Running the below script for node positional encoding (the encoded embeddings have been saved in the folder './data/node_pos_enc').
python posEnc.py
Running the below script for tuning hyper-parameters of DSF on node classification tasks.
python hpmOpt.py
or running the below script for node classification.
python main.py
@inproceedings{guo2023graph,
title={Graph Neural Networks with Diverse Spectral Filtering},
author={Guo, Jingwei and Huang, Kaizhu and Yi, Xinping and Zhang, Rui},
booktitle={Proceedings of the ACM Web Conference 2023},
pages={306--316},
year={2023}
}