Graph: GraphRNN

2020-12-12 412 words 2 minutes

Contents

Why is it interesting

Large and variable output
Non-unique representations
- $n$-node graph can be represented in $n!$ ways
- Hard to compute/optimize objective functions
Complex dependencies
- edge fprmation has long-range dependencies

Given: Graphs sampled from $p_{data}(G)$
Goal:

Setup:

Assume we try to learn a generative model from a set of points (i.e., graphs) $\lbrace x_i \rbrace$
- $p_{data}(x)$ is the data distribution, which is never known to us, but we have sampled $ \boldsymbol{x}_{i} \sim p_{data}(\boldsymbol{x})$.
- $p_{model}(\boldsymbol{x}; \theta)$ it the model, parametrized by $\theta$, that we use to approximate $p_{data}(x)$.

Auto-regressive models

$p_{model}(\boldsymbol{x}; \theta)$ is used for both density estimation and sampling (from the probability density)

$$ p_{\text {model}}(\boldsymbol{x} ; \theta)=\prod_{t=1}^{n} p_{\text {model}}\left(x_{t} \mid x_{1}, \ldots, x_{t-1} ; \theta\right) $$

$\boldsymbol{x}$ is a vector, $x_t$ it the $t$-th dimension. E.g. $\boldsymbol{x}$ is a sentence, $x_t$ is the $t$-th word.
For graph generation,$x_t$ will be the $t$-th action (add node, add edge)

Idea: Generating graphs via sequentially adding nodes and edges.

Graphs $G$ with node ordering $\pi$ can be uniquely mapped into a sequence of node and edge additions $S^{\pi}$.

The sequence $S^{\pi}$ has two levels:

We transformed graph generation problem into a sequence generation problem.
Need to model 2 processes

Relationship between node-level RNN and edge-level RNN

Node-level RNN generate the initial state for edge-level RNN
Edge-level RNN generates edges for the new node, then updates node-level RNN state using generated results

Any node can connect to any prior node
Too many step for edge generation
- need to generate full adjacency matrix
- complex too-long edge dependencies

Solution: Tractablity via BFS

Breadth-First Search node ordering
Benefits:
- Reduce possible node orderings
  - From $O(n!)$ to number of distinct BFS orderings
- Reduce steps for edge generation
  - reducing nuber of previous nodes to look at

Challege: There is no efficient Graph isomorphism test that can be applied to any class of graphs

Solution: