TensorFlow Graph Neural Networks

Posted by Sibon Li, Jan Pfeifer, Bryan Perozzi and Douglas Yarrington

Today, we are excited to release TensorFlow Graph Neural Networks (GNNs), a library designed to make it easy to work with graph structured data using TensorFlow. We have used an earlier version of this library in production at Google in a variety of contexts (for example, spam and anomaly detection, traffic estimation, YouTube content labeling) and as a component in our scalable graph mining pipelines. In particular, given the myriad types of data at Google, our library was designed with heterogeneous graphs in mind. We are releasing this library with the intention to encourage collaborations with researchers in industry.

Why use GNNs?

Graphs are all around us, in the real world and in our engineered systems. A set of objects, places, or people and the connections between them is generally describable as a graph. More often than not, the data we see in machine learning problems is structured or relational, and thus can also be described with a graph. And while fundamental research on GNNs is perhaps decades old, recent advances in the capabilities of modern GNNs have led to improvements in domains as varied as traffic prediction, rumor and fake news detection, modeling disease spread, physics simulations, and understanding why molecules smell.

Graphs can model the relationships between many different types of data, including web pages (left), social connections (center), or molecules (right).

A graph represents the relations (edges) between a collection of entities (nodes or vertices). We can characterize each node, edge, or the entire graph, and thereby store information in each of these pieces of the graph. Additionally, we can ascribe directionality to edges to describe information or traffic flow, for example.
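
To make this concrete, here is a minimal sketch of such a graph built with the GraphTensor data structure described later in this post. The node sets, edge set, sizes and feature values are purely illustrative:

    import tensorflow as tf
    import tensorflow_gnn as tfgnn

    # A tiny heterogeneous graph: two users, two movies, and directed,
    # weighted 'watched' edges pointing from users to movies.
    graph = tfgnn.GraphTensor.from_pieces(
        node_sets={
            'user': tfgnn.NodeSet.from_fields(
                sizes=tf.constant([2]),
                features={tfgnn.DEFAULT_STATE_NAME: tf.random.normal([2, 4])}),
            'movie': tfgnn.NodeSet.from_fields(
                sizes=tf.constant([2]),
                features={tfgnn.DEFAULT_STATE_NAME: tf.random.normal([2, 4])}),
        },
        edge_sets={
            'watched': tfgnn.EdgeSet.from_fields(
                sizes=tf.constant([3]),
                adjacency=tfgnn.Adjacency.from_indices(
                    source=('user', tf.constant([0, 0, 1])),
                    target=('movie', tf.constant([0, 1, 1]))),
                features={'weight': tf.constant([0.9, 0.4, 0.7])}),
        })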

GNNs can be used to answer questions about multiple characteristics of these graphs. By working at the graph level, we try to predict characteristics of the entire graph, such as the presence of certain "shapes," like circles in a graph that might represent sub-molecules or perhaps close social relationships. GNNs can also be used on node-level tasks, to classify the nodes of a graph and predict partitions and affinity in a graph, similar to image classification or segmentation. Finally, we can use GNNs at the edge level to discover connections between entities, perhaps using GNNs to "prune" edges to identify the set of objects in a scene.
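
As a rough sketch of how these three levels translate into code, the snippet below reads out a prediction input at each level from the graph built above. The node-level and edge-level reads use feature accessors that also appear later in this post; the graph-level op name `pool_nodes_to_context` is our assumption, so treat it as illustrative:

    # Node level: read each movie node's state (e.g., to classify movies).
    movie_states = graph.node_sets['movie'][tfgnn.DEFAULT_STATE_NAME]

    # Edge level: read per-edge features (e.g., to score or prune edges).
    edge_weights = graph.edge_sets['watched']['weight']

    # Graph level: pool all node states into a single vector per graph
    # component (op name assumed here).
    graph_vector = tfgnn.pool_nodes_to_context(
        graph, 'movie', 'mean', feature_name=tfgnn.DEFAULT_STATE_NAME)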

Structure

TF-GNN provides building blocks for implementing GNN models in TensorFlow. Beyond the modeling APIs, our library also provides extensive tooling around the difficult task of working with graph data: a Tensor-based graph data structure, a data handling pipeline, and example models for users to quickly onboard.

The various components of TF-GNN that make up the workflow.

The initial release of the TF-GNN library contains a number of utilities and features for use by beginners and expert users alike, including:

  • A high-level Keras-style API to create GNN models that can easily be composed with other types of models. GNNs are often used in combination with ranking, deep-retrieval (dual-encoders) or combined with other types of models (image, text, etc.)
  • A GNN API for heterogeneous graphs. Many of the graph problems we approach at Google and in the real world contain different types of nodes and edges, so we chose to provide an easy way to model this.
  • A well-defined schema to declare the topology of a graph, and tools to validate it (see the sketch after this list). This schema describes the shape of the training data and serves to guide other tools.

  • A GraphTensor composite tensor type which holds graph data, can be batched, and has graph manipulation routines available.

  • A library of operations on the GraphTensor structure:
    • Various efficient broadcast and pooling operations on nodes and edges, and related tools.
    • A library of standard baked convolutions that can be easily extended by ML engineers/researchers.
    • A high-level API for product engineers to quickly build GNN models without necessarily worrying about the details.

  • An encoding of graph-shaped training data on disk, as well as a library used to parse this data into a data structure from which your model can extract the various features.
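
To illustrate the schema and the on-disk encoding together, here is a minimal sketch that declares a schema for the movie-recommendation example below, validates it, and parses serialized graph records. The file name is hypothetical, and the helpers (tfgnn.parse_schema, tfgnn.create_graph_spec_from_schema_pb, tfgnn.parse_example) are used as we understand them, so treat this as a sketch rather than a canonical recipe:

    import tensorflow as tf
    import tensorflow_gnn as tfgnn

    # A minimal schema declaring node sets, an edge set, and their features.
    schema_pbtxt = """
    node_sets {
      key: "user"
      value {
        features {
          key: "hidden_state"
          value { dtype: DT_FLOAT shape { dim { size: 256 } } }
        }
      }
    }
    node_sets {
      key: "movie"
      value {
        features {
          key: "hidden_state"
          value { dtype: DT_FLOAT shape { dim { size: 64 } } }
        }
      }
    }
    edge_sets {
      key: "watched"
      value {
        source: "user"
        target: "movie"
        features {
          key: "weight"
          value { dtype: DT_FLOAT }
        }
      }
    }
    """

    # Validate the schema text and derive a type spec for parsing.
    schema = tfgnn.parse_schema(schema_pbtxt)
    graph_spec = tfgnn.create_graph_spec_from_schema_pb(schema)

    # Parse serialized graph examples from disk (hypothetical file name).
    dataset = (
        tf.data.TFRecordDataset(['training_graphs.tfrecord'])
        .batch(32)
        .map(lambda serialized: tfgnn.parse_example(graph_spec, serialized))
    )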

Example usage

In the example below, we build a model using the TF-GNN Keras API to recommend movies to a user based on what they watched and the genres that they liked.

We use the ConvGNNBuilder method to specify the type of edge and node configuration, namely to use WeightedSumConvolution (defined below) for edges. And for each pass through the GNN, we will update the node values through a Dense interconnected layer:

    import tensorflow as tf
    import tensorflow_gnn as tfgnn

    # Model hyper-parameters:
    h_dims = {'user': 256, 'movie': 64, 'genre': 128}
    
    # Model builder initialization:
    gnn = tfgnn.keras.ConvGNNBuilder(
      lambda edge_set_name: WeightedSumConvolution(),
      lambda node_set_name: tfgnn.keras.layers.NextStateFromConcat(
         tf.keras.layers.Dense(h_dims[node_set_name]))
    )
    
    # Two rounds of message passing to target node sets:
    model = tf.keras.models.Sequential([
        gnn.Convolve({'genre'}),  # sends messages from movie to genre
        gnn.Convolve({'user'}),  # sends messages from movie and genre to users
        tfgnn.keras.layers.Readout(node_set_name="user"),
        tf.keras.layers.Dense(1)
    ])
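
Assuming a dataset of (GraphTensor, label) batches, for example built on top of the parsing sketch above (the train_ds name and the loss are placeholders), the model can then be trained with the standard Keras workflow:

    # Hypothetical training setup: `train_ds` yields (graph, label) batches.
    model.compile(
        optimizer='adam',
        loss=tf.keras.losses.BinaryCrossentropy(from_logits=True))
    model.fit(train_ds, epochs=5)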

The code above works great, but sometimes we may want to use a more powerful custom model architecture for our GNNs. For example, in our previous use case, we might want to specify that certain movies or genres hold more weight when we make our recommendation. In the following snippet, we define a more advanced GNN with custom graph convolutions, in this case with weighted edges. We define the WeightedSumConvolution class to pool edge values as a sum of weights across all edges:

class WeightedSumConvolution(tf.keras.layers.Layer):
  """Weighted sum of source nodes states."""

  def call(self, graph: tfgnn.GraphTensor,
           edge_set_name: tfgnn.EdgeSetName) -> tfgnn.Field:
    # Broadcast each source node's state onto its outgoing edges.
    messages = tfgnn.broadcast_node_to_edges(
        graph,
        edge_set_name,
        tfgnn.SOURCE,
        feature_name=tfgnn.DEFAULT_STATE_NAME)
    # Scale each message by its scalar edge weight.
    weights = graph.edge_sets[edge_set_name]['weight']
    weighted_messages = tf.expand_dims(weights, -1) * messages
    # Sum the weighted messages at each target node.
    pooled_messages = tfgnn.pool_edges_to_node(
        graph,
        edge_set_name,
        tfgnn.TARGET,
        reduce_type='sum',
        feature_value=weighted_messages)
    return pooled_messages

Note that even though the convolution was written with only the source and target nodes in mind, TF-GNN makes sure it is applicable and works on heterogeneous graphs (with various types of nodes and edges) seamlessly.

Next steps

You can check out the TF-GNN GitHub repo for more information. To stay up to date, you can read the TensorFlow blog, join the TensorFlow Forum at discuss.tensorflow.org, follow twitter.com/tensorflow, or subscribe to youtube.com/tensorflow. If you've built something you'd like to share, please submit it for our Community Spotlight at goo.gle/TFCS. For feedback, please file an issue on GitHub. Thank you!

Acknowledgments

The work described here was a research collaboration between Oleksandr Ferludin, Martin Blais, Jan Pfeifer, Arno Eigenwillig, Dustin Zelle, Bryan Perozzi and Da-Cheng Juan of Google, and Sibon Li, Alvaro Sanchez-Gonzalez, Peter Battaglia, Kevin Villela, Jennifer She and David Wong of DeepMind.
