CrossGNN enables a transformer model to comprehend the graph modality by integrating graph-specific layers into the language model.
DeepSpeed does not support gxx (g++) versions later than 10. Installing gxx_linux-64=9.3.0 in advance avoids reconfiguring the whole environment.
```bash
conda install gxx_linux-64=9.3.0
pip install -r requirements.txt
conda install pyg pytorch-scatter -c pyg
```
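After installing, a quick sanity check (a minimal sketch using only the standard packages installed above) confirms that the core dependencies import cleanly:

```python
# Verify that the core dependencies are importable and report their versions.
import torch
import torch_geometric
import torch_scatter
import deepspeed

print("torch:", torch.__version__)
print("torch_geometric:", torch_geometric.__version__)
print("torch_scatter:", torch_scatter.__version__)
print("deepspeed:", deepspeed.__version__)
print("CUDA available:", torch.cuda.is_available())
```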
Note: Data preprocessing & graph creation code is yet to be added to this repository.
On top of an encoder-decoder transformer language backbone:
- On the encoder side, an extra graph neural network (GNN) is used to contextualize the graph nodes and edges with the language hidden states.
- On the decoder side, the GNN hidden states are attended to through inserted cross-attention layers to provide knowledge for language decoding, as sketched below.
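As a rough illustration of what one inserted layer could look like, the sketch below follows the Flamingo-style gated cross-attention that this repository credits; the module and argument names are illustrative, not the repository's actual code:

```python
import torch
import torch.nn as nn

class GatedGraphCrossAttention(nn.Module):
    """Illustrative cross-attention block inserted into a decoder layer.

    Language hidden states act as queries; GNN hidden states act as
    keys/values. A tanh gate initialized to zero lets training start from
    the unchanged frozen decoder, as in Flamingo.
    """

    def __init__(self, d_model: int, n_heads: int):
        super().__init__()
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.norm = nn.LayerNorm(d_model)
        self.gate = nn.Parameter(torch.zeros(1))  # gate starts closed

    def forward(self, lang_states, graph_states):
        # lang_states:  (batch, seq_len, d_model)  decoder hidden states
        # graph_states: (batch, n_nodes, d_model)  GNN node embeddings
        attended, _ = self.attn(
            query=self.norm(lang_states), key=graph_states, value=graph_states
        )
        return lang_states + torch.tanh(self.gate) * attended
```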
Two concurrent data streams exist:
- A T5 model handles language encoding and decoding, remaining unchanged during training.
- The graph is first encoded by a GNN model, influenced by the language hidden states, and then attended to and merged into the language decoder.
These streams intersect through two cross-attentions:
- During encoding, the graph's context node attends to language hidden states, allowing a one-way flow from language to graph.
- In decoding, the language hidden states cross-attend to the graph hidden states, drawing knowledge from the graph back into the language.
TL;DR:
- Language Encoder --inform--> Graph Encoder (❄)
- Encoded Graph --condition--> Language Decoder (❄)
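Putting the two streams together, the forward pass can be sketched roughly as follows. This is a data-flow schematic only: the attribute names (t5_encoder, gnn_encoder, t5_decoder) and argument names are placeholders, not the repository's actual interfaces.

```python
def crossgnn_forward(model, input_ids, attention_mask, decoder_input_ids, graph):
    # Stream 1: the frozen T5 encoder produces language hidden states.
    lang_states = model.t5_encoder(input_ids, attention_mask)    # (B, L, D)

    # Stream 2: the GNN encodes the graph; its context node cross-attends
    # to the language hidden states (one-way flow: language -> graph).
    graph_states = model.gnn_encoder(graph, lang_states)         # (B, N, D)

    # Decoding: the frozen T5 decoder runs with inserted cross-attention
    # layers that draw on the graph states (flow: graph -> language).
    logits = model.t5_decoder(
        decoder_input_ids,
        encoder_hidden_states=lang_states,
        graph_hidden_states=graph_states,
    )
    return logits
```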
Specify the model, data, and training configurations in a YAML file first, following the examples in configs/. Then pass the config file path and profile to the training or evaluation scripts; check their respective help messages for more details.
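Purely for illustration (the actual schema and key names are defined by the files in configs/, and every field below is an assumption), a config could group the settings like this:

```yaml
# Illustrative sketch only; consult configs/ for the real schema and keys.
model:
  backbone: t5-base      # frozen language encoder/decoder
  gnn_layers: 4
  hidden_dim: 768
data:
  train_path: data/train.jsonl
  eval_path: data/dev.jsonl
training:
  learning_rate: 1.0e-4
  batch_size: 32
  epochs: 10
```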
- Training
  ```bash
  python train.py --help
  ```
- Evaluation
  ```bash
  python eval.py --help
  ```
The model is built upon DRAGON, Flamingo, and a PyTorch implementation of Flamingo.