BERTRL: Inductive Relation Prediction by BERT

Code and data for AAAI2022 paper Inductive Relation Prediction by BERT, which aims to study the problem of exploiting structural and textual information in knowledge graph completion leverging pre-trained langauge models. BERTRL feeds texts of candidate triple instances and their possible reasoning paths to BERT and predicts the existence of the triple.

Requirements:

huggingface transformer 3.3.1
pytorch 1.5.0
networkx 2.5
tqdm
numpy
sklearn
ipdb

Download the Dataset Split

Here we provide the data split used in paper in folder "data". The $DATASET$PART and $DATASET$PART_ind contain corresponding transductive and inductive subgraphs. Each train/valid/test file contains a list of knowledge graph triples. "ranking_head.txt" and "ranking_tail.txt" are presampled candidates for the predicting the missing tail triple and missing head triple in knowledge graph completion. Each triple contains 50 candidates for tail and 50 for head in this file. $DATASET denotes the dataset name, and $PART denotes the size of the dataset, whether it is a fewshot version or full. If $PART is not specified, it is full by default.

Preprocessing Data

folder "bertrl_data" provides an example of preprocessed data to be input of BERTRL model. They are actual tsv data examples required for BERTRL. Here we show the example preprocessing scripts. $DATASET denotes the name of the dataset in folder "data", e.g. fb237. part paramerter can be specified as full, 1000, 2000, referring to folder "data".

python load_data.py -d $DATASET -st train --part full --hop 3 --ind_suffix "_ind" --suffix "_neg10_max_inductive"
python load_data.py -d $DATASET -st test --part full --hop 3 --ind_suffix "_ind" --suffix "_neg10_max_inductive"

BERTRL

Training model We provide example bash scripts in train.bash
Evaluating model We provide example bash scripts in test.bash. This generates the BERTRL scoring for each of its examples. Then we evaluate the final ranking results by additional script aggregating the results. Here is an example. python eval_bert.py -d ${DATASET}_hop3_full_neg10_max_inductive
Hyperpameters Our provided training and evaluating scirpts use the hyperparameters mentioned in the paper. Check our paper for more details.

Baselines

We use code provided by authors for baselines in our experiments. RuleN is a ruled-based method. Detailed instructions can be found here. GraIL is subgraph reasoning method. Detailed instructions can be found here.

Citation

If you find this project useful, please cite it using the following format

@article{zha2021inductive,
title={Inductive Relation Prediction by BERT},
author={Zha, Hanwen and Chen, Zhiyu and Yan, Xifeng},
journal={arXiv preprint arXiv:2103.07102},
year={2021}
}

Q&A

If you have any questions about the paper and the github, please feel free to leave an issue or send me an email.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
bertrl_data/fb237_hop3_1000_neg10_path3_max_inductive		bertrl_data/fb237_hop3_1000_neg10_path3_max_inductive
data		data
ruleN		ruleN
.gitignore		.gitignore
README.md		README.md
create_fewshot_samples.py		create_fewshot_samples.py
create_unseen_relation_samples.py		create_unseen_relation_samples.py
eval_bertrl.py		eval_bertrl.py
load_data.py		load_data.py
model.py		model.py
run_bertrl.py		run_bertrl.py
test.sh		test.sh
train.sh		train.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BERTRL: Inductive Relation Prediction by BERT

Requirements:

Download the Dataset Split

Preprocessing Data

BERTRL

Baselines

Citation

Q&A

About

Releases

Packages

Languages

zhw12/BERTRL

Folders and files

Latest commit

History

Repository files navigation

BERTRL: Inductive Relation Prediction by BERT

Requirements:

Download the Dataset Split

Preprocessing Data

BERTRL

Baselines

Citation

Q&A

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages