The repository for our paper Multilingual Relation Classification via Efficient and Effective Prompting, to appear at EMNLP-2022 (main conference).
In this paper, we extend the power of prompting to the underexplored task of multilingual relation classification and aim to find the best ways to prompt for different languages, data regimes, etc., with minimal handcrafting (i.e. translation). We are especially interested in in-language and cross-lingual performance, as well as the different behaviour of code-switched and in-language prompts.
Effectiveness is validated on the 14 languages covered by the SMiLER dataset.
Path | Description |
---|---|
config/ | This directory contains the Hydra config files that specify pre-defined settings. |
data/ | This is the directory where the user should put their data files; it also contains some pre-processing scripts. |
docs/ | This directory contains auxiliary files for documentation, such as the figure(s) presented in the README. |
src/meffi_prompt/ | This directory is the package to be installed, which contains the source code of our implementation. |
git clone git@github.com:DFKI-NLP/meffi-prompt.git
cd meffi-prompt
pip install -e .
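To verify the installation, a quick import check should succeed (a minimal sanity check; meffi_prompt is the package name from src/meffi_prompt/ above):
python -c "import meffi_prompt"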
To evaluate the default setting (i.e. the fully supervised scenario with model="google/mt5-base", max_length=256, batch_size=16, num_epochs=10, lr=3e-5, soft_token_length=0), run:
python main.py
To run your own setting:
python main.py model="google/mt5-small" batch_size=4 num_epochs=5
Hydra provides a simple way to sweep over arguments for hyperparameter tuning. The following command will execute 3 * 2 * 1 = 6 runs in a row:
python main.py -m batch_size=4,8,16 model="google/mt5-base","google/mt5-small" max_length=512
To show the available options and the default config, do:
python main.py --help
which results in something like this:
== Config ==
Override anything in the config (foo.bar=value)
seed: 1234
cuda_device: 0
train_file: ./data/smiler/de_corpora_train.json
eval_file: ./data/smiler/de_corpora_test.json
model: google/mt5-base
soft_token_length: 0
max_length: 256
batch_size: 16
lr: 3.0e-05
num_epochs: 5
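To train and evaluate on another language, override train_file and eval_file accordingly. The paths below are only illustrative; they assume the other languages' files follow the same naming pattern as the default German files:
python main.py train_file=./data/smiler/en_corpora_train.json eval_file=./data/smiler/en_corpora_test.json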
Note that the different run-scripts correspond to different evaluation scenarios:
script name | scenario |
---|---|
main.py | fully supervised |
main_fs.py | few-shot |
main_iczs.py | in-context zero-shot |
main_zslt.py | zero-shot lingual transfer |
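The other scripts are invoked in the same way as main.py. For example, a zero-shot lingual transfer run that trains on English and evaluates on German could look like the sketch below, assuming main_zslt.py accepts the same config keys as main.py (the file paths are illustrative):
python main_zslt.py train_file=./data/smiler/en_corpora_train.json eval_file=./data/smiler/de_corpora_test.json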
The templates we employ (see the table above) are already defined in the code, so no extra work is needed on your side to reproduce our results.
You can also define your own template. For example, if you want the template to be "$x$. The relation between $e_h$ and $e_t$ is _____.", just modify the template as follows:
template = {
"input": ["x", "The relation between", "eh", "and", "et", "is", "<extra_id_0>"],
"target": ["<extra_id_0>", "r", "<extra_id_1>"],
}
where <extra_id_?> are special tokens reserved by T5 to denote either (1) the start of a blank or (2) the end of the decoded sequence. The remaining elements are hard tokens. To insert soft tokens, use [vN] (v means virtual token; N is the length of the inserted soft tokens and can be specified in the config).
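For instance, a soft-prompt variant of the template above could look like the sketch below; this exact template is not from the paper, and it assumes each [v3] expands to three trainable virtual tokens:
template = {
    # hypothetical example: replace the hard connective phrases with soft tokens
    "input": ["x", "[v3]", "eh", "[v3]", "et", "[v3]", "<extra_id_0>"],
    "target": ["<extra_id_0>", "r", "<extra_id_1>"],
}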
We evaluate on the SMiLER dataset, which covers 14 languages.
The dataset can be downloaded from https://github.com/samsungnlp/smiler. The pre-processing script is at ./data/smiler/reformatter.py. The main statistics per language are listed as follows:
Language | #Class | #Train | #Test | % no-rel (train) |
---|---|---|---|---|
ar | 9 | 9303 | 190 | 3.46 |
de | 22 | 51490 | 1051 | 0.89 |
en | 36 | 267579 | 5461 | 4.91 |
es | 21 | 11061 | 226 | 4.83 |
fa | 8 | 2624 | 54 | 7.93 |
fr | 22 | 60884 | 1243 | 0.90 |
it | 22 | 73974 | 1510 | 0.70 |
ko | 28 | 18711 | 382 | 1.67 |
nl | 22 | 38850 | 793 | 0.86 |
pl | 21 | 16831 | 344 | 0.00 |
pt | 22 | 43335 | 885 | 0.84 |
ru | 8 | 6395 | 131 | 1.86 |
sv | 22 | 4482 | 92 | 0.60 |
uk | 7 | 968 | 20 | 7.02 |
@inproceedings{chen-etal-2022-multilingual,
title = "Multilingual Relation Classification via Efficient and Effective Prompting",
author = "Chen, Yuxuan and Harbecke, David and Hennig, Leonhard",
booktitle = "Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing",
month = dec,
year = "2022",
address = "Online and Abu Dhabi, the United Arab Emirates",
publisher = "Association for Computational Linguistics",
abstract = "Prompting pre-trained language models has achieved impressive performance on various NLP tasks, especially in low data regimes. Despite the success of prompting in monolingual settings, applying prompt-based methods in multilingual scenarios has been limited to a narrow set of tasks, due to the high cost of handcrafting multilingual prompts. In this paper, we present the first work on prompt-based multilingual relation classification (RC), by introducing an efficient and effective method that constructs prompts from relation triples and involves only minimal translation for the class labels. We evaluate its performance in fully supervised, few-shot and zero-shot scenarios, and analyze its effectiveness across 14 languages, prompt variants, and English-task training in cross-lingual settings. We find that in both fully supervised and few-shot scenarios, our prompt method beats competitive baselines: fine-tuning XLM-R_EM and null prompts. It also outperforms the random baseline by a large margin in zero-shot experiments. Our method requires little in-language knowledge and can be used as a strong baseline for similar multilingual classification tasks.",
}
This repository is released under the terms of the Apache 2.0 license.