Eliciting Causal Abilities in Large Language Models for Reasoning Tasks

Illustrative examples demonstrating the purpose of our research problem.

Abstract

Prompt optimization automatically refines prompting expressions, unlocking the full potential of LLMs in downstream tasks. However, current prompt optimization methods are costly to train and lack sufficient interpretability. This paper proposes enhancing LLMs' reasoning performance by eliciting their causal inference ability from prompting instructions to correct answers. Specifically, we introduce the Self-Causal Instruction Enhancement (SCIE) method, which enables LLMs to generate high-quality, low-quantity observational data, then estimates the causal effect based on these data, and ultimately generates instructions with the optimized causal effect. In SCIE, the instructions are treated as the treatment, and textual features are used to process natural language, establishing causal relationships through treatments between instructions and downstream tasks. Additionally, we propose applying Object-Relational (OR) principles, where the uncovered causal relationships are treated as the inheritable class across task objects, ensuring low-cost reusability. Extensive experiments demonstrate that our method effectively generates instructions that enhance reasoning performance with reduced training cost of prompts, leveraging interpretable textual features to provide actionable insights.

Dependency requirements

The code has been verified to work under Python 3.12.4 with the following dependencies:

- openai 1.41.1
- causalnlp 0.8.0
- open-interpreter 0.3.7
- replicate 0.31.0

Usage

1. Generate `instructions_data`

python prepare_data_1.py --base_instruction XXXX --api_key XXXX --base_url XXXX --model XXXX 
#Specify the base instruction, base URL, API key, and the LLM to be used.

2. Generate observational data

python prepare_data_2.py --api_key XXXX --dataset ./input-dataset/XXXX  
# Specify a input-dataset. If not specified, the default is all datasets.

3. `generate_enhanced_instructions`

python generate_enhanced_instructions_1.py --data XXXX  --col_range XXXX --ignore_cols XXXX
# Specify a generated observational dataset, the proxy feature rage, and the ignore columns.

python generate_enhanced_instructions_2.py --data XXXX  --col_range XXXX --ignore_cols XXXX --api_key XXXX  
# Note: except specify the arguments, the prompting content in generate_new_instructions_2.py need to be substitued according to the results in generate_enhanced_instructions_1.py.

4. `evaluate`

python evaluate.py --base_instruction XXXX --enh_instruction XXXX --data ./input-dataset/xxxx --api_key XXXX --base_url xxxx --model xxxx
# The enh_instruction can be obtained from generate_enhanced_instructions_2.py.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Eliciting Causal Abilities in Large Language Models for Reasoning Tasks

Abstract

Dependency requirements

Usage

1. Generate `instructions_data`

2. Generate observational data

3. `generate_enhanced_instructions`

4. `evaluate`

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
generated_observational_dataset		generated_observational_dataset
imgs		imgs
input-dataset		input-dataset
README.md		README.md
evaluate.py		evaluate.py
generate_enhanced_instructions_1.py		generate_enhanced_instructions_1.py
generate_enhanced_instructions_2.py		generate_enhanced_instructions_2.py
prepare_data_1.py		prepare_data_1.py
prepare_data_2.py		prepare_data_2.py

tmlr-group/SCIE

Folders and files

Latest commit

History

Repository files navigation

Eliciting Causal Abilities in Large Language Models for Reasoning Tasks

Abstract

Dependency requirements

Usage

1. Generate instructions_data

2. Generate observational data

3. generate_enhanced_instructions

4. evaluate

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

1. Generate `instructions_data`

3. `generate_enhanced_instructions`

4. `evaluate`

Packages