Composable scaffolding for CoT #75

dhruviyer · 2025-01-10T08:28:06Z

Hello all,

As part of building toward #50 I wanted to lay some ground work for working with CoT prompts and completions. As a start I implemented this for sem_filter and want to get some comments/feedback on the design before rolling it out to other operators.

At a high level, I am proposing a few changes:

We use XML to cleanly separate reasoning sections and answer sections on the model outputs, which I hope will make generation and parsing more reliable and also can be reused across all the various semantic operators.

Hence, the PR revamps the way the model prompt to require the model to fill out this template:

<Reasoning> Provide your reasoning here. {customer reasoning instructions} </Reasoning> <Answer> Provide your answer here. {custom answer instructions} </Answer>

Correspondingly, it also formats the CoT examples in this same template when provided by the user

The PR also simplifies the logic around (1) prompting the model to do CoT and (2) providing examples of CoT. The following configurations are now implicitly handled in the single filter_formatter() function:

No CoT prompting, no examples
CoT prompting, no CoT examples
CoT prompting, CoT examples
No CoT prompting, CoT examples

As mentioned, mostly want to get feedback from the authors if this approach makes sense

…d providing examples and requiring the model to use CoT

dhruviyer · 2025-01-14T01:46:43Z

@liana313 after our discussion I removed the formatting of CoT using XML. This PR now introduces a standardized way of creating CoT prompts and parsing CoT responses. Following PRs will expand support to other semantic operators and implement retries (whichever PR solves #43)

dhruviyer added 5 commits January 9, 2025 19:35

cot and zs-cot support for semantic filter

a77a580

made cot optional

2999d58

linting and formatting

1ef7446

cleaning up for code review

ced50e0

exposed ability to add custom reasoning instructions and disaggregate…

fb6bdf3

…d providing examples and requiring the model to use CoT

dhruviyer linked an issue Jan 10, 2025 that may be closed by this pull request

Better CoT / Prompting Strategy Support #50

Open

dhruviyer added 3 commits January 10, 2025 00:28

remove debug level in filter example

436882e

fix mypy errors

e0640e9

prompt LLM to generate valid XML

0685768

dhruviyer marked this pull request as ready for review January 11, 2025 02:06

dhruviyer added the do not merge label Jan 13, 2025

revert using XML for CoT

2d7df7a

dhruviyer removed the do not merge label Jan 14, 2025

ruff format and removed excesss changes to mnimize PR

13d0161

dhruviyer assigned liana313 and unassigned liana313 Jan 15, 2025

dhruviyer requested a review from liana313 January 15, 2025 01:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Composable scaffolding for CoT #75

Composable scaffolding for CoT #75

dhruviyer commented Jan 10, 2025 •

edited

Loading

dhruviyer commented Jan 14, 2025 •

edited

Loading

Composable scaffolding for CoT #75

Are you sure you want to change the base?

Composable scaffolding for CoT #75

Conversation

dhruviyer commented Jan 10, 2025 • edited Loading

dhruviyer commented Jan 14, 2025 • edited Loading

dhruviyer commented Jan 10, 2025 •

edited

Loading

dhruviyer commented Jan 14, 2025 •

edited

Loading