Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add DocQA Tool #330

Merged
merged 11 commits into from
Dec 19, 2024
Merged

Add DocQA Tool #330

merged 11 commits into from
Dec 19, 2024

Conversation

dillonalaird
Copy link
Member

@dillonalaird dillonalaird commented Dec 19, 2024

Adds a couple things:

  • Adds the DocQA tool
  • Adds new embedding tool based off of Stella
  • Updates prompts to test only relevant tools, not all tools

Copy link
Member

@camiloaz camiloaz left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

'label': table',
'order': 1119,
'caption': [{'Column 1': 'Value 1', 'Column 2': 'Value 2'},
'summary': 'This table illustrates a trend of ...'},
],
",document_extraction
"'document_qa' is a tool that can answer any questions about arbitrary documents, presentations, or tables. It's very useful for document QA tasks, you can ask it a specific question or ask it to return a JSON object answering multiple questions about the document.","document_qa(prompt: str, image: numpy.ndarray) -> str:
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

would it be better to not use "QA" and instead use "question answering"? i know that changing this would require new benchmarking runs, so no need to address it right now, but maybe for the future.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Like call the tool document_question_answering instead of document_qa? I think in this case it's fine, it seems to understand QA pretty well

@dillonalaird dillonalaird merged commit 8dce01a into main Dec 19, 2024
8 checks passed
@dillonalaird dillonalaird deleted the add-doc-qa branch December 19, 2024 17:59
Copy link
Contributor

@hugohonda hugohonda left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants