-
Notifications
You must be signed in to change notification settings - Fork 43
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Docling models path #362
Docling models path #362
Conversation
This pull request has merge conflicts that must be resolved before it can be |
Signed-off-by: Aakanksha Duggal <[email protected]>
Signed-off-by: Aakanksha Duggal <[email protected]>
Signed-off-by: Aakanksha Duggal <[email protected]>
3359ba4
to
1b984e0
Compare
Signed-off-by: Aakanksha Duggal <[email protected]>
b8e67d7
to
a6b6454
Compare
Signed-off-by: Aakanksha Duggal <[email protected]>
Signed-off-by: Aakanksha Duggal <[email protected]>
Co-authored-by: Jaideep Rao <[email protected]> Signed-off-by: Aakanksha Duggal <[email protected]>
Co-authored-by: Jaideep Rao <[email protected]> Signed-off-by: Aakanksha Duggal <[email protected]>
1fd4284
to
dcbf7f3
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, thanks
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
A couple of notes where I don't think the path is actually getting looked up and used like we want. And, as a more general note, I wonder if it would be easier to just lookup the docling models using the xdg_data_dirs logic closer to where we use docling, inside ContextAwareChunker
as that would avoid us having to wire this model path all the way through the calls here to get it from generate_data
down to where it's used.
This pull request has merge conflicts that must be resolved before it can be |
Signed-off-by: Aakanksha Duggal <[email protected]>
Signed-off-by: Aakanksha Duggal <[email protected]>
Signed-off-by: Aakanksha Duggal <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The indentation is fixed, and I pushed a couple of unit tests to confirm we're properly parsing the model path out of the config.yaml when its found and that when it's not we're leaving the model path as None.
Thanks for the great collaboration on this!
These simple unit tests just test the cases where we found a config.yaml to parse for the docling model path and where we didn't. Signed-off-by: Ben Browning <[email protected]>
0c5ef3c
to
0e9d75d
Compare
Changed how my unit tests mock to avoid a Python 3.10 vs 3.11 incompatibility with patching certain modules. This should get the tests green on the Python 3.10 CI again. |
Thanks @bbrowning for adding the quick unit tests 💯 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
thanks @aakankshaduggal and @bbrowning !
might want to squash your commits before merging
@Mergifyio backport release-v0.5 |
✅ Backports have been created
|
#335