
Finding a Proper Fix/Replacement for Coreferee? #9

Closed
duckduckdoof opened this issue Aug 26, 2024 · 4 comments
Labels: enhancement (New feature or request), help wanted (Extra attention is needed)

Comments

@duckduckdoof
From Paul:

multiple_essay_report.py is a script used to visualize the document/token features produced by our spaCy NLP pipeline. It can be used to verify whether pronouns like "they" are used properly in a document.

Currently, Paul has noted that coreferee (which we use for coreference resolution) fails at this with the current version of spaCy, specifically for pronoun antecedents. Our test texts include essays written for the GRE, which use pronouns in ways that spaCy and coreferee were not trained to handle or identify correctly. Now that we are updating to spaCy 3.6+, we need to check whether coreferee still performs poorly; finally, we would like to see whether other coreference modules perform better.
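For reference, coreferee's chains can be spot-checked per pronoun with a few lines. This is a minimal sketch, not the project's script: it assumes spaCy 3.x with an English model (`en_core_web_lg` here) and the coreferee pipe installed, and degrades gracefully when either is missing, since the version conflict described in this issue can surface at load time.

```python
# Sketch: inspect coreferee's coreference chains to spot-check which
# antecedent a pronoun like "they" resolves to.
status = "unavailable"
try:
    import spacy
    import coreferee  # registers the "coreferee" pipe factory with spaCy

    nlp = spacy.load("en_core_web_lg")  # assumed model; any English model coreferee supports
    nlp.add_pipe("coreferee")
    doc = nlp("The judges praised the essays because they were persuasive.")

    # doc._.coref_chains lists mention chains; resolve() maps a pronoun
    # token to its most specific antecedent token(s).
    for token in doc:
        if token.tag_ == "PRP":
            print(token.text, "->", doc._.coref_chains.resolve(token))
    status = "ok"
except Exception as exc:  # missing package/model, or a version conflict
    print(f"coreferee pipeline unavailable: {exc}")
```

Running this over the GRE passages would show directly whether "they" lands on the animate antecedent or the syntactically prominent one.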

@duckduckdoof added the enhancement and help wanted labels on Aug 26, 2024
@duckduckdoof changed the title from "Finding a Proper Replacement for Coreferee?" to "Finding a Proper Fix/Replacement for Coreferee?" on Aug 26, 2024
@duckduckdoof (Author) commented Aug 26, 2024

From Dr. Lynch in #7 (now merged into the current issue):

In early development of the components, an issue was found with the coreferee/spaCy module: it makes errors in pronominal reference when calculating pronoun antecedents for third-person plural pronouns. The workaround was a link to a separate, untested BERT server. In the GRE passage I used for testing, the pronoun "they" referred to animates, but the coreferee module overrode the semantic valence of the verb and preferred the syntactically most prominent potential antecedent, which was inanimate.
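The substitution idea behind a BERT-based check can be sketched as follows: build one variant of the sentence per candidate antecedent, then let a language model score which variant reads most plausibly. The helper below only builds the variants; its name and the scoring step are illustrative assumptions, not the actual service code under AWE_Workbench.

```python
from typing import Dict, List

def candidate_variants(text: str, pronoun: str, candidates: List[str]) -> Dict[str, str]:
    """Replace the first occurrence of `pronoun` with each candidate
    antecedent, yielding one scoreable sentence per candidate."""
    return {cand: text.replace(pronoun, cand, 1) for cand in candidates}

variants = candidate_variants(
    "The trucks carried the horses while they slept.",
    "they",
    ["the trucks", "the horses"],
)
for cand, sentence in variants.items():
    print(cand, "=>", sentence)
# A BERT service would assign each variant a (pseudo-)likelihood and
# pick the candidate whose sentence scores highest -- here, sleeping
# horses should outscore sleeping trucks.
```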

We need to (a) develop unit tests that can reliably evaluate whether the full system is working, and (b) evaluate the relative cost of doing this probability evaluation with BERT, spaCy/coreferee, and LanguageTool, which appears to have a built-in probability-estimation feature.
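A starting point for (a) is a resolver-agnostic harness over gold pronoun/antecedent pairs, so BERT, spaCy/coreferee, and LanguageTool can be compared behind one interface. Everything below is an illustrative assumption, not project code: the `resolve(text, pronoun) -> antecedent` signature, the gold cases, and the two toy resolvers, one of which mimics the failure mode described above (always prefer the syntactically prominent subject, ignoring animacy).

```python
from typing import Callable, List, Tuple

# Gold cases: (text, pronoun, expected antecedent head word).
# The second case mirrors the reported failure: "they" is animate,
# but the syntactically prominent subject ("trucks") is not.
CASES: List[Tuple[str, str, str]] = [
    ("The dogs chased the ball because they were excited.", "they", "dogs"),
    ("The trucks carried the horses while they slept.", "they", "horses"),
]

def accuracy(resolve: Callable[[str, str], str]) -> float:
    """Fraction of gold cases where the resolver picks the right antecedent."""
    hits = sum(resolve(text, pron) == gold for text, pron, gold in CASES)
    return hits / len(CASES)

# Toy resolver 1: always pick the sentence subject -- the behaviour
# the issue attributes to coreferee.
def subject_baseline(text: str, pronoun: str) -> str:
    return text.split()[1].strip(".,")

# Toy resolver 2: prefer the first candidate on a tiny hand-written
# animacy list, falling back to the subject baseline.
ANIMATE = {"dogs", "horses", "judges", "students"}

def animacy_preferring(text: str, pronoun: str) -> str:
    for word in (w.strip(".,").lower() for w in text.split()):
        if word in ANIMATE:
            return word
    return subject_baseline(text, pronoun)

print(accuracy(subject_baseline))    # misses the animate-object case
print(accuracy(animacy_preferring))  # handles both toy cases
```

For (b), the same harness could also record wall-clock time per `resolve` call, giving a rough relative-cost figure alongside accuracy.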

The code that uses the BERT service is located in awe_components/components/utility_functions.py under ResolveReferences. The code that runs the BERT service is under AWE_Workbench.

@duckduckdoof (Author) commented Sep 20, 2024
Moved issue to AWE_Components, since that's where the solution will sit.
