Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Tagging sequence markup for entity extraction #810

Closed
hoagy-davis-digges opened this issue Oct 11, 2017 · 7 comments
Closed

Tagging sequence markup for entity extraction #810

hoagy-davis-digges opened this issue Oct 11, 2017 · 7 comments
Assignees

Comments

@hoagy-davis-digges
Copy link

How can I use a tagging sequence such as IOB with snorkel features?

@jason-fries
Copy link
Contributor

Hi @hoagy-davis-digges

The easiest way to implement IOB tagging is to treat every word as a candidate and then use categorical variables, similar to the tutorial at: https://github.com/HazyResearch/snorkel/blob/master/tutorials/advanced/Categorical_Classes.ipynb

Here your labeling functions would output labels in the space B, I , O, abstain. This approach doesn't model dependencies between labels (like a CRF would), but not all problems actually need this.

We're working on a formal entity tagging paper, so these types of tasks will be easier to work with in Snorkel in the near future.

@hoagy-davis-digges
Copy link
Author

Awesome, thanks!

@ajratner
Copy link
Contributor

Closing for now--will be accessible via the "Q&A" link in README--but feel free to re-open!

@jbkoh
Copy link

jbkoh commented Aug 13, 2018

Hi @jason-fries
Is there any update on the entity tagging with Snorkel you mentioned above? It'd be awesome if I could try it out for my use case. I've been doing research on entity resolution in metadata of buildings. I can also contribute if there is a room for it.

Thanks for the great project!

@thak123
Copy link

thak123 commented Jan 29, 2020

Any update in this regard?

1 similar comment
@mpgxc
Copy link

mpgxc commented Apr 11, 2021

Any update in this regard?

@jason-fries
Copy link
Contributor

jason-fries commented Apr 12, 2021

Hi @mpgxc,

The general idea behind sequence labeling with off-the-shelf Snorkel is outlined in Issue #1254. More technical details are available in our manuscript and our demo notebook for weakly supervised NER at https://github.com/som-shahlab/trove/

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

7 participants