Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use sentence-data in database to supplement existing ML model training data #87

Open
jecarr opened this issue Dec 19, 2023 · 0 comments
Labels
enhancement Performance, refactoring, and usability good first issue Good for newcomers

Comments

@jecarr
Copy link
Member

jecarr commented Dec 19, 2023

When a report is submitted, we either (solely) retrieve existing ML models or build/save/retrieve them.

Our training data is currently what was provided from the initial TRAM repo

It would be good to utilise the true positives, false negatives, and false positives we have in the database as training data

Only add these to the training data if they are associated with sentences from completed reports.

If #88 has been completed, please also exclude sentences with non-confident mappings from the training data.

@jecarr jecarr added feature request New feature or request enhancement Performance, refactoring, and usability and removed feature request New feature or request labels Dec 19, 2023
@KadeMorton KadeMorton added the good first issue Good for newcomers label Dec 19, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement Performance, refactoring, and usability good first issue Good for newcomers
Projects
None yet
Development

No branches or pull requests

2 participants