Replace Newspaper3k #9

Open
ninabarzh opened this issue Jun 2, 2022 · 0 comments
Labels
enhancement, help wanted, question

Comments

Owner

ninabarzh commented Jun 2, 2022

We currently use Newspaper3k as a web scraping library to extract the text of an article, and then use a machine learning library to generate the actual summary. Newspaper3k depends on nltk, which bloats the image.

Could we roll our own solution with, say, Beautiful Soup (for scraping) and BART (for summarizing)?
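
A minimal sketch of what such a replacement might look like, to make the proposal concrete. Everything here is an assumption rather than existing project code: the function names are hypothetical, the paragraph-based extraction is a simple heuristic (not what Newspaper3k does internally), and BART is loaded through the Hugging Face `transformers` summarization pipeline with the `facebook/bart-large-cnn` checkpoint, neither of which is named in this issue. Whether that pipeline task is available depends on the installed `transformers` version.

```python
from typing import Callable

from bs4 import BeautifulSoup


def extract_article_text(html: str) -> str:
    """Return the paragraph text of an HTML page (heuristic, hypothetical helper)."""
    soup = BeautifulSoup(html, "html.parser")
    # Drop elements that normally hold no article prose.
    for tag in soup.find_all(["script", "style", "nav", "header", "footer", "aside"]):
        tag.decompose()
    # Prefer an <article> element when the page has one; otherwise use the whole page.
    root = soup.find("article")
    if root is None:
        root = soup
    paragraphs = (p.get_text(" ", strip=True) for p in root.find_all("p"))
    return "\n".join(text for text in paragraphs if text)


def load_bart_summarizer(model_name: str = "facebook/bart-large-cnn") -> Callable[[str], str]:
    """Build a text -> summary callable backed by BART (assumed model and library)."""
    # Imported lazily so the scraping half works without transformers installed.
    from transformers import pipeline

    summarizer = pipeline("summarization", model=model_name)

    def summarize(text: str) -> str:
        # truncation=True cuts inputs that exceed the model's maximum length.
        result = summarizer(text, max_length=130, min_length=30, truncation=True)
        return result[0]["summary_text"]

    return summarize


def summarize_html(html: str, summarize: Callable[[str], str]) -> str:
    """Extract the article text from HTML and pass it to the given summarizer."""
    return summarize(extract_article_text(html))
```

Fetching the page is left to the caller, so the sketch stays independent of any HTTP client. Passing the summarizer in as a callable keeps the scraping half testable without downloading a model: `summarize_html(html, load_bart_summarizer())` would use BART, while a stub function can stand in during tests.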

ninabarzh added the enhancement, help wanted, and question labels Jun 2, 2022
ninabarzh added this to the 3rd party enhancements milestone Jun 2, 2022
ninabarzh changed the title from "Replace Newspaper3k]" to "Replace Newspaper3k" Jun 2, 2022