Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve species percolation logic to better deal with the extinct case #919

Open
davidebbo opened this issue Nov 25, 2024 · 2 comments
Open

Comments

@davidebbo
Copy link
Collaborator

davidebbo commented Nov 25, 2024

For extinct species, we sometimes have good images, like 'Life reconstruction', and sometimes unpleasant ones, like a few fossilized bones/teeth.

Today, the 'bad' ones can end up percolating up, because we only rely on image quality in picProcess.py (and we don't know much about quality).

But if we were to add popularity to the percolation calculation, we probably would end up with better images, as more popular species are more likely to have nicer images.

@hyanwong
Copy link
Member

hyanwong commented Nov 25, 2024

Can we use popularity as a straight replacement for image quality somehow, e.g. by normalising it to within the same range as the quality scores?

I can't remember exactly how we use the quality scores in the picProcess algorithm, and how much is hard-coded to the 10000 - 50000 range we use for quality scores. @jrosindell will know, as he wrote that code.

@davidebbo
Copy link
Collaborator Author

davidebbo commented Nov 25, 2024

Copying my slack comment: Yes. But one obvious gotcha: we'd end up mostly percolating extant species, as they're likely far more popular than extinct (e.g. no ancient bear can compete with polar bears or modern pandas!). I suppose we could boost up extinct over extant.

@jrosindell wrote: I see, good point - probably we should percolate both extant and extinct species separately based on quality / popularity and only use extinct signposts if no extant species exists. I prefer this to augmenting the popularity which will feel a bit made up and hard to get right.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants