Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
FOMO Model Updates and Improvements
The FOMO model, with its very simple architecture, can sometimes cause issues in object detection tasks. These issues have been addressed in this pull request.
Identified Issue: Multiple Predictions Per Object
In its default configuration, the model occasionally predicts multiple points for each object. This behavior can be mitigated by:
Demonstration of Results
The following images illustrate the model's performance under different configurations:
Before Adjustment (Loss Weight: 1000, No NMS):
The model predicts multiple points for each object.
After Adjustment 1 (Loss Weight: 500, With NMS):
The model's predictions are improved with fewer redundant points, thanks to a combination of reduced loss weight and NMS.
After Adjustment 2 (Loss Weight: 5, No NMS):
A significantly lower loss weight further simplifies the predictions, even without applying NMS.
Validation Loss for Larger Objects
One remaining issue with this simple model is that the validation loss can rise for larger objects. This occurs because, in the initial training stages, the model tends to make uncertain and excessive predictions. For example:
Initial State: The model predicts numerous uncertain points for each object.
After a Few Epochs: The model gradually learns to identify the centers of objects but can still occasionally misplace them due to the inherent difficulty in defining the "correct" center in the labels.
This behavior is important to keep in mind when using this model, especially for datasets with larger or irregularly shaped objects.