Fix union validator to correctly identify inherited classes in discriminated unions #1613

benglewis · 2025-01-30T14:35:26Z

Update the `validate_smart` function to identify the correct object class in a discriminated union by using the class with the greatest number of matching fields and the greatest percentage of matching fields.

* Modify `validate_smart` function in `src/validators/union.rs` to calculate the number and percentage of matching fields for each class and select the best match.
* Add test cases in `tests/validators/test_union.py` to verify the updated `validate_smart` function for discriminated unions and ensure correct class selection based on the number and percentage of matching fields.

---

For more details, open the [Copilot Workspace session](https://copilot-workspace.githubnext.com/pydantic/pydantic-core/tree/main?shareId=XXXX-XXXX-XXXX-XXXX).


davidhewitt

This seems like a reasonable heuristic to me when picking what the best match should be, thanks.

cc @dmontagu we've talked about this kind of idea long ago, this seems like a reasonable small iteration towards a good result 👍

davidhewitt · 2025-01-30T18:37:20Z

src/validators/union.rs

@@ -113,7 +113,7 @@ impl UnionValidator {
        let strict = state.strict_or(self.strict);
        let mut errors = MaybeErrors::new(self.custom_error.as_ref());

-        let mut best_match: Option<(Py<PyAny>, Exactness, Option<usize>)> = None;
+        let mut best_match: Option<(Py<PyAny>, Exactness, Option<usize>, usize, f64)> = None;


I think now that we have so many fields here, we should create a struct for these so we can give them names.
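A minimal sketch of what such a struct could look like. The struct name and all field names are hypothetical, since the diff does not say what the trailing `usize` and `f64` represent; `Exactness` is redefined locally and `T` stands in for `Py<PyAny>` so that the snippet compiles without pyo3.

```rust
// Stand-in for the validator's `Exactness` type, redefined here only so the
// sketch compiles on its own. Variant order gives Lax < Strict < Exact.
#[derive(Debug, Clone, Copy, PartialEq, Eq, PartialOrd, Ord)]
enum Exactness {
    Lax,
    Strict,
    Exact,
}

// Hypothetical named replacement for the five-element tuple in the diff.
// `T` takes the place of `Py<PyAny>`; the last two field names are guesses
// at what the added `usize` and `f64` are meant to hold.
#[derive(Debug)]
struct BestMatch<T> {
    value: T,
    exactness: Exactness,
    fields_set_count: Option<usize>,
    matched_fields: usize,
    matched_fraction: f64,
}
```

With named fields, a comparison such as `candidate.matched_fraction > best.matched_fraction` reads on its own, where the tuple form would need positional access like `.4`.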

davidhewitt · 2025-01-30T18:43:08Z

src/validators/union.rs

@@ -141,25 +141,24 @@ impl UnionValidator {

                        let new_exactness = state.exactness.unwrap_or(Exactness::Lax);
                        let new_fields_set_count = state.fields_set_count;
+                        let new_fields_set_percentage = new_fields_set_count.map(|count| {
+                            let total_fields = input.as_dict().map_or(0, |dict| dict.len());


This doesn't depend on the validator, so it can be computed once at the start of the function (i.e. above the `self.choices` loop).
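As a rough illustration of the hoisting, here is a self-contained sketch that does not use the real validator types. `matched_fractions` and its string-slice representation of choices and input keys are assumptions made for this example only.

```rust
// Hypothetical model: each choice is the list of field names it accepts,
// and the input is represented by its list of keys.
fn matched_fractions(choices: &[&[&str]], input_keys: &[&str]) -> Vec<Option<f64>> {
    // Depends only on the input, so it is computed once, above the loop.
    let total_fields = input_keys.len();

    choices
        .iter()
        .map(|fields| {
            // Per-choice work: count how many of this choice's fields
            // appear among the input keys.
            let matched = fields
                .iter()
                .filter(|field| input_keys.iter().any(|key| *key == **field))
                .count();
            if total_fields == 0 {
                None
            } else {
                Some(matched as f64 / total_fields as f64)
            }
        })
        .collect()
}
```

Only the `matched` count changes from one choice to the next; the denominator is the same for every iteration.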

davidhewitt · 2025-01-30T18:43:44Z

src/validators/union.rs

@@ -141,25 +141,24 @@ impl UnionValidator {

                        let new_exactness = state.exactness.unwrap_or(Exactness::Lax);
                        let new_fields_set_count = state.fields_set_count;
+                        let new_fields_set_percentage = new_fields_set_count.map(|count| {
+                            let total_fields = input.as_dict().map_or(0, |dict| dict.len());
+                            count as f64 / total_fields as f64


To avoid a divide by zero here, I think we should keep `total_fields` as `Option<usize>`. For example, consider an `int` or `str` input; these have no fields.
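One way this could look, as a sketch: the helper name `fields_set_fraction` is hypothetical, and treating an empty dict the same as a non-dict input is an assumption. Note that dividing `f64` values by zero does not panic in Rust; it yields NaN or infinity, which would then flow into the best-match comparison.

```rust
// Hypothetical helper: returns the fraction only when both values exist and
// the total is non-zero. `total_fields` is `None` for inputs without fields
// (for example an int or str).
fn fields_set_fraction(count: Option<usize>, total_fields: Option<usize>) -> Option<f64> {
    match (count, total_fields) {
        (Some(count), Some(total)) if total > 0 => Some(count as f64 / total as f64),
        // Missing count, missing total, or an empty dict: no meaningful ratio.
        _ => None,
    }
}
```

Callers can then decide explicitly how a `None` fraction ranks against a `Some`, instead of comparing against NaN.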
