You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I attempted to reproduce the issue on OpenSearch 2.7 using the sample data, iterating over all columns and setting os_index_field for each one. However, after performing these checks, I didn’t encounter any duplicate rows.
Could you please review this and see if there’s anything I might have overlooked?
What is the bug?
DataFrame.to_pandas generates duplicates when an
os_index_field
is set and/or other than "_doc".How can one reproduce the bug?
=> Loading index:
2024-03-01 16:02:58.179774: read 10000 rows
2024-03-01 16:03:07.520786: read 14895 rows
4930
What is the expected behavior?
opensearch_py_ml/operations.py:1229
search_yield_pandas_dataframes
should be called withsort_index
parameter set toos_index_field
value defined in the oml.DataFrameWhat is your host/environment?
The text was updated successfully, but these errors were encountered: