-
Notifications
You must be signed in to change notification settings - Fork 548
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[QST] is there any feature to undersampling or oversampling like scikit-learn-contrib/imbalanced-learn #4362
Comments
Hi @rrfaria . Today, there isn't a simple way to do this. We're excited about this use case, as we've also seen that nuanced oversampling and undersampling on CPUs can be very time consuming. We're currently working with the imbalanced-learn maintainers on a pull request that would allow you to use cuML estimators with imbalanced learn, like this: from imblearn.over_sampling import SMOTE
...
nn = cuml.neighbors.NearestNeighbors()
x_resampled, y_resampled = SMOTE(k_neighbors=nn).fit_resample(x, y) If accelerated imbalanced-learn is important for your work, it would be great if you could comment on this imbalanced-learn issue to indicate your interest in this effort. |
Thank you so much |
This issue has been labeled |
This issue has been labeled |
any update? |
The relevant code has been merged into imbalanced-learn, so the code snippet above now works when using imbalanced-learn built from source. It's not yet available in pip/conda installations of imbalanced-learn, but will be in the next release. Based on initial testing, it's possible to achieve large speedups on samplers as data sizes grow. I'm going to close this issue. If you build imbalanced-learn from source and run into any issues using it with cuML, please feel free to re-open this issue. |
I could not found anything from imbalance-learn
I'm using something like this
But I would like to use cuml to speed up because with big amount of data it takes a lot of time
Is there any method I could use to do it?
The text was updated successfully, but these errors were encountered: