remove highly correlated categorical variables #1045

Open
zhaoliang0302 opened this issue Oct 11, 2022 · 2 comments
Labels
feature a feature request or enhancement

Comments

@zhaoliang0302

Hi,

step_corr() can remove highly correlated continuous variables using Pearson or Spearman correlation. However, the recipes package does not provide an equivalent filter for categorical variables. I have 20 one-hot encoded categorical columns, and I want to remove the redundant columns that are correlated with each other. Can you give me some advice? Thanks

Best regards

@EmilHvitfeldt added the question and feature (a feature request or enhancement) labels and removed the question label on Mar 30, 2023
@EmilHvitfeldt
Member

Hello @zhaoliang0302, I have been thinking about such steps for a while. Do you know of any existing methods that would work for such an operation?

@corybrunson
Contributor

Hi all, I just came across this issue. I reviewed the JOSS submission for {latentcor}, which is on CRAN and might provide a versatile solution for logical, numeric, and categorical variables.
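
For anyone landing here in the meantime, below is a minimal sketch of one possible workaround outside of recipes. It is not an existing recipes step and not the {latentcor} approach; it swaps in Cramér's V (computed from base R's chisq.test()) as the association measure. The helper names cramers_v() and drop_correlated_categorical(), the 0.9 threshold, and the toy data are all assumptions made for illustration.

```r
# Cramér's V between two categorical vectors, using base R only.
cramers_v <- function(x, y) {
  tab <- table(x, y)
  # Drop empty rows/columns so the chi-squared statistic is defined
  tab <- tab[rowSums(tab) > 0, colSums(tab) > 0, drop = FALSE]
  k <- min(dim(tab)) - 1
  if (k < 1) return(NA_real_)  # a constant column has no association to measure
  chi2 <- suppressWarnings(chisq.test(tab, correct = FALSE)$statistic)
  unname(sqrt(chi2 / (sum(tab) * k)))
}

# Hypothetical helper: walk the columns in order and keep a column only if
# its V with every already-kept column is at or below `threshold`.
drop_correlated_categorical <- function(data, threshold = 0.9) {
  keep <- character(0)
  for (col in names(data)) {
    v <- vapply(
      keep,
      function(kept) cramers_v(data[[col]], data[[kept]]),
      numeric(1)
    )
    if (!any(v > threshold, na.rm = TRUE)) keep <- c(keep, col)
  }
  data[, keep, drop = FALSE]
}

# Toy data (made up): `b` is an exact relabelling of `a`, `c` is independent.
set.seed(1)
a <- factor(sample(c("x", "y", "z"), 200, replace = TRUE))
toy <- data.frame(
  a = a,
  b = factor(toupper(as.character(a))),
  c = factor(sample(c("p", "q"), 200, replace = TRUE))
)

result <- drop_correlated_categorical(toy)
print(names(result))
```

Because b carries exactly the same information as a, it is the column dropped here, while a and c are kept. The greedy order means the first column of a correlated pair always survives, which may or may not be what you want. For columns that are already one-hot encoded (0/1), the table is 2x2 and Cramér's V reduces to the absolute value of the ordinary Pearson correlation between the two indicators.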
