
Support feature importance / variable selection #26

Open
JakeColtman opened this issue Oct 13, 2018 · 1 comment

Comments

@JakeColtman (Owner)

In many real-world use cases, it's important to be able to identify truly important features.

Implementing some of the approaches from https://repository.upenn.edu/cgi/viewcontent.cgi?article=1555&context=statistics_papers seems like a good starting point.

A side constraint is that the solution should scale to large datasets, which might pose a problem for the permutation approach. It could be useful to offer two modes: a fully principled one, and a rough-and-ready one for large datasets.
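As a rough sketch of what the two modes could look like, here is a minimal model-agnostic permutation-importance routine. All names (`permutation_importance`, `max_rows`, the `predict`/`score` callables) are illustrative, not an existing API; the `max_rows` subsampling stands in for the "rough and ready" mode.

```python
# Hypothetical sketch: permutation importance for any fitted model that
# exposes a `predict` callable. Not a reference implementation.
import numpy as np

def permutation_importance(predict, X, y, score, n_repeats=5,
                           max_rows=None, rng=None):
    """Mean drop in `score` when each column is shuffled.

    `score(y, preds)` should return a higher-is-better value (e.g. R^2 or
    negative MSE). `max_rows`, if set, scores on a random subsample so the
    cost stays manageable on large datasets (the rough-and-ready mode).
    """
    rng = np.random.default_rng(rng)
    if max_rows is not None and max_rows < len(X):
        idx = rng.choice(len(X), size=max_rows, replace=False)
        X, y = X[idx], y[idx]
    baseline = score(y, predict(X))
    importances = np.zeros(X.shape[1])
    for j in range(X.shape[1]):
        drops = []
        for _ in range(n_repeats):
            X_perm = X.copy()
            # Shuffle column j to break its association with y.
            X_perm[:, j] = rng.permutation(X_perm[:, j])
            drops.append(baseline - score(y, predict(X_perm)))
        importances[j] = np.mean(drops)
    return importances
```

Because it only touches `predict`, the same routine would apply unchanged to RF implementations from other libraries, e.g. `permutation_importance(model.predict, X, y, score)` for a fitted scikit-learn forest.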

@JakeColtman (Owner, Author)

Given the claims in the paper, it would be interesting for the solution to be general enough that it could be applied to implementations of models like RF in other libraries.
