Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Node feature selection #1

Open
cshjin opened this issue Jun 6, 2022 · 5 comments
Open

Node feature selection #1

cshjin opened this issue Jun 6, 2022 · 5 comments
Assignees
Labels
documentation Improvements or additions to documentation preprocess

Comments

@cshjin
Copy link
Contributor

cshjin commented Jun 6, 2022

  • node feature descriptions
attr type comment
type categorical auxiliary / compute / transfer
is_clustered int
ready timestamp
pre_script_start timestamp
pre_script_end timestamp
submit timestamp
execute_start timestamp
execute_end timestamp
post_script_start timestamp
post_script_end timestamp
wms_delay float = submit - ready
pre_script_delay float = pre_script_end - pre_script_start
queue_delay float = execute_start - submit
runtime float = execute_end - execute_start
post_script_delay float = post_script_end - post_script_start
stage_in_delay float
stage_in_bytes float
stage_out_delay float
stage_out_bytes float
kickstart_user categorical panorama
kickstart_site categorical local / condorpool
kickstart_hostname categorical master / worker-*
kickstart_transformations str
kickstart_executables str
kickstart_executables_argv str
kickstart_executables_cpu_time float
kickstart_status int
kickstart_executables_exitcode int
  • exclude timestamps
  • exclude long string
  • handle missing values
@cshjin cshjin added the documentation Improvements or additions to documentation label Jun 6, 2022
@cshjin
Copy link
Contributor Author

cshjin commented Sep 9, 2022

@cshjin cshjin closed this as completed Sep 9, 2022
@cshjin cshjin reopened this Feb 2, 2023
@cshjin
Copy link
Contributor Author

cshjin commented Feb 2, 2023

  • Normalization
    • MinMaxScalar vs StandardScalar
  • Feature Selection
    • Auto feature selection from model

@cshjin
Copy link
Contributor Author

cshjin commented Feb 21, 2023

@cshjin cshjin self-assigned this Feb 21, 2023
@cshjin
Copy link
Contributor Author

cshjin commented Feb 24, 2023

  • check out new features in montage and predict_future_sales

@cshjin
Copy link
Contributor Author

cshjin commented Mar 10, 2023

  • fit_transform on train and transform on test
  • remove the normalization in dataset.py

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation preprocess
Projects
None yet
Development

No branches or pull requests

1 participant