Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Autodetection #8

Open
zyronix opened this issue Oct 21, 2021 · 0 comments
Open

Autodetection #8

zyronix opened this issue Oct 21, 2021 · 0 comments
Labels
enhancement New feature or request

Comments

@zyronix
Copy link
Collaborator

zyronix commented Oct 21, 2021

Demeuk gets more and more options, for newcomers I want to have a simple feature which scan a document and prints out the cleaning options.

Maybe you want to jump to 0% - 10% - 20% etc and scan 1% there. This will result in scanning 10% of the file, but should be sufficient.

Goal: scanning of a very large text files (lets say 10GB) should take max 1 min. For each module hit a small snippet should be given of which lines were hit.

So for example:
`
Check_hashes; found hashes; examples;
$h$7/uhfibmxg83yq6y1rh5y9wjee13kh.
$6$/fasjdfsadj$safjasdfasjdfasdjf/asdfsadfasdfasdfas/fadsfasdfa

Check_encoding; encoding decoding using 'utf-8'; examples

Check_controlchar; found control chars; example
\x07

Scanning done, run demeuk with:
demeuk.py -i -o --check-controlchar --check-hash --check-encoding
`
Also, some changing some default behavior:

  • Disable all options by default
  • Always use all cores (-j all)
@zyronix zyronix added the enhancement New feature or request label Oct 21, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant