-
Notifications
You must be signed in to change notification settings - Fork 702
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feature COMMANDLOG to record slow execution and large request/reply #1294
base: unstable
Are you sure you want to change the base?
Conversation
Signed-off-by: zhaozhao.zz <[email protected]>
Signed-off-by: zhaozhao.zz <[email protected]>
Signed-off-by: zhaozhao.zz <[email protected]>
Signed-off-by: zhaozhao.zz <[email protected]>
Signed-off-by: zhaozhao.zz <[email protected]>
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## unstable #1294 +/- ##
============================================
+ Coverage 70.81% 70.93% +0.12%
============================================
Files 121 121
Lines 65132 65162 +30
============================================
+ Hits 46121 46225 +104
+ Misses 19011 18937 -74
|
Signed-off-by: zhaozhao.zz <[email protected]>
Signed-off-by: zhaozhao.zz <[email protected]>
Signed-off-by: zhaozhao.zz <[email protected]>
Signed-off-by: zhaozhao.zz <[email protected]>
Just for Note: Let's first refresh the memory in pr #336, the last comment conclusion is: After a core team meeting, we decided adding a new command COMMANDLOG with subcommands HEAVYTRAFFIC and SLOW, and then slowlog.c can be renamed to a common commandlog.c. #336 (comment) |
From my understanding, for the command: commandlog get should be: commandlog len should be: commandlog reset should be: Can you describe the terms heavytraffic-input and heavytraffic-output in the json file (argument part) because I can only know And I think the existing slowlog commands should be deprecated? If yes, I think you should update the related json files as well. |
Signed-off-by: zhaozhao.zz <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, approved
@@ -4,6 +4,11 @@ | |||
"complexity": "Depends on subcommand.", | |||
"group": "server", | |||
"since": "2.2.12", | |||
"arity": -2 | |||
"arity": -2, | |||
"deprecated_since": "8.1.0", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
For compatibility with other databases, I think we should be extremely careful about deprecating commands. Some users are very careful to not use deprecated commands. I remember when some product team were working on upgrading to Valkey and reading all the release notes of the 7.x versions, they noticed that CLUSTER SLOTS is deprecated, it almost caused them a lot of extra work, until I explained to them that it is no longer deprecated.
We can document that it is an alias of COMMANDLOG, but we don't have to mark it as deprecated.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
seems we need a "not-recommended" field.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We can just describe it in the docs in text?
Just curious, why mark as not recommended? Is there a reason to make users move away from the slowlog commands?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
From my understanding, the new command 'COMMANDLOG get count slow | heavytraffic-input | heavytraffic-output.' is encouraged in new version, thus the command 'Slowlog' is not suggested to be used anymore.
I think it makes no sense that 2 commands with the same weight implement the same function in a system, they are duplicated.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
OK, so there are pros and cons:
- Avoid duplicate/alias commands. Of course this is good to avoid.
- Keep compatibility with other systems (forks and managed databases) and tools written for them. This is a good thing too.
You say keep compatibility with other forks and tools makes no sense? Or you say we keep SLOWLOG for backward compatibility but it's recommended to use COMMANDLOG instead?
This is fine. I'm just concerned that if we say "deprecated", users will think that we want to remove it and diverge from redis and other forks. This is a bad communication to the community IMO.
I have seen some comments that people are worried that Valkey will diverge from open source Redis versions. If Redis and Valkey diverges, I hope we can safely say it is Redis fault, not our fault. Did you see the recent discussion about the redis-rs
Rust library?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
OK, so there are pros and cons:
- Avoid duplicate/alias commands. Of course this is good to avoid.
- Keep compatibility with other systems (forks and managed databases) and tools written for them. This is a good thing too.
You say keep compatibility with other forks and tools makes no sense? Or you say we keep SLOWLOG for backward compatibility but it's recommended to use COMMANDLOG instead?
First all, I do not express making no sense to compatible with other forks and tools. I think Valkey should try the best compatible with others. And I would suggest in the new version, user had better choose the COMMANDLOG command instead of SLOWLOG command.
This is fine. I'm just concerned that if we say "deprecated", users will think that we want to remove it and diverge from redis and other forks. This is a bad communication to the community IMO.
Of course, from my personal experience, if I see API noted deprecated, I would like to avoid to use them. Maybe we can find the other words to encourage to use COMMANDLOG command
I have seen some comments that people are worried that Valkey will diverge from open source Redis versions. If Redis and Valkey diverges, I hope we can safely say it is Redis fault, not our fault. Did you see the recent discussion about the
redis-rs
Rust library?
Yes, I just saw the lots of comments from redis-rs discussion yesterday. But for deprecating an existing command in Redis OSS, I think we could find a better solution to let our user know to deal with it.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
First all, I do not express making no sense to compatible with other forks and tools.
Sorry! I know you don't.
I think we want the same thing. I just wanted to highlight that there are pros and cons. Maybe I'm overrreacting. Maybe deprecated is right here.
Whatever we call it, deprecated or not recommended, we can describe in the text that there is no plan to remove the slowlog command. It can make worried people less worried maybe.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
First all, I do not express making no sense to compatible with other forks and tools.
Sorry! I know you don't.
Do not worry, I know you ^_^
I think we want the same thing. I just wanted to highlight that there are pros and cons. Maybe I'm overrreacting. Maybe deprecated is right here.
Whatever we call it, deprecated or not recommended, we can describe in the text that there is no plan to remove the slowlog command. It can make worried people less worried maybe.
In fact, 'not recommended' is much properly than deprecated if we can add it in json file. @soloestoy can we? haha,
I never did it before
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ok, "not-recommended" or "not-recommended-since" field is not bad, haha. We can keep replaced-by too.
It has no effect in any tools, but we can use it for the website. We just need to implement some code for it in website repo.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm ok with "not-recommended", I think we can open another PR to implement and discuss which commands should be changed from "deprecated" to "no-recommended".
Signed-off-by: zhaozhao.zz <[email protected]>
Signed-off-by: zhaozhao.zz <[email protected]>
@valkey-io/core-team Let's make a TSC decision on the command syntax and configs as described in the top comment. Approve or vote 👍 or 👎. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Mostly LGTM, still agree with high level direction.
Signed-off-by: zhaozhao.zz <[email protected]>
Co-authored-by: Madelyn Olson <[email protected]> Signed-off-by: zhaozhao.zz <[email protected]>
Signed-off-by: zhaozhao.zz <[email protected]>
Signed-off-by: zhaozhao.zz <[email protected]>
Signed-off-by: zhaozhao.zz <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
overall LGTM, didn't review the code carefully in depth though.
Signed-off-by: zhaozhao.zz <[email protected]>
src/commandlog.c
Outdated
/* Push a new entry into the command log. | ||
* This function will make sure to trim the command log accordingly to the | ||
* configured max length. */ | ||
void commandlogPushEntryIfNeeded(client *c, robj **argv, int argc, long long value, int type) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
once commandlogPushCurrentCommand
is moved into this file, I think it would be beneficial to inline commandlogPushEntryIfNeeded
to help reduce the likelihood of branching since we are now testing 3 different thresholds.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we should trust the compiler to do the inlining. As long we have o3 enabled it may choose to inline functions, we don't need to mark it explicitly.
long long slowlog_entry_id; /* SLOWLOG current entry ID */ | ||
long long slowlog_log_slower_than; /* SLOWLOG time limit (to get logged) */ | ||
unsigned long slowlog_max_len; /* SLOWLOG max number of items logged */ | ||
commandlog commandlog[COMMANDLOG_TYPE_NUM]; /* Logs of commands. */ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit - we could also consider moving commandlog
out of this gigantic valkeyServer
struct and into a top level global in commandlog.h. I feel it is time to start trimmingvalkeyServer
.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
i'd like to keep it in valkeyServer
, it's easy to let other system to use it, and the prefix server.
can tell us it is global
}; | ||
addReplyHelp(c, help); | ||
} else if (c->argc == 2 && !strcasecmp(c->argv[1]->ptr, "reset")) { | ||
commandlogReset(COMMANDLOG_TYPE_SLOW); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
only the SLOW
list can be reset? Why are the others not supported?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
because this is only for the old slowlog
Signed-off-by: zhaozhao.zz <[email protected]>
Co-authored-by: Ping Xie <[email protected]> Signed-off-by: zhaozhao.zz <[email protected]>
Co-authored-by: Ping Xie <[email protected]> Signed-off-by: zhaozhao.zz <[email protected]>
Co-authored-by: Ping Xie <[email protected]> Signed-off-by: zhaozhao.zz <[email protected]>
Co-authored-by: Ping Xie <[email protected]> Signed-off-by: zhaozhao.zz <[email protected]>
As discussed in PR #336.
We have different types of resources like CPU, memory, network, etc. The
slowlog
can only record commands eat lots of CPU during the processing phase (doesn't include read/write network time), but can not record commands eat too many memory and network. For example:This PR introduces a new command
COMMANDLOG
, to log commands that consume significant network bandwidth, including both input and output. Users can retrieve the results usingCOMMANDLOG get <count> large-request
andCOMMANDLOG get <count> large-reply
, all subcommands forCOMMANDLOG
are:COMMANDLOG HELP
COMMANDLOG GET <count> <slow|large-request|large-reply>
COMMANDLOG LEN <slow|large-request|large-reply>
COMMANDLOG RESET <slow|large-request|large-reply>
And the slowlog is also incorporated into the commandlog.
For each of these three types, additional configs have been added for control:
commandlog-request-larger-than
andcommandlog-large-request-max-len
represent the threshold for large requests(the unit is Bytes) and the maximum number of commands that can be recorded.commandlog-reply-larger-than
andcommandlog-large-reply-max-len
represent the threshold for large replies(the unit is Bytes) and the maximum number of commands that can be recorded.commandlog-execution-slower-than
andcommandlog-slow-execution-max-len
represent the threshold for slow executions(the unit is microseconds) and the maximum number of commands that can be recorded.slowlog-log-slower-than
andslowlog-max-len
are now set as aliases for these two new configs.