Extra attributes for log_msg #2595

bachittle · 2023-01-04T16:45:19Z

This addresses the issue: #617. It is experimental and should be more tested and reviewed before merging into the main branch. Perhaps it could be a separate branch in your repository, or part of v2.x?

spdlog already has a set amount of pre-defined attributes built into the log msg struct, such as the level, time, logger name, thread id, etc. This code simply adds additional parameters that can be passed to a log message that will be appended to the end of the message.

Checklist:

attributes object built into log_msg
endpoint via custom logger methods (push_context, pop_context, clear_context)
printing the attributes is possible
custom patterns can print attributes in whatever manner the user chooses
whitespace scrambling of keys and values
special restriction on keys to prevent whitespace delimeter (perhaps convert to underscore?)
an API that is thread-safe and exception-safe implicitly for users.

using the API:
Refer to example.cpp and its new attribute_example() for a concrete example of attributes in action.

… this to work

…ith pattern_formatter

…d type

bachittle · 2023-01-05T14:27:45Z

build is failing due to lack of C++17 support, I think due to my custom string_view usage. Will try integrating with the built-in string_view_t and see if I can revert back to having C++11 support

gabime · 2023-01-05T14:31:40Z

Yes, I was wondering why the 17 requirement.

…-in string_view_t

bachittle · 2023-01-05T17:37:28Z

I was wondering, for further comment, how we should go about printing key-value pairs?

Currently I am using a logfmt-esque approach as the default, but perhaps the user can specify in the pattern matcher, like for example if they would like to do JSON instead to print pairs in the form of: {"key": "value"}. Either a new pattern would have to be set (logger->set_attr_pattern()), or it could be appended to the end of any given pattern (something like "{ ..., \"%k\": \"%v\", }") but then I'm not sure how the pattern matcher will know what patterns to repeat. I personally think the first option is more feasible.

…ion of gcc

attribute formatting

* init fixed attribute, currently having issues with the factory * using the much simpler api already defined: no factory, just define the sinks yourself * async logger fixed attribute support * fixed missing fixed attributes in default log calls * forgot some spots, all cases were found? * potential fix for illegal vector iterators * kept up to date with attributes branch * cleaner fixed logger api * less function overloads, just add and remove context when needed * fixed example to use new api * added append function for nested context * pushing and popping contexts for nested loggers * fixed error due to implicit conversion * fixed sign conversion error * better attribute example with optional json example Co-authored-by: Bailey Chittle <[email protected]>

bachittle · 2023-01-12T20:24:29Z

I made a attribute stack of sorts so that context can be pushed/pulled when required. This can allow nested context, which would be nice for something like a stack trace. The API is as follows:
logger->push_context(): pushes an attribute_list onto the stack
logger->pop_context(): pops the last attribute_list added to the stack
logger->clear_context(): clears the entire stack and essentially resets logger to default

gabime · 2023-01-12T20:47:22Z

Sounds good👍

lowdesertpunk · 2023-01-13T11:35:15Z

I think having attributes on logger level or on log message level support two different uses cases. The first case is useful for persistent contextual attributes, while the second when doing structured logging in every log message. In both cases the attributes should reach sinks separately from the actual payload to allow converting to JSON, or supplying them to a a different system where attributes are first-class citizens (mostly what the original issue #617 is about).

Contextual attributes

// done once
logger->push_context({"user_id", get_user_id()});

// later
logger->info(msg1);
// ...
logger->info(msg2);

Structured messages

logger->info("backup done", {{"src", backup_path}, {"count", num_backup_items}});

I also did a change (living in a fork) which support the second case. Attributes and attribute names are living on the stack (via initializer_list / span and string_view, while attribute values stored in strings created by fmt::to_string to allow using user defined types. Also planning to extend attribute values to a variant supporting a few basic types so that sinks can handle them appropriately. In case of async logging log_msg_buffer copies stack allocated object as necessary.

Does this sound like something which would be useful to merge upstream? Happy to open a PR later on, but currently there are C++17 dependencies (span and variant) which need to be resolved.

lowdesertpunk · 2023-01-13T11:46:40Z

Hmm, I somehow missed the for of @bobhansen in the original issue which does something very similar as indicated, probably making my changes obsolete.

bachittle · 2023-01-13T14:29:42Z

I originally had overloads as well, but I think the main issue with overloading is just how many spots would have to be overloaded in order to get this to work. This is because there are a lot of helper functions (ex: info(), warn(), etc.), that would all need an extra overload just to pass attributes for the desired use case.

The contextual logger makes things simple, and even possible to just do one message at a time. For example:

logger->push_context({"user_id", get_user_id()});
logger->info(msg1);
logger->pop_context();

Sure, it's a few extra lines, but lessens the amount of overloads to account for every use case. In my opinion it is better to have a minimum viable product which could work for every use case, such as this, and add the overloads if there is a demand, since those are more so optional sugar.

* initial testing of attributes * updating test cases for new api * more testing, fixing bugs found * fixed unused error * errors might be caused due to passing pointers by reference, these simple pointer passes should be by value * my only other idea for getting this to work. will experiment * fixing tests before merge (#4) * my only other idea for getting this to work. will experiment * I think I fixed the thread issue, lets see... * unused lambda capture acting funny * see if its the context that is causing issues * unused warning... again * reverting to basic, see where the issue is in github actions (testing most of it locally) * this works again? * some diff cleanup before squash merging * cleaner search for stop iterator

lowdesertpunk · 2023-01-24T09:07:54Z

The contextual logger makes things simple, and even possible to just do one message at a time.

Possible, although in the current form it's neither exception, nor thread-safe. Thread safety could be solved by thread local variables, but exception safety would need some interface change.

bachittle · 2023-01-24T14:39:41Z

Could you elaborate as to how it's not exception safe? Perhaps with an example? I do understand the thread-local issues and my test cases deal with this accordingly, using thread-local variables as you suggested.

There is a benefit to calling log methods with overloads as that would indeed be thread-safe, it could perhaps be an addition to the codebase as a new pull request, but they both require the same root features I have implemented, just one requires more overloads and is a bit more of a hassle to test. I am also experimenting with using RAII patterns to generate logger contexts as that might have a better chance of being thread-safe.

bachittle · 2023-01-24T15:33:58Z

Maybe I will restrict the library user from using contexts unless they do so in a thread-safe manner by making the public API only for instantiating new logger objects, although this will break some other functionalities that involve using a logger in a global context, like backtracing.

lowdesertpunk · 2023-01-25T08:49:00Z

Could you elaborate as to how it's not exception safe? Perhaps with an example?

logger->push_context({"user_id", get_user_id()});
logger->info("some other formatted message {}", map.at(key));
logger->pop_context();

If key is non-existent the context, meant for a single message, will remain active for further logs. Handling contexts via RAII objects as you indicated would be a possible solution.

bachittle · 2023-01-25T15:02:43Z

I guess depending on the use case someone might be happy to have added context to their error messages, but that seems like a feature that arises from a bug lol. There are a few possible API solutions:

1: nested logger objects

{
auto lg = logger->clone("nested_logger");
lg->push_context({{"user_id", get_user_id()}});
lg>log("message {}", map.at(key));
}

again, a similar solution for the thread-safe option, nested logger objects will follow RAII and on exception will be cleaned up. Loses the global logger context and some features associated with that.

2: logger lambda

logger->context({{"user_id", get_user_id()}}, [&](){
  logger->log("message {}", map.at(key))
});

the lambda would allow for extra code to be appended at the beginning and end (like push/pop).

3: context lifetime object

{
auto ctx = logger::context({{"user_id", get_user_id()}});
logger::info("has context");
}
logger::info("doesn't have context");

the destructor for ctx will pop context.

Either way the easiest way for this to work is to use thread-local logger objects. Both 2 and 3 still suffer from thread-safety due to sharing data. If a logger is shared amongst threads then it needs to do something else, so overloads might be needed for that use case.

pparuzel · 2023-05-29T09:31:03Z

Does this allow for thread-local context and if so, does the thread-local context work with asynchronous logging?

bobhansen · 2023-05-31T22:39:50Z

3: context lifetime object
{
auto ctx = logger::context({{"user_id", get_user_id()}});
logger::info("has context");
}
logger::info("doesn't have context");
the destructor for ctx will pop context.

We used this solution to very good effect. It is can be exception-safe, prevents users from forgetting to pop context, and is fairly ergonomic.

We solved the thread-safety in https://github.com/bobhansen/structured_spdlog by having the contexts stored in a thread-local linked-list of shared_ptr objects (heap allocations! booo!). This was a performance trade; in the first implementation, we had to continuously copy the thread-locals that were at the top of the stack into each async logging message.

We also ran into the need for an API to snapshot contexts and share them between threads. e.g. taking the current thread context and pushing into a thread pool to go with some execution, while being able to preserve the context of the request.

falbrechtskirchinger · 2023-06-08T14:57:01Z

We solved the thread-safety in https://github.com/bobhansen/structured_spdlog by having the contexts stored in a thread-local linked-list of shared_ptr objects (heap allocations! booo!). This was a performance trade; in the first implementation, we had to continuously copy the thread-locals that were at the top of the stack into each async logging message.

I'm in the planning stages of what sounds like a re-implementation of what you've done or maybe a hybrid of yours and this PR.

The shared_ptr covers the case of the thread terminating before the message is logged, right? I had resigned myself to always copying…

bachittle · 2024-04-26T14:53:03Z

closing this as the PR is stale, and it seems like this MDC PR does something very similar to what I was attempting to accomplish: #2907

gabime · 2024-04-26T15:33:44Z

@bachittle Thanks for the effort anyway. The new mdc impl was chosen because it is super simple and doesnt require any breaking changes.

adnsv and others added 13 commits December 6, 2022 09:45

experimenting with attributes

aa4e4c9

some bug fixes to attribute fork to get compilation working

9ab32d7

attributes passed to root API

0ca869e

forgot the things

1eec41f

formats log messages (at least for default)

301ab18

more cleanup of unneeded code, needed to revert cmake standard to get…

07abdc6

… this to work

attribute example for testing purposes

6c16b6c

attributes can simulate structured logging, show example. Now works w…

98958cf

…ith pattern_formatter

multiple kv pairs can be passed via initializer list

2d79f7a

fixed bugs with default formatting

be5cc44

scrambling key and value to escape ascii codes

882cee1

Merge remote-tracking branch 'origin/v1.x' into attributes

f5bde10

more well-defined definition of list instantiation using a pre-define…

b33301d

…d type

bachittle added 2 commits January 5, 2023 10:06

C++11 backwards compat fixes by replacing std::string_view with built…

8516010

…-in string_view_t

more endpoints, removed commas from logfmt

d864706

bachittle and others added 12 commits January 10, 2023 12:41

code breaks when fmt is external, so am using fmts implementation only

59da466

fixed errors caused by -Werror=conversion

8f2d273

experimenting with attribute formatting

7883014

actually for sure fixed the -Werror=conversion, for the specific vers…

7f24394

…ion of gcc

another pedantic error in ci

be3e571

custom pattern formatting is now functional

785c3a5

Merge branch 'attributes' into attr_format

cbdcb3e

fixed reordering error

3aba11d

changed switch statement due to an interesting bug I found

82e4553

default kv pairs

3383207

fixed example to include new pattern formatting

0b48d79

Merge pull request #1 from bachittle/attr_format

26a04ad

attribute formatting

broke the utf8 code by accident

71d2e3d

bachittle and others added 4 commits January 13, 2023 11:27

global contextual logger support

d6ed2f2

Merge branch 'v1.x' into attributes

47076c9

Merge branch 'v1.x' into attributes

6097b84

bachittle added 7 commits January 30, 2023 12:13

exception safety tests

7aa0ced

Merge branch 'v1.x' into p_attributes

7b92a6c

setting the cmake standard to 20 when using std format

6af7ea2

separate std string view from std format

0367289

Merge branch 'v1.x' into seperate_std_string_view

e1afb67

set string_view option instead of error for compat

96d28af

Merge branch 'seperate_std_string_view' into attributes

fffa350

gabime mentioned this pull request May 27, 2023

Custom Asynchronous Loggers #2742

Closed

bachittle closed this Apr 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extra attributes for log_msg #2595

Extra attributes for log_msg #2595

bachittle commented Jan 4, 2023 •

edited

Loading

bachittle commented Jan 5, 2023

gabime commented Jan 5, 2023

bachittle commented Jan 5, 2023

bachittle commented Jan 12, 2023

gabime commented Jan 12, 2023

lowdesertpunk commented Jan 13, 2023

lowdesertpunk commented Jan 13, 2023

bachittle commented Jan 13, 2023 •

edited

Loading

lowdesertpunk commented Jan 24, 2023

bachittle commented Jan 24, 2023

bachittle commented Jan 24, 2023

lowdesertpunk commented Jan 25, 2023 •

edited

Loading

bachittle commented Jan 25, 2023 •

edited

Loading

pparuzel commented May 29, 2023

bobhansen commented May 31, 2023 •

edited

Loading

falbrechtskirchinger commented Jun 8, 2023

bachittle commented Apr 26, 2024

gabime commented Apr 26, 2024

Extra attributes for log_msg #2595

Extra attributes for log_msg #2595

Conversation

bachittle commented Jan 4, 2023 • edited Loading

bachittle commented Jan 5, 2023

gabime commented Jan 5, 2023

bachittle commented Jan 5, 2023

bachittle commented Jan 12, 2023

gabime commented Jan 12, 2023

lowdesertpunk commented Jan 13, 2023

lowdesertpunk commented Jan 13, 2023

bachittle commented Jan 13, 2023 • edited Loading

lowdesertpunk commented Jan 24, 2023

bachittle commented Jan 24, 2023

bachittle commented Jan 24, 2023

lowdesertpunk commented Jan 25, 2023 • edited Loading

bachittle commented Jan 25, 2023 • edited Loading

pparuzel commented May 29, 2023

bobhansen commented May 31, 2023 • edited Loading

falbrechtskirchinger commented Jun 8, 2023

bachittle commented Apr 26, 2024

gabime commented Apr 26, 2024

bachittle commented Jan 4, 2023 •

edited

Loading

bachittle commented Jan 13, 2023 •

edited

Loading

lowdesertpunk commented Jan 25, 2023 •

edited

Loading

bachittle commented Jan 25, 2023 •

edited

Loading

bobhansen commented May 31, 2023 •

edited

Loading