revision for handling dead data #443
pavlis
started this conversation in
Design & Development
Replies: 1 comment 2 replies
-
I think that design sounds good. I think it make sense to explicitly manage the killed data. Note that some would argue that it is an overkill to manage dead data in a database, but I think it makes sense if we want to preserve the history of the whole workflow. With that said, I think in a lot of cases, we don't really care about the kills. I think we should probably implement it as optional so that the killed data is only managed when user explicitly asks so. What do you think? |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
As you know I've been working on cleaning up some redundancy and inefficiencies in the Database class. I encountered some issues in how we currently handle data marked dead. I came across two things I don't think we handle well:
tombstone
subdocument left in elog. One could find dead data that way easily with MongoDB, but it strike me now as a bit of a collision of concept to mix dead records in elog. Indeed all dead data should have an elog entry, but the analogy in the real world to the current situation is having graves randomly around the landscape instead of localized in one place.I have two proposals to handle each of these issues:
tombstones
and bodies need to be buried in special collection. We can and should give it the descriptive namecemetery
. The documents in cemetery are the record the dead. They need only link back to the parent waveform that was killed during processing.Before I go too far down this road I think we need to agree on whether or not this is a prudent choice. The key benefit is putting records of the dead in an easier, more logical organization. Fixing the more esoteric ensemble issue is of secondary importance in my view.
Beta Was this translation helpful? Give feedback.
All reactions