Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature] Implement Hot-Warm-Cold Data Management Architecture #11945

Open
2 of 10 tasks
hanahmily opened this issue Feb 28, 2024 · 3 comments
Open
2 of 10 tasks

[Feature] Implement Hot-Warm-Cold Data Management Architecture #11945

hanahmily opened this issue Feb 28, 2024 · 3 comments
Assignees
Labels
database BanyanDB - SkyWalking native database feature New feature

Comments

@hanahmily
Copy link
Contributor

Search before asking

  • I had searched in the issues and found no similar feature requirement.

Description

As BanyanDB grows and deals with bigger datasets, it's important to manage data efficiently and optimize storage to keep costs down. Right now, BanyanDB doesn't differentiate between "hot" data that gets accessed frequently and "cold" data that's barely touched.

After discussing with @HHoflittlefish777, we propose the implementation of a hot-warm-cold data management architecture within BanyanDB. This architecture would categorize data based on access patterns and move them between different storage tiers:

  • Hot: Data that is frequently accessed and requires fast read/write speeds. This should be stored on the fastest storage media available.
  • Warm: Data that is accessed less frequently but may still need to be readily available. This could be stored on lower-cost, lower-speed media. It applies a force merge to optimize the data files.
  • Cold: Rarely accessed data that can be archived, preventing any changes from being made to it. This should be stored on the cheapest storage media, such as cold HDDs or even offsite/cloud storage.

Contributor: @HHoflittlefish777

Next Steps

  • Discuss the viability and scope of the proposed feature
  • Draft a detailed design document
  • Review and approve the design document
  • Implement the feature
  • Perform thorough testing, including performance and resilience tests
  • Update documentation to reflect the new architecture
  • Create tutorials or guides for users to understand how to work with the new data tiers

Use case

No response

Related issues

No response

Are you willing to submit a pull request to implement this on your own?

  • Yes I am willing to submit a pull request on my own!

Code of Conduct

@hanahmily hanahmily added feature New feature database BanyanDB - SkyWalking native database labels Feb 28, 2024
@sollhui
Copy link

sollhui commented Feb 28, 2024

please assign to me :)

@wu-sheng
Copy link
Member

wu-sheng commented Nov 5, 2024

@hanahmily I reassigned this back to you, as the previous request has no update for over half of a year.

@hanahmily
Copy link
Contributor Author

This feat relies on #12876

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
database BanyanDB - SkyWalking native database feature New feature
Projects
None yet
Development

No branches or pull requests

3 participants