Update docs #3

Merged
merged 1 commit into from
May 20, 2024
2 changes: 2 additions & 0 deletions README.md
@@ -12,6 +12,8 @@ metrics.
SB-OSC is designed to overcome the limitations that existing migration tools face with large-scale tables,
significantly reducing the operational overhead associated with managing large tables.

Please visit our [blog](https://sendbird.com/blog/sb-osc-sendbird-online-schema-change) for more information.

## Takeaways

SB-OSC has its own unique features that differentiate it from existing schema migration tools such as `pt-osc` and `gh-ost`.
11 changes: 11 additions & 0 deletions doc/troubleshooting.md
@@ -0,0 +1,11 @@
# Troubleshooting

This section lists issues that may occur when using SB-OSC, along with their solutions.

### apply_dml_events_validation_batch_size
When setting `apply_dml_events_validation_batch_size`, there are two factors to consider. First, binlog resolution is one second, so if more DML events occur within a single second than the batch size allows, the validation process can hang indefinitely. In this case, increase the batch size.
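
As a rough illustration of the first factor, the sketch below finds the busiest one-second window in a list of DML event timestamps and suggests a batch size above it. The function names and the `headroom` margin are hypothetical and are not part of SB-OSC; the timestamps are assumed to come from your own binlog inspection.

```python
import math
from collections import Counter


def peak_events_per_second(event_timestamps):
    """Return the largest number of events that share one whole-second timestamp."""
    per_second = Counter(int(ts) for ts in event_timestamps)
    return max(per_second.values(), default=0)


def suggested_min_batch_size(event_timestamps, headroom=1.5):
    """Suggest a batch size above the observed per-second peak.

    `headroom` is an illustrative safety margin, not an SB-OSC setting.
    """
    return math.ceil(peak_events_per_second(event_timestamps) * headroom)


if __name__ == "__main__":
    # Hypothetical Unix timestamps of DML events (seconds).
    sample = [100, 100, 100, 101, 102, 102]
    print(peak_events_per_second(sample))
    print(suggested_min_batch_size(sample))
```

A batch size chosen this way only reflects the traffic that was sampled, so it is worth re-checking against peak write periods.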

The second factor is MySQL's `max_allowed_packet`. The apply DML events stage runs a query with an IN clause containing `apply_dml_events_validation_batch_size` PKs. If the size of this query exceeds `max_allowed_packet`, the query will not return properly. In this case, decrease the batch size. You may also need to kill the running queries, since they can hang indefinitely.
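
To get a feel for the upper bound, the following sketch estimates the size of such an IN-clause query and derives the largest batch size that stays under a given `max_allowed_packet`. It assumes integer PKs of at most 20 digits (the width of the largest `BIGINT UNSIGNED` value) and a guessed `fixed_overhead` for the rest of the query text; both are assumptions for illustration, and the helper names are hypothetical rather than SB-OSC functions. The server value itself can be read with `SHOW VARIABLES LIKE 'max_allowed_packet';`.

```python
def estimate_in_query_bytes(batch_size, max_pk_digits=20, fixed_overhead=200):
    """Estimate the byte size of a query whose IN clause holds `batch_size` PKs.

    Each PK is assumed to contribute its digits plus one comma separator.
    `fixed_overhead` is a guess for the SELECT ... WHERE ... IN () text.
    """
    return fixed_overhead + batch_size * (max_pk_digits + 1)


def max_batch_size_for_packet(max_allowed_packet, max_pk_digits=20, fixed_overhead=200):
    """Return the largest batch size whose estimated query fits in the packet limit."""
    usable = max_allowed_packet - fixed_overhead
    return max(0, usable // (max_pk_digits + 1))


if __name__ == "__main__":
    packet_limit = 4 * 1024 * 1024  # example value: 4 MiB
    limit = max_batch_size_for_packet(packet_limit)
    print(limit, estimate_in_query_bytes(limit))
```

The estimate is deliberately pessimistic about PK width, so the real limit is usually somewhat higher; treat the result as a conservative ceiling rather than an exact threshold.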

### Monitoring
SB-OSC uses CPU utilization and `WriteLatency` to determine the optimal `batch_size` and `thread_count`. In some cases, however, other metrics such as `RowLockTime` and `DiskQueueDepth` may correlate more closely with the cluster's overall performance. We are still working out the best combination of metrics to monitor for optimal performance.