Adapt charm to support COS integration (#127)
* Add public interface

* Remove promtail prefix from fcts

* Fix license header

* Merge install and config into start

* Add interface to log metrics

* Add grafana_dashboard lib

* Move logging of event to metrics module

* Introduce charm state

* Update attrs doc

* Implement COS integration

* Remove grafana_dashboard

* Change proxy server settings in test_charm

* Ignore whitelist in pylintrc

* Add missing newlines

* Exclude woke.yaml from license check

* Add final newline to .woke.yaml

* Remove usage of requests.session in metrics

* Pin pydantic more specifically

* Fix type hint/doc string in ProxyConfig

* Pin pydantic

* Rename NotCompleteError

* Remove hardcoding of promtail arch

* Remove hardcoding of promtail arch

* Rename -> download_info

* promtail.start -> promtail.setup

* handle non-happy case first in metrics

* Remove group

* Adapt promtail.yaml.j2

* Introduce constant in test_charm

* Move issue_metrics to RunnerManagerConfig

* Fix promtail.service.j2

* Capture time only when needed

* Narrow the exception catch

* Introduce constants in Promtail

* Retry Promtail health check and raise error

* Use Path.write_bytes in promtail

* set unit status to Blocked for unhealthy promtail

* Fix integration test

Breaking change in latest Pygithub version

* Lint

* Catch RequestException

* Drop Promtail and call Loki directly

* Add unit test for charm state

* Switch to cos_agent integration

* Adapt integration test

* Fix integration test

* Simplify integration test

* Fix issue_event

* Add code to set up logrotate

* Update grafana dashboard

* Cleanup

* Remove status in wait_for_idle

* Small fixes

* Dont keep charm inside state

* Update src-docs

* Dont use mutable default value

* Compute event name on instantiation

* Raise LogrotateSetupError

* Reuse _create_runner func

* Check that metrics log is empty

* Fix docstring in Event

* Lint

* app_no_runner -> app
cbartz authored and yhaliaw committed Oct 11, 2023
1 parent 818eb9e commit 8808652
Showing 33 changed files with 2,277 additions and 75 deletions.
4 changes: 2 additions & 2 deletions .github/workflows/integration_tests.yaml
@@ -13,7 +13,7 @@ jobs:
pre-run-script: scripts/pre-integration-test.sh
provider: lxd
test-tox-env: integration-juju3.1
-modules: '["test_charm_fork_repo", "test_charm_no_runner", "test_charm_scheduled_events", "test_charm_one_runner"]'
+modules: '["test_charm_fork_repo", "test_charm_no_runner", "test_charm_scheduled_events", "test_charm_one_runner", "test_charm_metrics"]'
integration-test-juju2:
name: Integration test
needs: integration-test-juju3
@@ -24,4 +24,4 @@ jobs:
pre-run-script: scripts/pre-integration-test.sh
provider: lxd
test-tox-env: integration-juju2.9
-modules: '["test_charm_fork_repo", "test_charm_no_runner", "test_charm_scheduled_events", "test_charm_one_runner"]'
+modules: '["test_charm_fork_repo", "test_charm_no_runner", "test_charm_scheduled_events", "test_charm_one_runner", "test_charm_metrics"]'
2 changes: 2 additions & 0 deletions .licenserc.yaml
@@ -22,4 +22,6 @@ header:
- 'CODEOWNERS'
- 'icon.svg'
- 'LICENSE'
+- '.pylintrc'
+- '.woke.yaml'
comment: on-failure
2 changes: 2 additions & 0 deletions .pylintrc
@@ -0,0 +1,2 @@
[MAIN]
extension-pkg-whitelist=pydantic # see https://github.com/pydantic/pydantic/issues/1961#issuecomment-759522422
3 changes: 3 additions & 0 deletions .woke.yaml
@@ -0,0 +1,3 @@
ignore_files:
# Ignore pylintrc as it uses non compliant terminology: whitelist
- .pylintrc
28 changes: 28 additions & 0 deletions README.md
@@ -39,6 +39,34 @@ If there are more idle runners than configured, the oldest idle runners are unre

During each time period, every unit will make one or more API calls to GitHub. The interval may need to be adjusted if the number of units is large enough to trigger [Rate Limiting](https://docs.github.com/en/rest/overview/resources-in-the-rest-api#rate-limiting).


## COS
The charm is designed to provide comprehensive metrics and monitoring capabilities for both the Runners and the Charm itself. These metrics are made available through the `cos-agent` integration with the `cos_agent` interface. Additionally, a Grafana Dashboard is included to help visualize these metrics effectively.

### Loki Integration
#### Loki Push API
The charm seamlessly integrates with Loki, a powerful log aggregation system, through the `cos_agent` interface. This integration allows the charm to push various metrics and logs related to the Runners and the Charm itself to a Loki instance. This provides valuable insights into the performance and behavior of your deployment.
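As an illustration of what a push to Loki looks like on the wire, here is a minimal sketch of the JSON body accepted by Loki's `POST /loki/api/v1/push` endpoint. It is not the charm's implementation: the helper `build_loki_push_payload`, the `job` label value, and the event fields are all hypothetical and chosen only for the example.

```python
import json
import time


def build_loki_push_payload(labels, lines, timestamp_ns=None):
    """Build a JSON-serialisable body for Loki's /loki/api/v1/push endpoint."""
    if timestamp_ns is None:
        timestamp_ns = time.time_ns()
    # Each entry is [<Unix epoch in nanoseconds, as a string>, <log line>].
    values = [[str(timestamp_ns), line] for line in lines]
    return {"streams": [{"stream": dict(labels), "values": values}]}


# Hypothetical event; the field names are illustrative, not the charm's schema.
event_line = json.dumps({"event": "runner_installed", "duration": 42})
payload = build_loki_push_payload(
    {"job": "github-runner-metrics"},  # assumed label, not taken from the charm
    [event_line],
    timestamp_ns=1697000000000000000,
)
print(json.dumps(payload, indent=2))
```

Keeping the payload construction separate from the HTTP call makes it straightforward to unit test the event format without a running Loki instance.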

### Grafana Dashboard
To make monitoring even more accessible, the charm comes with a pre-configured Grafana Dashboard. This dashboard is designed to visualize the metrics collected by the charm, making it easier for operators to track the health and performance of the system.

#### Automated Dashboard Deployment
You can automate the deployment of the Grafana Dashboard using the [cos-configuration-k8s](https://charmhub.io/cos-configuration-k8s) charm. This simplifies the setup process and ensures that your monitoring infrastructure is ready to go with minimal manual intervention.

#### Configuration Options
To enable the automated deployment of the Grafana Dashboard, you can provide the following configuration options when deploying the `cos-configuration-k8s` charm:

```ini
git_repo=https://github.com/canonical/github-runner-operator
git_branch=main
git_depth=1
grafana_dashboards_path=src/grafana_dashboard_metrics
```
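One way to pass these options is with `--config key=value` flags on `juju deploy`. The sketch below only assembles and prints such a command line; it assumes the charm in question is the `cos-configuration-k8s` charm that the Charmhub link points to, and the helper `juju_deploy_command` is a hypothetical name used for illustration.

```python
import shlex

# Option values as listed in the README, with a single https:// scheme.
COS_CONFIG = {
    "git_repo": "https://github.com/canonical/github-runner-operator",
    "git_branch": "main",
    "git_depth": "1",
    "grafana_dashboards_path": "src/grafana_dashboard_metrics",
}


def juju_deploy_command(charm, config):
    """Return the argv list for `juju deploy <charm> --config key=value ...`."""
    argv = ["juju", "deploy", charm]
    for key, value in config.items():
        argv += ["--config", f"{key}={value}"]
    return argv


command = juju_deploy_command("cos-configuration-k8s", COS_CONFIG)
# Print a shell-safe command line; nothing is executed here.
print(shlex.join(command))
```

The printed command can be reviewed and then run against a Kubernetes model where the COS stack is deployed.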





## Development

This charm uses black and flake8 for formatting. Both run with the lint stage of tox.