Name		Name	Last commit message	Last commit date
parent directory ..
src		src
static		static
test		test
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json
requirements.txt		requirements.txt
serverless.yml		serverless.yml

README.md

AWS Auto Cleanup App

The Auto Cleanup App consists of several serverless AWS resource that all work together to find, and delete AWS resources that may have been abandoned. The architecture diagram below illustrates the various services and their relationships with one another.

Deployment

Install AWS CLI
```
pip install awscli
```
Quickly Configuring the AWS CLI
- Auto Cleanup should be deployed by a user with administrative privileges.
Install Serverless Framework
```
npm install -g serverless
```

Download

git clone https://github.com/servian/aws-auto-cleanup.git

Change directory
```
cd aws-auto-cleanup/app/
```
Install dependencies
```
npm install
```

Deploy

npm run deploy -- [--region] [--aws-profile]

Run
```
npm run invoke -- [--region] [--aws-profile]
```
- Settings and Whitelist tables will be populated at the start of the first run.
- Dry run mode is automatically activated by default.

Inspect

npm run logs -- [--region] [--aws-profile]

Removal

Change directory
```
cd aws-auto-cleanup/app/
```
Remove
```
npm run remove -- [--region] [--aws-profile]
```
- S3 buckets provisioned by Serverless will not be deleted through this process. To finalise removal, please delete the athena-results and execution-log buckets manually.

Features

Whitelist

The whitelist table (DynamoDB) maintains a record of all AWS resources that have been whitelisted (and therefore preserved). During the execution of Auto Cleanup, the scanned resources will be checked against the whitelist. If the resource exists within the whitelist table, it will not be deleted.

The whitelist table adheres to the following schema:

Column	Format	Description
resource_id	`<service>:<resource>:<id>`	Unique identifier of the resource. This is a custom format base on the service (e.g., `ec2`, `s3`), the resource (e.g., `instance`, `bucket`) and `id`.
expiration	Epoch timestamp	Epoch timestamp when the record will be removed from the settings table
comment	Text field	Comment field describing the resource and why it has been whitelisted
owner	Text field	Email address or name of the resource owner in case they need to be contacted regarding the whitelisting

Resource ID

The resource_id field within the whitelist table holds a unique identifier for the whitelisted AWS resource. Due to some limitations with AWS, ARNs are not a viable unique identifier for all AWS resources and therefore an alternative was identifier was created.

The below table indicates AWS resources that are supported by Auto Cleanup along with indications and examples of resource_id values for each resource.

Resource	ID Attribute	Example Value
Airflow Environments	Environment Name	`airflow:environment:environment_name`
Amplify Apps	App Name	`amplify:app:app_name`
CloudFormation Stacks	Stack Name	`cloudformation:stack:stack_name`
CloudWatch Log Groups	Log Group Name	`cloudwatch:log_group:log_group_name`
DynamoDB Tables	Table Name	`dynamodb:table:table_name`
EC2 Elastic IPs	Allocation ID	`ec2:address:allocation_id`
EC2 Images	Image ID	`ec2:image:image_id`
EC2 Instances	Instance ID	`ec2:instance:instance_id`
EC2 Security Groups	Group ID	`ec2:security_group:group_id`
EC2 Snapshots	Snapshot ID	`ec2:snapshot:snapshot_id`
EC2 Volumes	Volume ID	`ec2:volume:volume_id`
ECR Images	Image Digest	`ecr:image:image_digest`
ECR Repositories	Repository Name	`ecr:repository:repository_name`
ECS Clusters	Cluster Name	`ecs:cluster:cluster_name`
ECS Services	Service Name	`ecs:service:service_name`
EFS File Systems	File System ID	`efs:file_system:file_system_id`
EKS Clusters	Cluster Name	`eks:cluster:cluster_name`
EKS Fargate Profiles	Fargate Profile Name	`eks:fargate_profile:fargate_profile_name`
EKS Node Groups	Node Group Name	`eks:node_group:node_group_name`
Elastic Beanstalk Applications	Application Name	`elasticbeanstalk:application:application_name`
ElastiCache Clusters	Cache Cluster ID	`elasticache:cluster:cache_cluster_id`
ElastiCache Replication Groups	Replication Group ID	`elasticache:replication_group:replication_group_id`
Elasticsearch Service	Domain Name	`elasticsearch:domain:domain_name`
ELB Load Balancers	Load Balancer Name	`elb:load_balancer:load_balancer_name`
EMR Clusters	ID	`emr:cluster:id`
Glue Crawlers	Crawler Name	`glue:crawler:crawler_name`
Glue Databases	Database Name	`glue:database:database_name`
Glue Dev Endpoints	Endpoint Name	`glue:dev_endpoint:endpoint_name`
IAM Access Keys	Access Key ID	`iam:access_key:access_key_id`
IAM Policies	Policy Name	`iam:policy:policy_name`
IAM Roles	Role Name	`iam:role:role_name`
IAM Users	User Name	`iam:user:user_name`
Kafka Clusters	Cluster Name	`kafka:cluster:cluster_name`
Kinesis Streams	Stream Name	`kinesis:stream:stream_name`
Lambda Functions	Function Name	`lambda:function:function_name`
RDS Instances	DB Instance Identifier	`rds:instance:db_instance_identifier`
RDS Snapshots	DB Snapshot Name	`rds:snapshot:db_snapshot_name`
Redshift Instances	Cluster Identifier	`redshift:instance:cluster_identifier`
Redshift Snapshots	Snapshot Identifier	`redshift:snapshot:snapshot_identifier`
S3 Buckets	Bucket Name	`s3:bucket:bucket_name`
SageMaker Apps	App Name	`sagemaker:app:app_name`
SageMaker Endpoints	Endpoint Name	`sagemaker:endpoint:endpoint_name`
SageMaker Notebook Instances	Notebook Instance Name	`sagemaker:notebook_instance:notebook_instance_name`

Note: Resources that are a part of a CloudFormation Stack will be automatically whitelisted at run time to prevent the need to whitelist the CloudFormation Stack and each resource the Stack provisions.

Expiration

The expiration field within the whitelist table is marked as a TTL field. This means that when the current timestamp exceeds the value within the expiration field, DynamoDB will remove the record from the table.

This has been designed in such a way as to prevent AWS resources from being whitelisted indefinitely.

Settings

The settings table contains several key-value pairs records including version, general, services, and regions.

Version

The version number of the settings. If the version number within the app/src/data/auto-cleanup-settings.json file is greater than in the database, the settings will be refreshed.

Key	Value
Version	123

General

General settings.

Key	Value
Dry Run	True

Services

Service-specific settings indicating the supported AWS services, resources, and their lifespan.

Service	Resource Type	Clean	TTL	Comment
Airflow	Environments 🆕	True	7
Amplify	Apps	True	7
CloudFormation	Stacks	True	7	Deletes Stack if not whitelisted or not part of a whitelistd nested Stack.
CloudWatch	Log Groups	True	30
DynamoDB	Tables	True	7
EC2	Addresses	True	N/A	Deletes Address if not associated with an EC2 instance.
EC2	Images	True	7
EC2	Instances	True	7
EC2	Security Groups	True	N/A	Deletes Security Group if not associated with an EC2 instance.
EC2	Snapshots	True	7
EC2	Volumes	True	7	Volumes that are attached to an EC2 Instance when it launched will be deleted if the EC2 Instance is terminated. This is an AWS behavior and not something that can be controlled.
ECR	Images	True	7
ECR	Repositories	True	7	Deletes Repository if no Images exist.
ECS	Clusters	True	N/A	Deletes Cluster if no running Services or Tasks.
ECS	Services	True	7
EFS	Clusters	True	7
EFS	Fargate Profiles	True	7
EFS	File Systems	True	7
EFS	Node Groups	True	7
EKS	Clusters	True	7	Deletes Cluster if no Fargate Profiles or Node Groups exist.
EKS	Fargate Profiles	True	7
EKS	Node Groups	True	7
ElastCache	Clusters	True	7
ElastCache	Replication Groups	True	7
Elastic Beanstalk	Applications	True	7
Elasticsearch Service	Domain Name	True	7
ELB	Load Balancers	True	7
EMR	Clusters	True	7
Glue	Crawlers	True	7
Glue	Databases	True	30
Glue	Dev Endpoints	True	7
IAM	Access Keys 🆕	True	30
IAM	Policies	True	30
IAM	Roles	True	30
IAM	Users 🆕	True	30
Kafka	Clusters	True	7
Kinesis	Streams	True	7
Lambda	Functions	True	30
RDS	Instances	True	7
RDS	Snapshots	True	7
Redshift	Clusters	True	7
Redshift	Snapshots	True	7
S3	Buckets	True	30
SageMaker	Apps 🆕	True	7
SageMaker	Endpoints	True	7
SageMaker	Notebook Instances	True	7

Regions

Region-specific settings indicating the regions to be cleaned.

Region	Clean
af-south-1	True
ap-east-1	True
ap-northeast-1	True
ap-northeast-2	True
ap-northeast-3 *	False
ap-south-1	True
ap-southeast-1	True
ap-southeast-2	True
ca-central-1	True
cn-north-1 *	False
cn-northwest-1 *	False
eu-central-1	True
eu-north-1	True
eu-south-1	True
eu-west-1	True
eu-west-2	True
eu-west-3	True
me-south-1	True
sa-east-1	True
us-east-1	True
us-east-2	True
us-west-1	True
us-west-2	True

Note: Some regions are deactivated by default as they required special access from AWS.

Execution Log

Post every Auto Cleanup run, an execution log is generated and stored as a flat CSV file within the execution-log S3 Bucket. The execution log files adhere to the following schema.

Column	Format	Description
platform	string	Always `AWS`
region	string	Region (e.g. `ap-southeast-2`)
service	string	Service (e.g., `s3`)
resource	string	Resource (e.g., `bucket`)
resource_id	string	Resource ID (e.g., Instance ID)
action	string	Action taken on the resource (e.g., `DELETE`, `DELETE - NOT CONFIRMED`, `SKIP - TTL`, `SKIP - WHITELIST`, `SKIP - IN USE`, OR `ERROR`)
timestamp	timestamp	Timestamp when action was performed
dry_run_flag	boolean	Dry run activated
execution_id	string	Lambda execution ID

Athena

To enable analytical access to the generated execution logs, a Glue Database and Glue Table are provisioned based on the S3 Bucket and file schema of the execution log. This database and table can be accessed directly from within Athena enabling the logs to be queried using SQL.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

app

app

README.md

AWS Auto Cleanup App

Table of contents

Deployment

Removal

Features

Whitelist

Resource ID

Expiration

Settings

Version

General

Services

Regions

Execution Log

Athena

Files

app

Directory actions

More options

Directory actions

More options

Latest commit

History

app

Folders and files

parent directory

README.md

AWS Auto Cleanup App

Table of contents

Deployment

Removal

Features

Whitelist

Resource ID

Expiration

Settings

Version

General

Services

Regions

Execution Log

Athena