Rin is a Redshift data Importer by SQS messaging.
- (Someone) creates a S3 object.
- S3 event notifications will send to a message to SQS.
- Rin will fetch messages from SQS, and publish a "COPY" query to Redshift.
Configuring Amazon S3 Event Notifications.
- Create SQS queue.
- Attach SQS access policy to the queue. Example Walkthrough 1:
- Enable Event Notifications on a S3 bucket.
- Run
rin
process with configuration for using the SQS and S3.
queue_name: my_queue_name # SQS queue name
credentials:
aws_access_key_id: AAA
aws_secret_access_key: SSS
aws_region: ap-northeast-1
redshift:
host: localhost
port: 5439
dbname: test
user: test_user
password: test_pass
schema: public
s3:
bucket: test.bucket.test
region: ap-northeast-1
sql_option: "JSON 'auto' GZIP" # COPY SQL option
# define import target mappings
targets:
- redshift:
table: foo
s3:
key_prefix: test/foo
- redshift:
schema: xxx
table: bar
s3:
key_prefix: test/bar
- redshift:
schema: $1 # expand by key_regexp captured value.
table: $2
s3:
key_regexp: test/schema-([a-z]+)/table-([a-z]+)/
- redshift:
host: redshift.example.com # override default section in this target
port: 5439
dbname: example
user: example_user
password: example_pass
schema: public
table: example
s3:
bucket: redshift.example.com
region: ap-northeast-1
key_prefix: logs/example/
sql_option: "CSV DELIMITER ',' ESCAPE"
Rin waits new SQS messages and processing it continually.
$ rin -config config.yaml [-debug]
Rin process new SQS messages and exit.
$ rin -config config.yaml -batch [-debug]