Skip to main content
waffle.svg
Domo Knowledge Base

Amazon Athena Unload Connector

Version 3

 

Intro

Amazon Athena is a service that makes it easy to query big data from Amazon S3 Bucket. With the Amazon Athena Unload Data Connector, you can get constant access to your data right from your Domo instance. This connector uses the SDK to unload the query into the S3 Bucket using the unique object id and parses the S3 object. Use Domo's Amazon Athena Unload connector to instantly connect, visualize, and get insights from your big data using Amazon Athena Unload with S3 query processing. To learn more about the Amazon Athena API, visit their page (https://aws.amazon.com/documentation/athena/).

You connect to your Amazon Athena account in the Data Center. This topic discusses the fields and menus that are specific to the Amazon Athena Unload connector user interface. General information for adding DataSets, setting update schedules, and editing DataSet information is discussed in Adding a DataSet Using a Data Connector.

Prerequisites

To connect to your Amazon Athena account and create a DataSet, you must have the following:

  • Your AWS access key

  • Your AWS secret key

  • The Amazon S3 location where your query output is written. From the Athena web console, this can be found by going to Settings. Example: s3://aws-athena-query-results-1234-us-west-2/

  • Your AWS region 

For information about obtaining AWS credentials, visit http://docs.aws.amazon.com/general/l...red-types.html.

Required Permission Policies

To gain the access to AWS services and resources, such as Athena and the Amazon S3 buckets, provide the JDBC driver credentials to your application. If you are using the JDBC driver, ensure that the IAM permissions policy includes all actions listed in AmazonAthenaFullAccess Managed Policy.

The AmazonAthenaFullAccess managed policy grants full access to Athena.

Managed policy contents change, so the policy shown here may be out-of-date. Check the IAM console for the most up-to-date policy.

{
    "Version": "2012-10-17",
    "Statement": [
        {
            "Effect": "Allow",
            "Action": [
                "athena:*"
            ],
            "Resource": [
                "*"
            ]
        },
        {
            "Effect": "Allow",
            "Action": [
                "glue:CreateDatabase",
                "glue:DeleteDatabase",
                "glue:GetDatabase",
                "glue:GetDatabases",
                "glue:UpdateDatabase",
                "glue:CreateTable",
                "glue:DeleteTable",
                "glue:BatchDeleteTable",
                "glue:UpdateTable",
                "glue:GetTable",
                "glue:GetTables",
                "glue:BatchCreatePartition",
                "glue:CreatePartition",
                "glue:DeletePartition",
                "glue:BatchDeletePartition",
                "glue:UpdatePartition",
                "glue:GetPartition",
                "glue:GetPartitions",
                "glue:BatchGetPartition"
            ],
            "Resource": [
                "*"
            ]
        },
        {
            "Effect": "Allow",
            "Action": [
                "s3:GetBucketLocation",
                "s3:GetObject",
                "s3:ListBucket",
                "s3:ListBucketMultipartUploads",
                "s3:ListMultipartUploadParts",
                "s3:AbortMultipartUpload",
                "s3:CreateBucket",
                "s3:PutObject"
            ],
            "Resource": [
                "arn:aws:s3:::aws-athena-query-results-*"
            ]
        },
        {
            "Effect": "Allow",
            "Action": [
                "s3:GetObject",
                "s3:ListBucket"
            ],
            "Resource": [
                "arn:aws:s3:::athena-examples*"
            ]
        },
        {
            "Effect": "Allow",
            "Action": [
                "s3:ListBucket",
                "s3:GetBucketLocation",
                "s3:ListAllMyBuckets"
            ],
            "Resource": [
                "*"
            ]
        },
        {
            "Effect": "Allow",
            "Action": [
                "sns:ListTopics",
                "sns:GetTopicAttributes"
            ],
            "Resource": [
                "*"
            ]
        },
        {
            "Effect": "Allow",
            "Action": [
                "cloudwatch:PutMetricAlarm",
                "cloudwatch:DescribeAlarms",
                "cloudwatch:DeleteAlarms"
            ],
            "Resource": [
                "*"
            ]
        }
    ]
}

Connecting to Your Amazon Athena Account

This section enumerates the options in the Credentials and Details panes in the Amazon Athena Unload Connector page. The components of the other panes in this page, Scheduling and Name & Describe Your DataSet, are universal across most connector types and are discussed in greater length in Adding a DataSet Using a Data Connector.

Credentials Pane

This pane contains fields for entering credentials to connect to your Amazon Athena account. The following table describes what is needed for each field:  

Field

Description

Access Key Enter your AWS access key.
Secret Key Enter your AWS secret key.
S3 Staging Directory Enter the Amazon S3 location where your query output is written. From the Athena web console, this can be found by going to Settings. Example: s3://aws-athena-query-results-1234-us-west-2/
Region Select your AWS region.
Workgroup Enter the workgroup.

Once you have entered valid Amazon Athena credentials, you can use the same account any time you go to create a new Amazon Athena DataSet. You can manage connector accounts in the Accounts tab in the Data Center. For more information about this tab, see Managing User Accounts for Connectors.

Details Pane

This pane contains a number of options for entering your query and configuring the report.

Menu

Description

Report

Select the Amazon Athena Unload report you want to run. Currently only one report type is available.

Query

Lets you query data from a specified S3 database and table.

Database Name

Select the database you want to query.

Query

Enter the SQL query you want to execute (e.g. select * from table_name).

Other Panes

For information about the remaining sections of the connector interface, including how to configure scheduling, retry, and update options, see Adding a DataSet Using a Data Connector.