Amazon Redshift

Amazon Redshift is a fast, fully managed, petabyte-scale data warehouse solution that makes it simple and cost-effective to efficiently analyze all your data using your existing business intelligence tools. You can start small for just $0.25 per hour with no commitments or upfront costs and scale to a petabyte or more for $1,000 per terabyte per year, less than a tenth of most other data warehousing solutions.

Introduction to Amazon Redshift Data Warehousing (3:22)

Learn how to do data warehousing the AWS way with Amazon Redshift. Amazon RedShift is a powerful, fast and fully-managed data warehouse service that works with your existing analytics tools. Watch how, with just a few clicks in the AWS Management Console, you can be set up with a fully functional data warehouse, ready to accept data without learning any new languages. Amazon Redshift easily plugs in with all of the existing BI tools and can be up and running within minutes.

Customer Stories

Try Amazon Redshift for free

Start Free Trial
Learn More

Get 750 free DW2.Large hours per month for 2 months. To start the trial:

Create an AWS account and sign in to the Amazon Redshift Console

Launch an Amazon Redshift cluster and select DW2.Large for Node Type

Report and ingest data for free using our Partner Free Trials

Made the product and user experience 5X faster. »

Queried 200+ TB fast, and at low cost. »

Enabled real-time reporting for marketers on billions of mobile and desktop interactions. »

Handled massive data with speed and efficiency. »

Lowered costs for emergency room staffing. »

Enabled large scale analytics for feature phones worldwide. »

Enabled rapid scalability and saved 60% »

Significantly improved performance and reduced costs »

Made data available immediately for decision making »

Improved business analytics for gaming »

Saw a 25-100X performance gain. »

What's New from Amazon Redshift

Date	Announcement
Nov 04	Amazon Redshift adds Four New Features and Sixteen New SQL Commands and Functions
Jul 01	Amazon Redshift Free Trial and Price Reductions in Asia Pacific	Blog
Jun 29	Amazon Redshift Announces Cross Region Ingestion and Improved Query Functionality

Learn more about what's new from Amazon Web Services »

Features and Benefits

Amazon Redshift delivers fast query performance by using columnar storage technology to improve I/O efficiency and parallelizing queries across multiple nodes. Redshift uses standard PostgreSQL JDBC and ODBC drivers, allowing you to use a wide range of familiar SQL clients. Data load speed scales linearly with cluster size, with integrations to Amazon S3, Amazon DynamoDB, Amazon Elastic MapReduce, Amazon Kinesis or any SSH-enabled host.

We've automated most of the common administrative tasks associated with provisioning, configuring and monitoring a data warehouse. Backups to Amazon S3 are continuous, incremental and automatic. Restores are fast; you can start querying in minutes while your data is spooled down in the background. Enabling disaster recovery across regions takes just a few clicks.

Security is built-in. You can encrypt data at rest and in transit using hardware-accelerated AES-256 and SSL, isolate your clusters using Amazon VPC and even manage your keys using hardware security modules (HSMs). All API calls, connection attempts, queries and changes to the cluster are logged and auditable.

Fast

Optimized for Data Warehousing

Amazon Redshift uses a variety of innovations to obtain very high query performance on datasets ranging in size from a hundred gigabytes to a petabyte or more. It uses columnar storage, data compression, and zone maps to reduce the amount of I/O needed to perform queries. Amazon Redshift has a massively parallel processing (MPP) architecture, parallelizing and distributing SQL operations to take advantage of all available resources. The underlying hardware is designed for high performance data processing, using local attached storage to maximize throughput between the CPUs and drives, and a 10GigE mesh network to maximize throughput between nodes.

Scalable

With a few clicks of the AWS Management Console or a simple API call, you can easily change the number or type of nodes in your data warehouse as your performance or capacity needs change. Amazon Redshift enables you to start with as little as a single 160GB DW2.Large node and scale up all the way to a petabyte or more of compressed user data using 16TB DW1.8XLarge nodes. While resizing, Amazon Redshift places your existing cluster into read-only mode, provisions a new cluster of your chosen size, and then copies data from your old cluster to your new one in parallel. You can continue running queries against your old cluster while the new one is being provisioned. Once your data has been copied to your new cluster, Amazon Redshift will automatically redirect queries to your new cluster and remove the old cluster.

Cheap

No Up-Front Costs

You pay only for the resources you provision. You can choose On-Demand pricing with no up-front costs or long-term commitments, or obtain significantly discounted rates with Reserved Instance pricing. On-Demand pricing starts at just $0.25/hour per 160GB DW2.Large node or $0.85/hour per 2TB DW1.XLarge node. With Reserved Instances, you can lower your effective price to $0.10/hour per DW2.Large node ($5,500/TB/year) or $0.228/hour per DW1.XLarge node ($1,000/TB/year). To see more details, visit the Amazon Redshift Pricing page.

Simple

Get Started in Minutes

With a few clicks in the AWS Management Console or simple API calls, you can create a cluster, specifying its size, underlying node type, and security profile. Amazon Redshift will provision your nodes, configure the connections between them, and secure the cluster. Your data warehouse should be up and running in minutes.

Fully Managed

Amazon Redshift handles all the work needed to manage, monitor, and scale your data warehouse, from monitoring cluster health and taking backups to applying patches and upgrades. You can easily add or remove nodes from your cluster as your performance and capacity needs change. By handling all these time-consuming, labor-intensive tasks, Amazon Redshift frees you up to focus on your data and business.

Fault Tolerant

Amazon Redshift has multiple features that enhance the reliability of your data warehouse cluster. All data written to a node in your cluster is automatically replicated to other nodes within the cluster and all data is continuously backed up to Amazon S3. Amazon Redshift continuously monitors the health of the cluster and automatically re-replicates data from failed drives and replaces nodes as necessary.

Automated Backups

Amazon Redshift’s automated snapshot feature continuously backs up new data on the cluster to Amazon S3. Snapshots are continuous, incremental and automatic. Amazon Redshift stores your snapshots for a user-defined period, which can be from one to thirty-five days. You can take your own snapshots at any time, which leverage all existing system snapshots and are retained until you explicitly delete them. Redshift can also asynchronously replicate your snapshots to S3 in another region for disaster recovery. Once you delete a cluster, your system snapshots are removed but your user snapshots are available until you explicitly delete them.

Fast Restores

You can use any system or user snapshot to restore your cluster using the AWS Management Console or the Amazon Redshift APIs. Your cluster is available as soon as the system metadata has been restored and you can start running queries while user data is spooled down in the background.

Secure

Encryption

With just a couple of parameter settings, you can set up Amazon Redshift to use SSL to secure data in transit and hardware-accelerated AES-256 encryption for data at rest. If you choose to enable encryption of data at rest, all data written to disk will be encrypted as well as any backups. By default, Amazon Redshift takes care of key management but you can choose to manage your keys using your own hardware security modules (HSMs), AWS CloudHSM, or AWS Key Management Service.

Network Isolation

Amazon Redshift enables you to configure firewall rules to control network access to your data warehouse cluster. You can also run Amazon Redshift inside Amazon Virtual Private Cloud (Amazon VPC) to isolate your data warehouse cluster in your own virtual network and connect it to your existing IT infrastructure using industry-standard encrypted IPsec VPN.

Audit and Compliance

Amazon Redshift integrates with AWS CloudTrail to enable you to audit all Redshift API calls. Amazon Redshift also logs all SQL operations, including connection attempts, queries and changes to your database. You can access these logs using SQL queries against system tables or choose to have them downloaded to a secure location on Amazon S3. Amazon Redshift is compliant with SOC1, SOC2, SOC3 and PCI DSS Level 1 requirements. For more details, please visit the AWS Compliance Center.

Compatible

SQL

Amazon Redshift is a SQL data warehouse solution and uses industry standard ODBC and JDBC connections and PostgreSQL drivers. Many popular software vendors are certifying Amazon Redshift with their offerings to enable you to continue to use the tools you do today. See the Amazon Redshift partner page for details.

Integrated

Amazon Redshift is integrated with other AWS services and has built in commands to load data in parallel to each node from Amazon S3, Amazon DynamoDB or your EC2 and on-premise servers using SSH. AWS Data Pipeline and Amazon Kinesis both integrate with Amazon Redshift as a data target.