3/9/2023 0 Comments Metabase athenaIf you’re running the same query every hour or day to power a BI dashboard, and the structure of the underlying data stays consistent, Redshift might be the better choice – especially if you’re willing to invest in infrastructure to ensure consistent performance come hell or high water.Īthena, on the other hand, might be your go-to for an ad-hoc query – most queries will just work, even on semi-structured or unstructured data, and they will return results without extensive tuning. Redshift would be the better choice if you have data coming in from diverse sources and you would like to transform that data, enforce consistent schema and formats, and create a single source that feeds a large number of business intelligence and reporting tools across the organization (see the AWS Athena FAQ ). Redshift is the more natural choice for data warehouse reporting, Athena for ad-hoc queries against S3 storage. However, here are the key factors to consider when deciding between Redshift and Athena: 1. The caveat from above still applies – your mileage may vary, every organization and every architecture is different, and so on. Let’s go a little bit more in-depth with… 4 Questions to decide where to run your analytic workloads Athena makes it is easier to get started and is more flexible in the types of data it can query, but performance is not guaranteed without significant data preparationįrom here it’s a question of how much data you have, what your use case is, and how much you are willing to invest in money and human hours.Redshift is faster and more robust (as long as you’re willing to add compute power), but also more expensive and complicated to manage.Let’s start with the simplest, most basic way to approach the question of where to analyze your cloud data. However, for the purposes of this article, these are the important aspects you need to understand to answer the question of… Redshift or Athena? The super-short answer There’s a metric ton more that can be said about each of these tools – the AWS website would be a good place to start if you want to further your research ( Redshift, Athena ). Coupled architecture (storage and compute) serverless functionality available via Redshift Spectrum.Users can scale by adding compute nodes as needed Composed of leader nodes and compute nodes.Cloud data warehouse based on PostgreSQL.Utilizes Amazon Glue as a Hive MetastoreĪmazon Redshift is a cloud data warehouse optimized for analytics performance.Typically only read-only and does not alter data on S3 however query results can be written to S3.Works on top of Amazon S3 object storage.Serverless query service – AWS manages all infrastructure aspects.Let’s give the 30 second overview of each one: Amazon AthenaĪmazon Athena is a serverless query engine used to retrieve data from Amazon S3 using SQL. Once you understand the differences between an enterprise data warehouse such as Redshfit and an interactive query service such as Athena, you can identify the scenarios in which one will be more useful than the other. Redshift and Athena from a Bird’s Eye View We’ll try to do that with 2 sentences, 4 questions, and a detailed feature-by-feature breakdown. In this article, we’ll try to do exactly that – help you identify the use cases where you’d use Redshift for your analytic workloads, versus those where you’d lean more towards Athena. Nevertheless, there are some high-level factors you can look at to help you gauge which tech could be the more relevant for your situation. ) Like most cliches, this one is also true technology decisions are rarely black and white and if two platforms are commonly used, it’s often because each can be useful for some users in some scenarios. (we’ve done it too when we compared Kafka to Kinesis. “Tool X vs Tool Y” comparisons usually start and end with a cop-out stating that everything depends on your particular circumstances, there is no single correct answer, etc. The rapid release of new databases and analytics tools – both by AWS and newer players such as Snowflake – can leave software architects baffled as to which stack they should adopt in order to solve a specific business scenario. Redshift has only been commercially available since 2013 Amazon Athena has only been around since 2016. With the near-ubiquity of Amazon Web Services in cloud computing, it’s easy to forget how new many of its services are. ![]() Read on for the excerpt, or get the full education pack for FREE right here. The following article is part of our free Amazon Athena resource bundle.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |