Community
Most clients have a significant on-prem data presence which they’re planning on migrating to AWS but aren’t sure where to start. My recommendation is to start small, move quickly, and iterate. Hence this short introductory blog post on how to get started without a lot of development resources, while at the same time utilizing the latest and greatest AWS services.
I’ll try to keep the words to a minimal, so let’s jump in by going through each step in the above architecture:
Prep: Create an S3 bucket (e.g. my-data-lake). Create “raw” and “processed” prefixes. Create any additional sub-prefixes (e.g. /customers and /requests). It really comes down to what type of data you have. The goal is to move all of your on-prem data into Amazon S3 first and foremost, so make sure your data lake bucket is setup appropriately. If later on you’ll be giving direct access to your S3 data to your users, you will want to incorporate the Cognito Federated ID into the bucket prefix to control which users will be accessing which bucket prefixes. You will also want to have a chronological order to your bucket items for multiple reasons, including being Amazon Athena-friendly (even if the data is raw). Here’s what a prefix might look like:
/raw/customers/user=ASDF44444/year=2021/month=11/day=08/
There are no servers to manage here. Even the Aurora MySQL database, which gives you access to the EC2 instances, manages the database and resources for you, so you don’t really need to sorry about these resources and let Aurora do its magic. The web app can be easily hosted on S3 as well.
Now you’re ready for data consumption.
7. As you can see, there are a lot of steps in this section. You can create a very simple web app that allows users to authenticate with Cognito and then have access to their data (if you saved the ingested data under a specific userId which corresponds to the Cognito federated id, you can give your users direct access to their S3 data, or pipe the web requests through a Lambda function, checking their AuthZ that way. You can build a very complex and data-rich app using our AWS AppSync service. It will take care of interacting with the Lambda function (or an HTTP endpoint that you specify) and will streamline your data retrieval on the web app.
This content is provided by an external author without editing by Finextra. It expresses the views and opinions of the author.
Kate Obiidykhata Group Product Marketing Manager at Percona
22 August
Dave Glaser CEO at Dwolla
Parminder Saini CEO at Triple Minds
21 August
Alex Kreger Founder and CEO at UXDA Financial UX Design
Welcome to Finextra. We use cookies to help us to deliver our services. You may change your preferences at our Cookie Centre.
Please read our Privacy Policy.