ML on AWS : Read from your favorite AWS Data Sources to a Pandas DataFrame using AWS Data Wrangler

Chouaieb Nemri
4 min readSep 20, 2022

AI generated image using DALL-E mini (The prompt was “Pandas in the Cloud) — Learn about DALL-E here.

Note: I have received no compensation for writing this piece. Please consider supporting mine and others’ writing by becoming a Medium member with this link.

Introduction

Machine Learning is being used in many applications across the world. A lot of data is being generated by these applications and systems. This data needs to be stored, processed and analyzed to extract useful information. Amazon Web Services (AWS) provides a wide variety of services that can be used to store, process and analyze your application’s data. In this blog post, we will focus on how you can use AWS Data Wrangler to read from your favorite AWS Data Sources to a Pandas DataFrame.

Pandas is an open source, BSD-licensed library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language.

AWS Wrangler is an AWS Professional Service open source python initiative that extends the power of Pandas library to AWS connecting DataFrames and AWS data related services. It provides easy integration with Athena, Glue, Redshift, Timestream, OpenSearch…

--

--