There are many ways to construct a big data flow on AWs depending on the time, skills, budgets, objective and operational support.

Key architectural principle is simplicity. A second is cost control.

Types of Data:

 

Methods of using Big Data:

 

Delivery:

 

Architecture Principles

-immutable data lakes, materialised views

ETL – normalised view of different data sets and schemas  eg Glue analyses, CSV, JSON, Parquet