Implementation of a streaming architecture on AWS that captures real-time data from YouTube, processes it with Apache Flink, and stores it in an S3 Datalake.