Pason Systems: Petabyte Scale Drilling Datamart on AWS
Learn how Pason built a petabyte-scale, customer-facing drilling datamart using AWS services. Ed Quan from Pason describes how Amazon EMR, together with Presto, provides a secure and seamless SQL interface to data stored in Amazon RDS and Hive. Hive periodically runs ETL jobs that load billions of sensor data points from microservices into Amazon S3 through EMRFS. You will also learn how multiple EMR clusters were used to provide high availability and distribute workloads across the solution.