Our customers use Amazon EMR (including Apache Hadoop and the full range of tools that make up the Apache Spark ecosystem) to handle many types of mission-critical big data use cases. For example:
Yelp processes over a terabyte of log files and photos every day.
Expedia processes streams of clickstream, user interaction, and supply data.
FINRA analyzes billions of brokerage transaction records daily.
DataXu evaluates 30 trillion ad opportunities monthly.
Because customers like these (see our big data use cases for many others) are processing data that is mission-critical and often sensitive, they need to keep it safe and sound.
We already offer several data encryption options for EMR including server and client side encryption for Amazon S3 with EMRFS and Transparent Data Encryption for HDFS. While these solutions do a good job of protecting data at rest, they do not address data stored in temporary files
Original URL: http://feedproxy.google.com/~r/AmazonWebServicesBlog/~3/tOgULd8drUY/