Big Data upended the economics and architectural practices of enterprise data warehousing by not only making it cost effective to store and process more data and more varied forms of it, but also promoting new patterns that pushed analytics computing and data tiers together.
Now the cloud is prompting a shift of the pendulum back the other way. By decoupling data from compute, cloud Big Data services take advantage of object storage, which is far cheaper than HDFS file storage, and compute can be made elastic. While Amazon EMR allows customers the option to use HDFS, most EMR customers have embraced S3. Recently, Amazon customer FINRA collaborated to port AWS’s HBase service to S3.