Talia, D. (2019). A view of programmable scalable data analysis: From clouds to exascale. Journal of Cloud Computing, 8(4), 1-16. This journal article discusses why scalability matters for big data analysis solutions and how parallel implementations achieve it by exploiting the computing and storage facilities of high-performance computing (HPC) systems and clouds.
Heisel, M., Mistrik, I., Bahsoon, R., Maxim, B., & Ali, N. (2017). Software architecture for big data and the cloud. Morgan Kaufmann. Read Chapter 14 – Exploring the Evolution of Big Data Technologies.
This chapter explores different facets of data processing, such as MapReduce, machine learning, and streaming data.