Akidau, T., et al.: The dataflow model: a practical approach to balancing correctness, latency, and cost in massive-scale, unbounded, out-of-order data processing. Proc. VLDB Endow. 8(12), 1792–1803 (2015)
CrossRef
Google Scholar
Bao, Y., Peng, Y., Wu, C., Li, Z.: Online job scheduling in distributed machine learning clusters. In: INFOCOM 2018. IEEE, April 2018
Google Scholar
Carbone, P., Katsifodimos, A., Ewen, S., Markl, V., Haridi, S., Tzoumas, K.: Apache Flink\(^\text{ TM }\): stream and batch processing in a single engine. IEEE Data Eng. Bull. 38(4), 28–38 (2015)
Google Scholar
Dean, J., Ghemawat, S.: MapReduce: Simplified Data Processing on Large Clusters. In: OSDI 2004. USENIX Association, January 2004
Google Scholar
Delimitrou, C., Kozyrakis, C.: Paragon: QoS-aware scheduling for heterogeneous datacenters. In: ASPLOS 2013. ACM, March 2013
Google Scholar
Delimitrou, C., Kozyrakis, C.: Quasar: resource-efficient and QoS-aware cluster management. In: ASPLOS 2014. ACM, March 2014
Google Scholar
Ghodsi, A., Zaharia, M., Hindman, B., Konwinski, A., Shenker, S., Stoica, I.: Dominant resource fairness: fair allocation of multiple resource types. In: NSDI 2011. USENIX Association, March 2011
Google Scholar
Ghoting, A., et al.: SystemML: declarative machine learning on mapreduce. In: ICDE 2011. IEEE, April 2011
Google Scholar
Gonzalez, J.E., Xin, R.S., Dave, A., Crankshaw, D., Franklin, M.J., Stoica, I.: GraphX: graph processing in a distributed dataflow framework. In: OSDI 2014. USENIX Association, September 2014
Google Scholar
Hindman, B., et al.: Mesos: a platform for fine-grained resource sharing in the data center. In: NSDI 2011. USENIX Association, March 2011
Google Scholar
Jyothi, S.A., et al.: Morpheus: towards automated SLOs for enterprise clusters. In: OSDI 2016. USENIX Association, November 2016
Google Scholar
Ludwig, U.L., Xavier, M.G., Kirchoff, D.F., Cezar, I.B., De Rose, C.A.F.: optimizing multi-tier application performance with interference and affinity-aware placement algorithms. Concurr. Comput. Pract. Exp. 31(18), e5098 (2018)
Google Scholar
Mao, H., Alizadeh, M., Menache, I., Kandula, S.: Resource management with deep reinforcement learning. In: HotNets 2016. ACM, November 2016
Google Scholar
Mao, H., Schwarzkopf, M., Venkatakrishnan, S.B., Meng, Z., Alizadeh, M.: Learning Scheduling Algorithms for Data Processing Clusters. arXiv preprint arXiv:1810.01963, October 2018
Meng, X., et al.: MLlib: machine learning in apache spark. J. Mach. Learn. Res. 17(1), 1235–1241 (2016)
MathSciNet
MATH
Google Scholar
Niu, Z., Tang, S., He, B.: Gemini: an adaptive performance-fairness scheduler for data-intensive cluster computing. In: CloudCom2015. IEEE, November 2015
Google Scholar
Olston, C., Reed, B., Srivastava, U., Kumar, R., Tomkins, A.: Pig Latin: a not-so-foreign language for data processing. In: SIGMOD 2008. ACM, June 2008
Google Scholar
Ousterhout, K., Rasti, R., Ratnasamy, S., Shenker, S., Chun, B.G.: Making sense of performance in data analytics frameworks. In: NSDI 2015. USENIX Association, March 2015
Google Scholar
Rasley, J., Karanasos, K., Kandula, S., Fonseca, R., Vojnovic, M., Rao, S.: Efficient queue management for cluster scheduling. In: EuroSys 2016. ACM, April 2016
Google Scholar
Reiss, C., Tumanov, A., Ganger, G.R., Katz, R.H., Kozuch, M.A.: Heterogeneity and dynamicity of clouds at scale: google trace analysis. In: SoCC 2012. ACM, October 2012
Google Scholar
Renner, T., Thamsen, L., Kao, O.: Network-aware resource management for scalable data analytics frameworks. In: Big Data 2015. IEEE, October 2015
Google Scholar
Thamsen, L., Rabier, B., Schmidt, F., Renner, T., Kao, O.: Scheduling recurring distributed dataflow jobs based on resource utilization and interference. In: BigData Congress. IEEE, June 2017
Google Scholar
Thamsen, L., Verbitskiy, I., Rabier, B., Kao, O.: Learning efficient co-locations for scheduling distributed dataflows in shared clusters. Serv. Trans. Big Data 4(1), 1–15 (2019)
Google Scholar
Vavilapalli, V.K., et al.: Apache hadoop YARN: yet another resource negotiator. In: SOCC 2013. ACM, September 2013
Google Scholar
Xavier, M.G., De Rose, C.A.: Data Processing with Cross-application Interference Control via System-level Instrumentation. Ph.D. Thesis at PUCRS, Brazil (2018)
Google Scholar
Zaharia, M., Borthakur, D., Sen Sarma, J., Elmeleegy, K., Shenker, S., Stoica, I.: Delay scheduling: a simple technique for achieving locality and fairness in cluster scheduling. In: EuroSys 2010. ACM, April 2010
Google Scholar
Zaharia, M., Chowdhury, M., Franklin, M.J., Shenker, S., Stoica, I.: Spark: cluster computing with working sets. In: HotCloud 2010. USENIX Association, June 2010
Google Scholar