CLOUD-BASED FLOWBSTER PORTAL TO DESIGN AND DEPLOY SCIENTIFIC WORKFLOWS

Volume 1 (2), December 2018, Pages 140-157

József Kovács, Zoltán Farkas, Enikő Nagy and Bendegúz Gúlyás


Institute for Computer Science and Control, Hungarian Academy of Sciences (MTA SZTAKI), Budapest, Hungary This email address is being protected from spambots. You need JavaScript enabled to view it., This email address is being protected from spambots. You need JavaScript enabled to view it., This email address is being protected from spambots. You need JavaScript enabled to view it., This email address is being protected from spambots. You need JavaScript enabled to view it.


Abstract

A workflow system called Flowbster has been designed to create efficient data pipelines in clouds. The entire Flowbster workflow is dynamically built by using virtual machines on a target cloud. The paper describes a recently designed and developed web-based science gateway to support Flowbster. It provides a high-level graphical environment to handle different levels of abstractions, like workflows representing the layout and deployment representing the infrastructure realizing the workflow. Detailed overview of the user interface, the portal architecture and its internal operation are given in the paper. Moreover, an insight is provided on the selection and cooperation of the web modules and on the integration of the portal in the firebase environment developed by Google.

Keywords:

workflow, science gateways, orchestration, firebase.

DOI: https://doi.org/10.32010/26166127.2018.1.2.140.157

 

Reference 

[1] Buyya, R., Broberg, J., & Gościński, A. (2011). Cloud computing: Principles and paradigms. Hoboken, NJ: Wiley.

[2] Liu, J., Pacitti, E., Valduriez, P., & Mattoso, M. (2015). A Survey of Data-Intensive Scientific Workflow Management. Journal of Grid Computing, 13(4), 457-493. doi:10.1007/s10723-015-9329-8

[3] Kacsuk, P., Kovács, J., & Farkas, Z. (2018). The Flowbster Cloud-Oriented Workflow System to Process Large Scientific Data Sets. Journal of Grid Computing, 16(1), 55-83. doi:10.1007/s10723-017-9420-4

[4] Kovács, J., & Kacsuk, P. (2017). Occopus: A Multi-Cloud Orchestrator to Deploy and Manage Complex Scientific Infrastructures. Journal of Grid Computing, 16(1), 19-37. doi:10.1007/s10723-017-9421-3

[5] Kovács, J. (n.d.). The Cloudifacturing EU project homepage. Retrieved from https://www.cloudifacturing.eu/159

[6] The Angular Framework homepage. (n.d.). Retrieved from http://www.angular.io/

[7] React – A JavaScript library for building user interfaces. (n.d.). Retrieved from https://reactjs.org/

[8] Vue.js. (n.d.). Retrieved from https://vuejs.org/

[9] Get TypeScript. (n.d.). Retrieved from https://www.typescriptlang.org/

[10] Baranovskiy, D. (n.d.).  Raphaël-JavaScript Library. Retrieved from http://dmitrybaranovskiy.github.io/raphael/

[11] Jgraph. (2018, December 14). Jgraph/mxgraph. Retrieved from https://github.com/jgraph/mxgraph [12] JointJS: Visualize and interact with diagrams and graphs. (n.d.). Retrieved from https://www.jointjs.com/opensource

[13] Foundation, N. (n.d.). Retrieved from https://nodejs.org/en/

[14] Anderson, J. C., Lehnardt, J., & Slater, N. (2010). CouchDB: The Definitive Guide: Time to Relax. « O’Reilly Media, Inc.».

[15] Kristina, C., & Michael, D. (2010). MongoDB: the definitive guide. O’Reilly Media, 1ed.–2010.

[16] (n.d.). Retrieved from https://firebase.google.com/

[17] Farkas, Z., Kacsuk, P., & Hajnal, Á. (2016). Enabling workflow-oriented science gateways to access multi-cloud systems. Journal of Grid Computing, 14(4), 619-640.

[18] CloudBroker Platform. (n.d.). Retrieved from http://cloudbroker.com/platform/

[19] Marru, S., Gardler, R., Slominski, A., Douma, A., Perera, S., Weerawarana, S., . . . Chinthaka, E. (2011). Apache airavata. Proceedings of the 2011 ACM Workshop on Gateway Computing Environments - GCE 11. doi:10.1145/2110486.2110490

[20] Perera, S., Marru, S., & Herath, C. (2008, June). Workflow infrastructure for multi-scale science gateways. In TeraGrid Conference.

[21] Deelman, E., Singh, G., Su, M. H., Blythe, J., Gil, Y., Kesselman, C., ... & Laity, A. (2005). Pegasus: A framework for mapping complex scientific workflows onto distributed systemsScientific Programming, 13(3), 219-237.

[22] McLennan, M., Clark, S., Deelman, E., Rynge, M., Vahi, K., McKenna, F., ... & Song, C. (2013). Bringing scientific workflow to the masses via pegasus and hubzero. parameters, 13, 14.

[23] McLennan, M., & Kennell, R. (2010). HUBzero: a platform for dissemination and collaboration in computational science and engineering. Computing in Science & Engineering, 12(2), 48-53.

[24] Balis, B., Figiela, K., Malawski, M., Pawlik, M., & Bubak, M. (2015, September). A lightweight approach for deployment of scientific workflows in cloud infrastructures. In International Conference on Parallel Processing and Applied Mathematics (pp. 281-290). Springer, Cham.

[25] Qasha, R., Cała, J., & Watson, P. (2016, October). A framework for scientific workflow reproducibility in the cloud. In e-Science (e-Science), 2016 IEEE 12th International Conference on (pp. 81-90). IEEE.

[26] Qasha, R., Cala, J., & Watson, P. (2016, December). Dynamic deployment of scientific workflows in the cloud using container virtualization. In 2016 IEEE International Conference on Cloud Computing Technology and Science (CloudCom) (pp. 269-276).