Bandwidth Modeling in Large Distributed Systems for Big Data Applications


Speaker: Bahman Javadi

Affiliation: University of Wester Sydney

Time: Monday 23/03/2015 from 14:00 to 15:00

Venue: Access Grid UWS. Presented from Parramatta (EB.1.32), accessible from Campbelltown (26.1.50) and Penrith (Y239).

Abstract: The emergence of Big Data applications provides new challenges in data management such as processing and movement of masses of data. Volunteer computing has proven itself as a distributed paradigm that can fully support Big Data generation. This paradigm uses a large number of heterogeneous and unreliable Internet-connected hosts to provide Peta-scale computing power for scientific projects. With the increase in data size and number of devices that can potentially join a volunteer computing project, the host bandwidth can become
a main hindrance to the analysis of the data generated by these projects, especially if the analysis is a concurrent approach based on either in-situ or in-transit processing. In this paper, we propose a bandwidth model for volunteer computing projects based on the real trace data taken from the Docking@home
project with more than 280,000 hosts over a 5-year period. We validate the proposed statistical model using model-based and simulation-based techniques. Our modeling provides us with valuable insights on the concurrent integration of data generation with in-situ and in-transit analysis in the volunteer computing
paradigm.

Biography: Dr Bahman Javadi is a Lecturer in Networking and Cloud Computing at the University of Western Sydney. Before that he appointed as a Research Fellow at the University of Melbourne, Australia. He was a Postdoctoral Fellow at the INRIA Rhone-Alpes, France in 2008-2010. He received his MS and PhD degrees in Computer Engineering from the Amirkabir University of Technology in 2001 and 2007, respectively. He had been a Research Scholar at the School of Engineering and Information Technology, Deakin University, Australia during his PhD course. Dr Javadi is the co-founder of the Failure Trace Archive, which serves as a public repository of failure traces and algorithms for distributed systems. He served as a program committee of many international conferences and workshops and co-guest editor of a special issue of the Journal of Future Generation Computer Systems on Desktop Grids.