Performance tuning of MapReduce jobs using surrogate-based modeling
Travis Johnston, Mohammad Alsulmi, Pietro Cicotti, and Michela Taufer. 2015
Modeling workflow performance is crucial for finding optimal configuration parameters and minimizing execution times. We apply the method of surrogate-based modeling to performance tuning of MapReduce jobs. We build a surrogate model defined by a multivariate polynomial containing one variable for each parameter to be tuned. For illustrative purposes, we focus on just two parameters: the number of parallel mappers and the number of parallel reducers. We demonstrate that an accurate performance model can be built by sampling a small subset of the parameter space. We compare the accuracy and cost of building the model when using different sampling methods as well as when using different modeling approaches. We conclude that the surrogate-based approach we describe is both less expensive in terms of sampling time and more accurate than other well-known tuning methods.
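To make the idea concrete, the following is a minimal sketch of a two-variable polynomial surrogate, not the authors' implementation. It assumes a degree-2 polynomial in the number of mappers m and reducers r, fitted by ordinary least squares on a coarse sample of the grid. The abstract does not state the polynomial degree, the fitting procedure, or the sampling scheme, so all three are assumptions here. The function toy_runtime is a hypothetical stand-in for measured job runtimes and does not reproduce any result from the paper.

```python
from itertools import product


def monomials(m, r, degree=2):
    """Return all terms m**i * r**j with i + j <= degree."""
    return [m**i * r**j for i in range(degree + 1) for j in range(degree + 1 - i)]


def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda k: abs(M[k][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for k in range(col + 1, n):
            f = M[k][col] / M[col][col]
            for c in range(col, n + 1):
                M[k][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(M[i][c] * x[c] for c in range(i + 1, n))
        x[i] = (M[i][n] - tail) / M[i][i]
    return x


def fit_surrogate(samples, degree=2):
    """Least-squares fit via the normal equations (X^T X) c = X^T y."""
    X = [monomials(m, r, degree) for m, r, _ in samples]
    y = [t for _, _, t in samples]
    k = len(X[0])
    XtX = [[sum(row[i] * row[j] for row in X) for j in range(k)] for i in range(k)]
    Xty = [sum(row[i] * t for row, t in zip(X, y)) for i in range(k)]
    return solve(XtX, Xty)


def predict(coeffs, m, r, degree=2):
    """Evaluate the fitted polynomial at (m, r)."""
    return sum(c * t for c, t in zip(coeffs, monomials(m, r, degree)))


def toy_runtime(m, r):
    # Hypothetical runtime in seconds; a real study would time actual jobs.
    return 200.0 + 2.0 * (m - 12) ** 2 + 3.0 * (r - 6) ** 2


# Sample only 12 of the 200 configurations (assumed coarse-grid sampling).
sampled = [(m, r, toy_runtime(m, r)) for m, r in product([2, 8, 14, 20], [2, 6, 10])]
coeffs = fit_surrogate(sampled)

# Search the full grid using the cheap surrogate instead of running every job.
best = min(product(range(1, 21), range(1, 11)), key=lambda p: predict(coeffs, *p))
print("predicted best (mappers, reducers):", best)
```

Because the toy runtime is itself quadratic, the surrogate recovers it almost exactly. With real measurements the fit would only approximate the runtime surface, and the choice of degree and sample points would need to be validated against held-out runs.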