Memory-based scheduling of scientific computing clusters

Niemi, Tapio; Hameri, Ari-Pekka
September 2012
Journal of Supercomputing;Sep2012, Vol. 61 Issue 3, p520
Academic Journal
This study looks at how increased memory utilisation affects throughput and energy consumption in scientific computing, especially in high-energy physics. Our aim is to minimise energy consumed by a set of jobs without increasing the processing time. The earlier tests indicated that, especially in data analysis, throughput can increase over 100% and energy consumption decrease 50% by processing multiple jobs in parallel per CPU core. Since jobs are heterogeneous, it is not possible to find an optimum value for the number of parallel jobs. A better solution is based on memory utilisation, but finding an optimum memory threshold is not straightforward. Therefore, a fuzzy logic-based algorithm was developed that can dynamically adapt the memory threshold based on the overall load. In this way, it is possible to keep memory consumption stable with different workloads while achieving significantly higher throughput and energy-efficiency than using a traditional fixed number of jobs or fixed memory threshold approaches.


Related Articles

  • Power-aware register assignment for large register file design. Shieh, Wann-Yun; Wang, Bo-Syun // Journal of Supercomputing;Sep2012, Vol. 61 Issue 3, p719 

    The design trend of high-speed microprocessors is toward wider and wider issue architecture to increase instruction-level parallelism. Such architecture needs a large register file to reduce register pressure. A large register file, however, consumes much more power during program execution. In...

  • A queuing theory model for cloud computing. Vilaplana, Jordi; Solsona, Francesc; Teixidó, Ivan; Mateo, Jordi; Abella, Francesc; Rius, Josep // Journal of Supercomputing;Jul2014, Vol. 69 Issue 1, p492 

    The ability to deliver guaranteed QoS (Quality of Service) is crucial for the commercial success of cloud platforms. This paper presents a model based on queuing theory to study computer service QoS in cloud computing. Cloud platforms are modeled with an open Jackson network that can be used to...

  • Accelerating Pairwise DNA Sequence Alignment using the CUDA Compatible GPU. Khaled, H.; El Gohary, R.; Badr, N. L.; Faheem, H. M. // International Journal of Computer Applications;Dec2013, Vol. 84, p25 

    We present a novel implementation of the pairwise DNA sequence alignment problem other than the Dynamic programming solution presented by Smith Waterman Algorithm. The proposed implementation uses CUDA; the parallel computing platform and programming model invented by NVIDIA. The main idea of...

  • Scalable distributed data allocation in LuNA fragmented programming system. Malyshkin, Victor; Perepelkin, Vladislav; Schukin, Georgy // Journal of Supercomputing;Feb2017, Vol. 73 Issue 2, p726 

    The paper presents a scalable distributed algorithm for static and dynamic data allocation in LuNA fragmented programming system. LuNA is intended for automation of construction of parallel programs, which implement large-scale numerical models for multicomputers with large number of computing...

  • A secure and efficient identity-based authenticated key exchange protocol for mobile client-server networks. Farash, Mohammad; Attari, Mahmoud // Journal of Supercomputing;Jul2014, Vol. 69 Issue 1, p395 

    Recently, Chou et al. (J Supercomput 66(2): 973-988, ) proposed two identity-based key exchange protocols using elliptic curves for mobile environments. The first one is an two-party authentication key exchange protocol to establish a session key between a client and a remote server. The second...

  • Performance-energy adaptation of parallel programs in pervasive computing. Zhu, Liang; Jin, Hai; Liao, Xiaofei; Yue, Jianhui // Journal of Supercomputing;Dec2014, Vol. 70 Issue 3, p1260 

    It is meaningful to use a little energy to obtain more performance improvement compared with the increased energy. It also makes sense to relax a small quantity of performance restriction to save an enormous amount of energy. Trading a small amount of energy for a considerable sum of performance...

  • ACOTES Project: Advanced Compiler Technologies for Embedded Streaming. Munk, Harm; Ayguad�, Eduard; Bastoul, C�dric; Carpenter, Paul; Chamski, Zbigniew; Cohen, Albert; Cornero, Marco; Dumont, Philippe; Duranton, Marc; Fellahi, Mohammed; Ferrer, Roger; Ladelsky, Razya; Lindwer, Menno; Martorell, Xavier; Miranda, Cupertino; Nuzman, Dorit; Ornstein, Andrea; Pop, Antoniu; Pop, Sebastian; Pouchet, Louis-No�l // International Journal of Parallel Programming;Jun2011, Vol. 39 Issue 3, p397 

    Streaming applications are built of data-driven, computational components, consuming and producing unbounded data streams. Streaming oriented systems have become dominant in a wide range of domains, including embedded applications and DSPs. However, programming efficiently for streaming...

  • A moving threads processor architecture MTPA. Forsell, M.; Leppänen, V. // Journal of Supercomputing;Jul2011, Vol. 57 Issue 1, p5 

    Moving threads is a new kind of approach for multicore processor architectures. Traditionally, each thread stays in the core where it is created, and data is moved from the main memory via caches to each core and thread. In the moving threads approach, each core can access only a certain portion...

  • Methods for Optimizing OpenCL Applications on Heterogeneous Multicore Architectures. Slo-Li Chu; Chih-Chieh Hsiao // Applied Mathematics & Information Sciences;2013, Vol. 7 Issue 6, p2549 

    Heterogeneous multicore architectures with CPU and add-on GPUs or streaming processors are now widely used in computer systems. These GPUs provide substantially more computation capability and memory bandwidth compared to traditional multi-cores. Also, because they are highly programmable, they...


Read the Article


Sorry, but this item is not currently available from your library.

Try another library?
Sign out of this library

Other Topics