1 /*! \page publis Reference publications about SimGrid
4 When citing SimGrid, the preferred reference paper is <i>SimGrid: a
5 Generic Framework for Large-Scale Distributed Experimentations</i>.
7 \li <b>SimGrid: a Generic Framework for Large-Scale Distributed Experimentations</b>\n
9 by <em>Henri Casanova, Arnaud Legrand and Martin Quinson</em>\n
10 Proceedings of the 10th IEEE International Conference on Computer
11 Modelling and Simulation (UKSIM/EUROSIM'08)\n
12 Distributed computing is a very broad and active research area
13 comprising fields such as cluster computing, computational
14 grids, desktop grids and peer-to-peer (P2P) systems.
15 Unfortunately, it is often impossible to obtain theoretical or
16 analytical results to compare the performance of algorithms
17 targeting such systems. One possibility is to conduct large
18 numbers of back-to-back experiments on real platforms. While
19 this is possible on tightly-coupled platforms, it is infeasible
20 on modern distributed platforms as experiments are labor-intensive
21 and results typically not reproducible. Consequently, one must
22 resort to simulations, which enable reproducible results and also
23 make it possible to explore wide ranges of platform and
24 application scenarios.\n
25 In this paper we describe the SimGrid framework, a
26 simulation-based framework for evaluating cluster, grid and P2P
27 algorithms and heuristics. This paper focuses on SimGrid v3, which
28 greatly improves on previous versions thanks to a novel and
29 validated modular simulation engine that achieves higher
30 simulation speed without hindering simulation accuracy. Also, two
31 new user interfaces were added to broaden the targeted research
32 community. After surveying existing tools and methodologies we
33 describe the key features and benefits of SimGrid.\n
34 http://www.loria.fr/~quinson/articles/SimGrid-uksim08.pdf
36 The previous publication does not cover the GRAS part of the framework. If you
37 want to cite GRAS, please use this publication instead:
40 \section publis_others Other publications
42 Many other papers were published about SimGrid. The list is
43 split into 3 pages (also accessible from the navbar at the top of this page):
44 - \ref publis_core\n
45 This section contains papers describing some sub-parts of SimGrid,
46 or references superseded by the one given above.
47 - \ref publis_extern\n
48 SimGrid is used by an ever-growing scientific community. This
49 section lists all the papers resulting from work in which the core
50 SimGrid team was not involved.
51 - \ref publis_intra\n
52 This section lists the papers co-signed by at least one core
53 team member and using SimGrid as a tool (and not studying SimGrid itself).
56 \section publis_count Number of published papers using SimGrid results
60 <tr><td>Year</td><td>2000</td><td>2001</td><td>2002</td><td>2003</td><td>2004</td><td>2005</td><td>2006</td><td>2007</td>
62 <tr><td>Number of papers external to the core team</td>
63 <td>-</td> <td>-</td> <td>-</td> <td>3</td> <td>6</td> <td>3</td> <td>8</td> <td>5</td>
65 <tr><td>Number of papers co-signed by one team member</td>
66 <td>1</td> <td>1</td> <td>1</td> <td>2</td> <td>2</td> <td>-</td> <td>4</td> <td>4</td>
70 \page publis_core Other publications about the SimGrid framework
72 \li <b>Speed and Accuracy of Network Simulation in the SimGrid Framework</b>\n
73 by <em>K. Fujiwara, H. Casanova</em>\n
74 in Proceedings of the First International Workshop on Network Simulation Tools (NSTools), Nantes, France, October 2007.\n
75 http://navet.ics.hawaii.edu/~casanova/homepage/papers/fujiwara_nstool2007.pdf
77 \li <b>Cost and Accuracy of Packet-Level vs. Analytical Network Simulations: An Empirical Study</b>\n
78 by <em>K. Fujiwara</em>\n
79 <b>M.S. Thesis</b>, Dept. of Information and Computer Sciences, University of Hawai`i at Manoa, April 2007.\n
80 http://navet.ics.hawaii.edu/~casanova/homepage/theses/kayo_fujiwara_MS.pdf
82 \li <b>Gras: A Research & Development Framework for Grid and P2P
84 by <em>Martin Quinson</em>\n
85 <b>Best paper</b> of the 18th IASTED International Conference on
86 Parallel and Distributed Computing and Systems (PDCS 2006)\n
87 http://www.loria.fr/~quinson/articles/gras-iasted06.pdf
89 \li <b>The SimGrid Project - Simulation and Deployment of Distributed Applications</b>\n
90 by <em>A. Legrand, M. Quinson, K. Fujiwara, H. Casanova</em>\n
91 <b>POSTER</b> in Proceedings of the IEEE International Symposium on High Performance Distributed Computing (HPDC-15), Paris, France, May 2006.\n
93 <a href="http://navet.ics.hawaii.edu/~casanova/homepage/papers/simgrid_hpdc06.pdf"><img src="poster_thumbnail.png" /></a>
95 http://navet.ics.hawaii.edu/~casanova/homepage/papers/simgrid_hpdc06.pdf
97 \li <b>Scheduling Distributed Applications: the SimGrid Simulation Framework</b>\n
98 by <em>Henri Casanova and Arnaud Legrand and Loris Marchal</em>\n
99 Proceedings of the third IEEE International Symposium
100 on Cluster Computing and the Grid (CCGrid'03)\n
101 http://www-id.imag.fr/Laboratoire/Membres/Legrand_Arnaud/articles/simgrid2_CCgrid03.pdf
103 \li <b>A Network Model for Simulation of Grid Application</b>\n
104 by <em>Henri Casanova and Loris Marchal</em>\n
106 In this work we investigate network models that can be
107 potentially employed in the simulation of scheduling algorithms for
108 distributed computing applications. We seek to develop a model of TCP
109 communication which is both high-level and realistic. Previous research
110 works show that accurate and global modeling of wide-area networks, such
111 as the Internet, faces a number of challenging issues. However, some
112 global models of fairness and bandwidth-sharing exist, and can be linked
113 with the behavior of TCP. Using both previous results and simulation (with
114 NS), we attempt to understand the macroscopic behavior of
115 TCP communications. We then propose a global model of the network for the
116 Grid platform. We perform partial validation of this model in
117 simulation. The model leads to an algorithm for computing
118 bandwidth-sharing. This algorithm can then be implemented as part of Grid
119 application simulations. We provide such an implementation for the
120 SimGrid simulation toolkit.\n
121 ftp://ftp.ens-lyon.fr/pub/LIP/Rapports/RR/RR2002/RR2002-40.ps.gz
124 \li <b>MetaSimGrid : Towards realistic scheduling simulation of
125 distributed applications</b>\n
126 by <em>Arnaud Legrand and Julien Lerouge</em>\n
127 Most scheduling problems are already hard on homogeneous
128 platforms; they become quite intractable in a heterogeneous
129 framework such as a metacomputing grid. In the best cases, a
130 guaranteed heuristic can be found, but most of the time, it is
131 not possible. Real experiments or simulations are often
132 involved to test or to compare heuristics. However, on a
133 distributed heterogeneous platform, such experiments are
134 technically difficult to drive, because of the genuine
135 instability of the platform. It is almost impossible to
136 guarantee that a platform which is not dedicated to the
137 experiment, will remain exactly the same between two tests,
138 thereby forbidding any meaningful comparison. Simulations are
139 then used to replace real experiments, so as to ensure the
140 reproducibility of measured data. A key issue is the
141 possibility to run the simulations against a realistic
142 environment. The main idea of trace-based simulation is to
143 record the platform parameters today, and to simulate the
144 algorithms tomorrow, against the recorded data: even though it
145 is not the current load of the platform, it is realistic,
146 because it represents a fair summary of what happened
147 previously. A good example of a trace-based simulation tool is
148 SimGrid, a toolkit providing a set of core abstractions and
149 functionalities that can be used to easily build simulators for
150 specific application domains and/or computing environment
151 topologies. Nevertheless, SimGrid lacks a number of convenient
152 features to craft simulations of a distributed application
153 where scheduling decisions are not taken by a single
154 process. Furthermore, modeling a complex platform by hand is
155 tedious for a few hosts and is almost impossible for a real
156 grid. This report is a survey on simulation for scheduling
157 evaluation purposes and presents MetaSimGrid, a simulator built
159 ftp://ftp.ens-lyon.fr/pub/LIP/Rapports/RR/RR2002/RR2002-28.ps.gz
161 \li <b>SimGrid: A Toolkit for the Simulation of Application
163 by <em>Henri Casanova</em>\n
164 Advances in hardware and software technologies have made it
165 possible to deploy parallel applications over increasingly large
166 sets of distributed resources. Consequently, the study of
167 scheduling algorithms for such applications has been an active area
168 of research. Given the nature of most scheduling problems one must
169 resort to simulation to effectively evaluate and compare their
170 efficacy over a wide range of scenarios. It has thus become
171 necessary to simulate those algorithms for increasingly complex
172 distributed, dynamic, heterogeneous environments. In this paper we
173 present SimGrid, a simulation toolkit for the study of scheduling
174 algorithms for distributed applications. This paper gives the main
175 concepts and models behind SimGrid, describes its API and
176 highlights current implementation issues. We also give some
177 experimental results and describe work that builds on SimGrid's
179 http://grail.sdsc.edu/papers/simgrid_ccgrid01.ps.gz
181 \page publis_extern Papers that use SimGrid-generated results (not counting our own)
183 This list is a selection of articles. We list only papers written by people
184 external to the development group, but we also use our tool ourselves (see
188 - <b>Reactive Grid Scheduling of DAG Applications.</b>\n by
189 <em>I. Hernandez and M. Cole (UK)</em>. In Parallel and Distributed Computing and Networks, 2007.\n
190 http://www.actapress.com/PaperInfo.aspx?PaperID=29625
191 - <b>Dynamic Scheduling of Multi-Processor Tasks on Clusters of Clusters</b>\n
192 by <em>S. Hunold, T. Rauber and G. Rünger</em>.\n
193 Proceedings of the Sixth International Workshop on Algorithms,
194 Models and Tools for Parallel Computing on Heterogeneous Networks
195 (Heteropar'07), Austin, TX, September 2007.
196 - <b>Scheduling Δ-Critical Tasks in Mixed-Parallel Applications on a National Grid</b>\n
197 by <em>Frédéric Suter</em>.\n
198 In 8th IEEE/ACM International Conference on Grid Computing (Grid 2007), Austin, TX, September 2007.
199 - <b>Brokering strategies in computational grids using stochastic
200 prediction models.</b>\n by <em>Vandy Berten and Bruno
201 Gaujal</em>. In Parallel Computing, vol. 33(4-5): 238-249, 2007.\n
202 http://dev.ulb.ac.be/sched/articles/PARCO.pdf
203 - <b>Managing Scheduling and Replication in the LHC Grid.</b>\n by
204 <em>Thomas Ferrandiz and Vania Marangozova</em>. In CoreGrid
205 Workshop on middleware, 2007.\n
208 - <b>Simbatch: an API for simulating and predicting the performance of parallel resources and batch systems.</b>\n
209 by <em>Jean-Sébastien Gay and Yves Caniou</em>\n
210 INRIA Research Report 6040, November 2006.\n
211 https://hal.inria.fr/inria-00115880
212 - <b>Simbatch : une API pour la simulation et la prédiction de performances de systèmes batch</b>\n
213 by <em>Jean-Sébastien Gay and Yves Caniou</em>.\n
214 In 17ème Rencontres Francophones du Parallélisme, des Architectures et des Systèmes, RenPar'17.\n
215 October 4-6, Perpignan, France
216 - <b>Metascheduling Multiple Resource Types using the MMKP</b>\n
217 by <em>D. Vanderster, N. Dimopoulos, R. Sobie</em>\n
218 7th IEEE/ACM International Conference on Grid Computing\n
219 Barcelona, September 28th-29th 2006
220 - <b>Master-Slave Tasking on Asymmetric Networks</b>\n
221 by <em>Cyril Banino-Rokkones, Olivier Beaumont and Lasse Natvig</em>.\n
222 In Proceedings of 12th International Euro-Par Conference, Euro-Par 2006.\n
223 August 29 - September 1, Pages 167--176, Dresden, Germany.
224 - <b>Critical Path and Area Based Scheduling of Parallel Task Graphs on Heterogeneous Platforms</b>\n
225 by <em>Tchimou N'Takpé and Frédéric Suter</em>\n
226 Proceedings of the Twelfth International Conference on Parallel and Distributed Systems (ICPADS)\n
227 Minneapolis, MN, July 12-15, 2006.
228 - <b>Sensitivity Analysis of Knapsack-based Task Scheduling on the Grid</b>\n
229 by <em>D.C. Vanderster and N.J. Dimopoulos</em>.\n
230 In Proceedings of The 20th ACM International Conference on Supercomputing\n
231 Cairns, Australia, June 28-July 1, 2006.\n
232 http://portal.acm.org/citation.cfm?id=1183401.1183446&coll=GUIDE&dl=%23url.coll
233 - <b>Hierarchical Scheduling of Independent Tasks with Shared Files</b>\n
234 by <em>H. Senger, F. Silva, W. Nascimento</em>.\n
235 Proceedings of the Sixth IEEE International Symposium on Cluster
236 Computing and the Grid Workshop (CCGRIDW'06)\n
237 Singapore, 16-19 May 2006.\n
238 http://www.unisantos.br/mestrado/informatica/hermes/File/senger-HierarchicalScheduling-Workshop-TB120.pdf
239 - <b>Evaluation of Knapsack-based Scheduling using the NPACI JOBLOG</b>\n
240 by <em>D. Vanderster, N. Dimopoulos, R. Parra-Hernandez and R. Sobie</em>.\n
241 20th International Symposium on High-Performance Computing in an
242 Advanced Collaborative Environment (HPCS'06)\n
243 St. John's, Newfoundland, Canada, 14-17 May 2006\n
244 http://doi.ieeecomputersociety.org/10.1109/HPCS.2006.23
247 - <b>On Dynamic Resource Management Mechanism using Control
248 Theoretic Approach for Wide-Area Grid Computing</b>\n
249 by <em>Hiroyuki Ohsaki, Soushi Watanabe, and Makoto Imase</em>\n
250 in Proceedings of IEEE Conference on Control Applications (CCA 2005), Aug. 2005.\n
251 http://www.ispl.jp/~oosaki/papers/Ohsaki05_CCA.pdf
252 - <b>Evaluation of Meta-scheduler Architectures and Task Assignment Policies for
253 high Throughput Computing</b>\n
254 by <em>Eddy Caron, Vincent Garonne and Andrei Tsaregorodtsev</em>\n
255 Proceedings of 4th International Symposium on Parallel and
256 Distributed Computing Job Scheduling Strategies for Parallel
257 Processing (ISPDC'05), July 2005.\n
258 http://www.ens-lyon.fr/LIP/Pub/Rapports/RR/RR2005/RR2005-27.pdf
259 - <b>Algorithmes de redistribution de données pour anneaux de processeurs hétérogènes</b>\n
260 by <em>Hélène Renard, Yves Robert and Frédéric Vivien</em>\n
261 In 16ième Rencontres Francophones du Parallélisme des Architectures et des Systèmes, Le Croisic, France, April 6-8, 2005.\n
262 http://www.polytech.unice.fr/~hrenard/recherche/Renpar16.ps
264 - <b>Deadline Scheduling with Priority for Client-Server Systems on the Grid</b>\n
265 by <em>Eddy Caron, PK Chouhan, Frédéric Desprez</em>\n
266 in IEEE International Conference On Grid Computing. Super Computing 2004, October 2004.
267 - <b>Efficient Scheduling Heuristics for GridRPC Systems</b>\n
268 by <em>Yves Caniou and Emmanuel Jeannot.</em>\n
269 in IEEE QoS and Dynamic System workshop (QDS) of International Conference
270 on Parallel and Distributed Systems (ICPADS), Newport Beach, California, USA,
271 pages 621-630, July 2004\n
272 http://graal.ens-lyon.fr/~ycaniou/QDS04.ps
273 - <b>Exploiting Replication and Data Reuse to Efficiently Schedule
274 Data-intensive Applications on Grids</b>\n
275 by <em> E. Santos-Neto, W. Cirne, F. Brasileiro, A. Lima.</em>\n
276 Proceedings of 10th Job Scheduling Strategies for Parallel Processing, June 2004.\n
277 http://www.lsd.ufcg.edu.br/~elizeu/articles/jsspp.v6.pdf
278 - <b>Resource Management and Knapsack Formulations on the Grid</b>\n
279 by <em>R. Parra-Hernandez, D. Vanderster and N. J. Dimopoulos</em>\n
280 Fifth IEEE/ACM International Workshop on Grid Computing (GRID'04)\n
281 http://doi.ieeecomputersociety.org/10.1109/GRID.2004.54
282 - <b>Scheduling BoT Applications in Grids using a Slave Oriented Adaptive
284 by <em>T. Ferreto, C. A. F. De Rose and C. Northfleet.</em>\n
285 Second International Symposium on Parallel and Distributed Processing
286 and Applications (ISPA), 2004, Hong Kong. Published in Lecture Notes in
287 Computer Science (LNCS), Volume 3358, by Springer-Verlag. p. 392-398.
288 - <b>Data redistribution algorithms for heterogeneous processor rings</b>\n
289 by <em>Hélène Renard, Yves Robert and Frédéric Vivien</em>\n
290 In International Conference on High Performance Computing HiPC'2004\n
291 http://www.polytech.unice.fr/~hrenard/recherche/Hipc.pdf
293 - <b>Link-Contention-Aware Genetic Scheduling Using Task Duplication in Grid Environments</b>\n
294 by <em>Wensheng Yao, Xiao Xie and Jinyuan You</em>\n
295 in Grid and Cooperative Computing: Second International Workshop, GCC 2003, Shanghai, China, December 7-10, 2003 (LNCS)\n
296 http://www.chinagrid.edu.cn/chinagrid/download/GCC2003/pdf/266.pdf
297 - <b>New Dynamic Heuristics in the Client-Agent-Server Model</b>\n
298 by <em>Yves Caniou and Emmanuel Jeannot</em>\n
299 in IEEE 13th Heterogeneous Computing Workshop - HCW'03, Nice, France, April 2003.\n
300 http://graal.ens-lyon.fr/~ycaniou/HCW03.ps
301 - <b>A Hierarchical Resource Reservation Algorithm for Network Enabled Servers</b>\n
302 by <em>Eddy Caron, Frédéric Desprez, Franck Petit, V. Villain</em>\n
303 in the 17th International Parallel and Distributed Processing Symposium -- IPDPS'03, Nice, France, April 2003.
305 \page publis_intra Our own papers that use SimGrid-generated results
307 This list is a selection of the articles we have written that used results
308 generated by SimGrid.
311 - <b>Assessing the Quality of Automatically Built Network Representations</b>\n
312 by <em>Lionel Eyraud-Dubois and Martin Quinson</em>\n
313 In Seventh IEEE International Symposium on Cluster Computing and
314 the Grid (CCGrid 2007), 14-17 May 2007, Rio de Janeiro, Brazil.
315 - <b>A Comparison of Scheduling Approaches for Mixed-Parallel Applications on Heterogeneous Platforms</b>\n
316 by <em>Tchimou N'takpé, Frédéric Suter, and Henri Casanova</em>\n
317 In 6th International Symposium on Parallel and Distributed Computing, Hagenberg, Austria, July 2007.
318 - <b>A First Step Towards Automatically Building Network Representations</b>\n
319 by <em>Lionel Eyraud-Dubois, Arnaud Legrand, Martin Quinson and Frédéric Vivien</em>\n
320 In 13th International Euro-Par Conference (Euro-Par 2007), 28-31 August 2007, Rennes, France.
321 - <b>Centralized Versus Distributed Schedulers for Multiple Bag-of-Tasks Applications</b>\n
322 by <em>Olivier Beaumont, Larry Carter, Jeanne Ferrante, Arnaud Legrand, Loris Marchal, and Yves Robert</em>\n
323 In IEEE Trans. Parallel Distributed Systems, 2007.
325 - <b>On the Harmfulness of Redundant Batch Requests</b>\n
326 by <em>H. Casanova</em>\n
327 Proceedings of the IEEE International Symposium on High Performance Distributed Computing (HPDC-15), Paris, France, May 2006.\n
328 http://navet.ics.hawaii.edu/~casanova/homepage/papers/hpdc_2006.pdf
329 - <b>An evaluation of Job Scheduling Strategies for Divisible Loads on Grid Platforms</b>\n
330 by <em>Y. Cardinale, H. Casanova</em>\n
331 in Proceedings of the High Performance Computing & Simulation Conference (HPC&S'06), Bonn, Germany, May 2006.\n
332 http://navet.ics.hawaii.edu/~casanova/homepage/papers/cardinale_2006.pdf
333 - <b>Centralized Versus Distributed Schedulers for Multiple Bag-of-Tasks Applications</b>\n
334 by <em>Olivier Beaumont, Larry Carter, Jeanne Ferrante, Arnaud Legrand, Loris Marchal, and Yves Robert</em>\n
335 International Parallel and Distributed Processing Symposium IPDPS'2006, 2006
336 - <b>Interference-Aware Scheduling</b>\n
337 by <em>B. Kreaseck, L. Carter, H. Casanova, J. Ferrante, S. Nandy</em>\n
338 International Journal of High Performance Computing Applications (IJHPCA).\n
339 http://navet.ics.hawaii.edu/~casanova/homepage/papers/kreaseck_ijhpca_2005.pdf
341 - <b>From Heterogeneous Task Scheduling to Heterogeneous Mixed Data and Task Parallel Scheduling</b>\n
342 by <em>F. Suter, V. Boudet, F. Desprez, H. Casanova</em>\n
343 Proceedings of Europar, 230--237, (LNCS volume 3149), Pisa, Italy, August 2004.
344 - <b>On the Interference of Communication on Computation</b>\n
345 by <em>B. Kreaseck, L. Carter, H. Casanova, J. Ferrante</em>\n
346 Proceedings of the workshop on Performance Modeling, Evaluation, and Optimization of Parallel and Distributed Systems, Santa Fe, April 2004.\n
347 http://navet.ics.hawaii.edu/~casanova/homepage/papers/k_pmeo2004.pdf
350 - <b>RUMR: Robust Scheduling for Divisible Workloads</b>\n
351 by <em>Y. Yang, H. Casanova</em>\n
352 Proceedings of the 12th IEEE Symposium on High Performance and Distributed Computing (HPDC-12), Seattle, June 2003.\n
353 http://navet.ics.hawaii.edu/~casanova/homepage/papers/yang_hpdc2003.pdf
354 - <b>Resource Allocation Strategies for Guided Parameter Space Searches</b>\n
355 by <em>M. Faerman, A. Birnbaum, F. Berman, H. Casanova</em>\n
356 International Journal of High Performance Computing Applications (IJHPCA), 17(4), 383--402, 2003.\n
357 http://grail.sdsc.edu/papers/faerman_ijhpca04.pdf
359 - <b>Resource Allocation for Steerable Parallel Parameter Searches</b>\n
360 by <em>M. Faerman, A. Birnbaum, H. Casanova, F. Berman</em>\n
361 Proceedings of the Grid Computing Workshop, Baltimore, 157--169, November 2002.\n
362 http://grail.sdsc.edu/projects/vi_itr/grid02.pdf
364 - <b>Applying Scheduling and Tuning to On-line Parallel Tomography </b>\n
365 by <em>Shava Smallen, Henri Casanova, Francine Berman</em>\n
366 in Proceedings of Supercomputing 2001\n
367 http://grail.sdsc.edu/papers/tomo_journal.ps.gz
369 - <b>Heuristics for Scheduling Parameter Sweep applications in Grid environments</b>\n
370 by <em>Henri Casanova, Arnaud Legrand, Dmitrii Zagorodnov and Francine Berman</em>\n
371 in Proceedings of the 9th Heterogeneous Computing workshop (HCW'2000), pp. 349-363.\n
372 http://navet.ics.hawaii.edu/~casanova/homepage/papers/hcw00_pst.pdf
377 \li <b>Optimal algorithms for scheduling divisible workloads on
378 heterogeneous systems</b>\n
379 by <em>Olivier Beaumont and Arnaud Legrand and Yves Robert</em>\n
380 in Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS'03).\n
381 Preliminary version on ftp://ftp.ens-lyon.fr/pub/LIP/Rapports/RR/RR2002/RR2002-36.ps.gz
384 \li <b>On-line Parallel Tomography</b>\n
385 by <em>Shava Smallen</em>\n
386 Master's Thesis, UCSD, May 2001