3 p Testing a simple master/worker example application handling failures TCP crosstraffic DISABLED
6 $ ${bindir:=.}/s4u-platform-failures --log=xbt_cfg.thres:critical --log=no_loc ${platfdir}/small_platform_failures.xml ${srcdir:=.}/s4u-platform-failures_d.xml --cfg=path:${srcdir} --cfg=network/crosstraffic:0 "--log=root.fmt:[%10.6r]%e(%i:%a@%h)%e%m%n" --log=res_cpu.t:verbose
7 > [ 0.000000] (0:maestro@) Cannot launch actor 'worker' on failed host 'Fafard'
8 > [ 0.000000] (0:maestro@) Deployment includes some initially turned off Hosts ... nevermind.
9 > [ 0.000000] (1:master@Tremblay) Got 5 workers and 20 tasks to process
10 > [ 0.000000] (1:master@Tremblay) Send a message to worker-0
11 > [ 0.000000] (7:sleeper@Lilibeth) Start sleeping...
12 > [ 0.010309] (1:master@Tremblay) Send to worker-0 completed
13 > [ 0.010309] (2:worker@Tremblay) Start execution...
14 > [ 0.000000] (2:worker@Tremblay) Waiting a message on worker-0
15 > [ 0.000000] (3:worker@Jupiter) Waiting a message on worker-1
16 > [ 0.000000] (5:worker@Ginette) Waiting a message on worker-3
17 > [ 0.000000] (6:worker@Bourassa) Waiting a message on worker-4
18 > [ 0.010309] (1:master@Tremblay) Send a message to worker-1
19 > [ 1.000000] (0:maestro@) Restart actors on host Fafard
20 > [ 1.000000] (8:worker@Fafard) Waiting a message on worker-2
21 > [ 1.000000] (1:master@Tremblay) Mmh. The communication with 'worker-1' failed. Nevermind. Let's keep going!
22 > [ 1.000000] (1:master@Tremblay) Send a message to worker-2
23 > [ 2.000000] (1:master@Tremblay) Mmh. The communication with 'worker-2' failed. Nevermind. Let's keep going!
24 > [ 2.000000] (0:maestro@) Restart actors on host Jupiter
25 > [ 2.000000] (1:master@Tremblay) Send a message to worker-3
26 > [ 2.000000] (9:worker@Jupiter) Waiting a message on worker-1
27 > [ 2.010309] (2:worker@Tremblay) Execution complete.
28 > [ 2.010309] (2:worker@Tremblay) Waiting a message on worker-0
29 > [ 3.030928] (1:master@Tremblay) Send to worker-3 completed
30 > [ 3.030928] (1:master@Tremblay) Send a message to worker-4
31 > [ 3.030928] (5:worker@Ginette) Start execution...
32 > [ 4.061856] (1:master@Tremblay) Send to worker-4 completed
33 > [ 4.061856] (1:master@Tremblay) Send a message to worker-0
34 > [ 4.061856] (6:worker@Bourassa) Start execution...
35 > [ 4.072165] (1:master@Tremblay) Send to worker-0 completed
36 > [ 4.072165] (1:master@Tremblay) Send a message to worker-1
37 > [ 4.072165] (2:worker@Tremblay) Start execution...
38 > [ 5.000000] (0:maestro@) Restart actors on host Lilibeth
39 > [ 5.000000] (10:sleeper@Lilibeth) Start sleeping...
40 > [ 5.030928] (5:worker@Ginette) Execution complete.
41 > [ 5.030928] (5:worker@Ginette) Waiting a message on worker-3
42 > [ 5.103093] (1:master@Tremblay) Send to worker-1 completed
43 > [ 5.103093] (1:master@Tremblay) Send a message to worker-2
44 > [ 5.103093] (9:worker@Jupiter) Start execution...
45 > [ 6.000000] (10:sleeper@Lilibeth) done sleeping.
46 > [ 6.061856] (6:worker@Bourassa) Execution complete.
47 > [ 6.061856] (6:worker@Bourassa) Waiting a message on worker-4
48 > [ 6.072165] (2:worker@Tremblay) Execution complete.
49 > [ 6.072165] (2:worker@Tremblay) Waiting a message on worker-0
50 > [ 7.103093] (9:worker@Jupiter) Execution complete.
51 > [ 7.103093] (9:worker@Jupiter) Waiting a message on worker-1
52 > [ 15.103093] (1:master@Tremblay) Mmh. Got timeouted while speaking to 'worker-2'. Nevermind. Let's keep going!
53 > [ 15.103093] (1:master@Tremblay) Send a message to worker-3
54 > [ 15.103093] (1:master@Tremblay) Mmh. The communication with 'worker-3' failed. Nevermind. Let's keep going!
55 > [ 15.103093] (1:master@Tremblay) Send a message to worker-4
56 > [ 15.103093] (5:worker@Ginette) Mmh. Something went wrong. Nevermind. Let's keep going!
57 > [ 15.103093] (5:worker@Ginette) Waiting a message on worker-3
58 > [ 16.134021] (1:master@Tremblay) Send to worker-4 completed
59 > [ 16.134021] (1:master@Tremblay) Send a message to worker-0
60 > [ 16.134021] (6:worker@Bourassa) Start execution...
61 > [ 16.144330] (1:master@Tremblay) Send to worker-0 completed
62 > [ 16.144330] (1:master@Tremblay) Send a message to worker-1
63 > [ 16.144330] (2:worker@Tremblay) Start execution...
64 > [ 17.175258] (1:master@Tremblay) Send to worker-1 completed
65 > [ 17.175258] (1:master@Tremblay) Send a message to worker-2
66 > [ 17.175258] (9:worker@Jupiter) Start execution...
67 > [ 18.134021] (6:worker@Bourassa) Execution complete.
68 > [ 18.134021] (6:worker@Bourassa) Waiting a message on worker-4
69 > [ 18.144330] (2:worker@Tremblay) Execution complete.
70 > [ 18.144330] (2:worker@Tremblay) Waiting a message on worker-0
71 > [ 19.175258] (9:worker@Jupiter) Execution complete.
72 > [ 19.175258] (9:worker@Jupiter) Waiting a message on worker-1
73 > [ 20.000000] (0:maestro@) Restart actors on host Lilibeth
74 > [ 20.000000] (11:sleeper@Lilibeth) Start sleeping...
75 > [ 21.000000] (11:sleeper@Lilibeth) done sleeping.
76 > [ 27.175258] (1:master@Tremblay) Mmh. Got timeouted while speaking to 'worker-2'. Nevermind. Let's keep going!
77 > [ 27.175258] (1:master@Tremblay) Send a message to worker-3
78 > [ 28.206186] (1:master@Tremblay) Send to worker-3 completed
79 > [ 28.206186] (1:master@Tremblay) Send a message to worker-4
80 > [ 28.206186] (1:master@Tremblay) Mmh. The communication with 'worker-4' failed. Nevermind. Let's keep going!
81 > [ 28.206186] (1:master@Tremblay) Send a message to worker-0
82 > [ 28.206186] (5:worker@Ginette) Start execution...
83 > [ 28.206186] (6:worker@Bourassa) Mmh. Something went wrong. Nevermind. Let's keep going!
84 > [ 28.206186] (6:worker@Bourassa) Waiting a message on worker-4
85 > [ 28.216495] (1:master@Tremblay) Send to worker-0 completed
86 > [ 28.216495] (1:master@Tremblay) Send a message to worker-1
87 > [ 28.216495] (2:worker@Tremblay) Start execution...
88 > [ 29.247423] (1:master@Tremblay) Send to worker-1 completed
89 > [ 29.247423] (1:master@Tremblay) Send a message to worker-2
90 > [ 29.247423] (9:worker@Jupiter) Start execution...
91 > [ 30.206186] (5:worker@Ginette) Execution complete.
92 > [ 30.206186] (5:worker@Ginette) Waiting a message on worker-3
93 > [ 30.216495] (2:worker@Tremblay) Execution complete.
94 > [ 30.216495] (2:worker@Tremblay) Waiting a message on worker-0
95 > [ 31.247423] (9:worker@Jupiter) Execution complete.
96 > [ 31.247423] (9:worker@Jupiter) Waiting a message on worker-1
97 > [ 35.000000] (0:maestro@) Restart actors on host Lilibeth
98 > [ 35.000000] (12:sleeper@Lilibeth) Start sleeping...
99 > [ 36.000000] (12:sleeper@Lilibeth) done sleeping.
100 > [ 39.247423] (1:master@Tremblay) Mmh. Got timeouted while speaking to 'worker-2'. Nevermind. Let's keep going!
101 > [ 39.247423] (1:master@Tremblay) Send a message to worker-3
102 > [ 40.278351] (1:master@Tremblay) Send to worker-3 completed
103 > [ 40.278351] (1:master@Tremblay) Send a message to worker-4
104 > [ 40.278351] (5:worker@Ginette) Start execution...
105 > [ 41.309278] (1:master@Tremblay) Send to worker-4 completed
106 > [ 41.309278] (1:master@Tremblay) All tasks have been dispatched. Let's tell everybody the computation is over.
107 > [ 41.309278] (2:worker@Tremblay) I'm done. See you!
108 > [ 41.309278] (6:worker@Bourassa) Start execution...
109 > [ 41.309278] (9:worker@Jupiter) I'm done. See you!
110 > [ 42.309278] (1:master@Tremblay) Mmh. Got timeouted while speaking to 'worker-2'. Nevermind. Let's keep going!
111 > [ 43.309278] (0:maestro@) Simulation time 43.3093
112 > [ 43.309278] (1:master@Tremblay) Mmh. Got timeouted while speaking to 'worker-3'. Nevermind. Let's keep going!
113 > [ 43.309278] (1:master@Tremblay) Goodbye now!
114 > [ 43.309278] (6:worker@Bourassa) Execution complete.
115 > [ 43.309278] (6:worker@Bourassa) Waiting a message on worker-4
116 > [ 43.309278] (6:worker@Bourassa) I'm done. See you!
118 p Testing a simple master/worker example application handling failures. TCP crosstraffic ENABLED
121 $ ${bindir:=.}/s4u-platform-failures --log=xbt_cfg.thres:critical --log=no_loc ${platfdir}/small_platform_failures.xml ${srcdir:=.}/s4u-platform-failures_d.xml --cfg=path:${srcdir} "--log=root.fmt:[%10.6r]%e(%i:%a@%h)%e%m%n" --log=res_cpu.t:verbose
122 > [ 0.000000] (0:maestro@) Cannot launch actor 'worker' on failed host 'Fafard'
123 > [ 0.000000] (0:maestro@) Deployment includes some initially turned off Hosts ... nevermind.
124 > [ 0.000000] (1:master@Tremblay) Got 5 workers and 20 tasks to process
125 > [ 0.000000] (1:master@Tremblay) Send a message to worker-0
126 > [ 0.000000] (2:worker@Tremblay) Waiting a message on worker-0
127 > [ 0.000000] (3:worker@Jupiter) Waiting a message on worker-1
128 > [ 0.000000] (5:worker@Ginette) Waiting a message on worker-3
129 > [ 0.000000] (6:worker@Bourassa) Waiting a message on worker-4
130 > [ 0.000000] (7:sleeper@Lilibeth) Start sleeping...
131 > [ 0.010825] (2:worker@Tremblay) Start execution...
132 > [ 0.010825] (1:master@Tremblay) Send to worker-0 completed
133 > [ 0.010825] (1:master@Tremblay) Send a message to worker-1
134 > [ 1.000000] (0:maestro@) Restart actors on host Fafard
135 > [ 1.000000] (8:worker@Fafard) Waiting a message on worker-2
136 > [ 1.000000] (1:master@Tremblay) Mmh. The communication with 'worker-1' failed. Nevermind. Let's keep going!
137 > [ 1.000000] (1:master@Tremblay) Send a message to worker-2
138 > [ 2.000000] (0:maestro@) Restart actors on host Jupiter
139 > [ 2.000000] (9:worker@Jupiter) Waiting a message on worker-1
140 > [ 2.000000] (1:master@Tremblay) Mmh. The communication with 'worker-2' failed. Nevermind. Let's keep going!
141 > [ 2.000000] (1:master@Tremblay) Send a message to worker-3
142 > [ 2.010825] (2:worker@Tremblay) Execution complete.
143 > [ 2.010825] (2:worker@Tremblay) Waiting a message on worker-0
144 > [ 3.082474] (5:worker@Ginette) Start execution...
145 > [ 3.082474] (1:master@Tremblay) Send to worker-3 completed
146 > [ 3.082474] (1:master@Tremblay) Send a message to worker-4
147 > [ 4.164948] (6:worker@Bourassa) Start execution...
148 > [ 4.164948] (1:master@Tremblay) Send to worker-4 completed
149 > [ 4.164948] (1:master@Tremblay) Send a message to worker-0
150 > [ 4.175773] (2:worker@Tremblay) Start execution...
151 > [ 4.175773] (1:master@Tremblay) Send to worker-0 completed
152 > [ 4.175773] (1:master@Tremblay) Send a message to worker-1
153 > [ 5.000000] (0:maestro@) Restart actors on host Lilibeth
154 > [ 5.000000] (10:sleeper@Lilibeth) Start sleeping...
155 > [ 5.082474] (5:worker@Ginette) Execution complete.
156 > [ 5.082474] (5:worker@Ginette) Waiting a message on worker-3
157 > [ 5.258247] (9:worker@Jupiter) Start execution...
158 > [ 5.258247] (1:master@Tremblay) Send to worker-1 completed
159 > [ 5.258247] (1:master@Tremblay) Send a message to worker-2
160 > [ 6.000000] (10:sleeper@Lilibeth) done sleeping.
161 > [ 6.164948] (6:worker@Bourassa) Execution complete.
162 > [ 6.164948] (6:worker@Bourassa) Waiting a message on worker-4
163 > [ 6.175773] (2:worker@Tremblay) Execution complete.
164 > [ 6.175773] (2:worker@Tremblay) Waiting a message on worker-0
165 > [ 7.258247] (9:worker@Jupiter) Execution complete.
166 > [ 7.258247] (9:worker@Jupiter) Waiting a message on worker-1
167 > [ 15.258247] (1:master@Tremblay) Mmh. Got timeouted while speaking to 'worker-2'. Nevermind. Let's keep going!
168 > [ 15.258247] (1:master@Tremblay) Send a message to worker-3
169 > [ 15.258247] (5:worker@Ginette) Mmh. Something went wrong. Nevermind. Let's keep going!
170 > [ 15.258247] (5:worker@Ginette) Waiting a message on worker-3
171 > [ 15.258247] (1:master@Tremblay) Mmh. The communication with 'worker-3' failed. Nevermind. Let's keep going!
172 > [ 15.258247] (1:master@Tremblay) Send a message to worker-4
173 > [ 16.340722] (6:worker@Bourassa) Start execution...
174 > [ 16.340722] (1:master@Tremblay) Send to worker-4 completed
175 > [ 16.340722] (1:master@Tremblay) Send a message to worker-0
176 > [ 16.351546] (2:worker@Tremblay) Start execution...
177 > [ 16.351546] (1:master@Tremblay) Send to worker-0 completed
178 > [ 16.351546] (1:master@Tremblay) Send a message to worker-1
179 > [ 17.434021] (9:worker@Jupiter) Start execution...
180 > [ 17.434021] (1:master@Tremblay) Send to worker-1 completed
181 > [ 17.434021] (1:master@Tremblay) Send a message to worker-2
182 > [ 18.340722] (6:worker@Bourassa) Execution complete.
183 > [ 18.340722] (6:worker@Bourassa) Waiting a message on worker-4
184 > [ 18.351546] (2:worker@Tremblay) Execution complete.
185 > [ 18.351546] (2:worker@Tremblay) Waiting a message on worker-0
186 > [ 19.434021] (9:worker@Jupiter) Execution complete.
187 > [ 19.434021] (9:worker@Jupiter) Waiting a message on worker-1
188 > [ 20.000000] (0:maestro@) Restart actors on host Lilibeth
189 > [ 20.000000] (11:sleeper@Lilibeth) Start sleeping...
190 > [ 21.000000] (11:sleeper@Lilibeth) done sleeping.
191 > [ 27.434021] (1:master@Tremblay) Mmh. Got timeouted while speaking to 'worker-2'. Nevermind. Let's keep going!
192 > [ 27.434021] (1:master@Tremblay) Send a message to worker-3
193 > [ 28.516495] (5:worker@Ginette) Start execution...
194 > [ 28.516495] (1:master@Tremblay) Send to worker-3 completed
195 > [ 28.516495] (1:master@Tremblay) Send a message to worker-4
196 > [ 28.516495] (6:worker@Bourassa) Mmh. Something went wrong. Nevermind. Let's keep going!
197 > [ 28.516495] (6:worker@Bourassa) Waiting a message on worker-4
198 > [ 28.516495] (1:master@Tremblay) Mmh. The communication with 'worker-4' failed. Nevermind. Let's keep going!
199 > [ 28.516495] (1:master@Tremblay) Send a message to worker-0
200 > [ 28.527320] (2:worker@Tremblay) Start execution...
201 > [ 28.527320] (1:master@Tremblay) Send to worker-0 completed
202 > [ 28.527320] (1:master@Tremblay) Send a message to worker-1
203 > [ 29.609794] (9:worker@Jupiter) Start execution...
204 > [ 29.609794] (1:master@Tremblay) Send to worker-1 completed
205 > [ 29.609794] (1:master@Tremblay) Send a message to worker-2
206 > [ 30.516495] (5:worker@Ginette) Execution complete.
207 > [ 30.516495] (5:worker@Ginette) Waiting a message on worker-3
208 > [ 30.527320] (2:worker@Tremblay) Execution complete.
209 > [ 30.527320] (2:worker@Tremblay) Waiting a message on worker-0
210 > [ 31.609794] (9:worker@Jupiter) Execution complete.
211 > [ 31.609794] (9:worker@Jupiter) Waiting a message on worker-1
212 > [ 35.000000] (0:maestro@) Restart actors on host Lilibeth
213 > [ 35.000000] (12:sleeper@Lilibeth) Start sleeping...
214 > [ 36.000000] (12:sleeper@Lilibeth) done sleeping.
215 > [ 39.609794] (1:master@Tremblay) Mmh. Got timeouted while speaking to 'worker-2'. Nevermind. Let's keep going!
216 > [ 39.609794] (1:master@Tremblay) Send a message to worker-3
217 > [ 40.692268] (5:worker@Ginette) Start execution...
218 > [ 40.692268] (1:master@Tremblay) Send to worker-3 completed
219 > [ 40.692268] (1:master@Tremblay) Send a message to worker-4
220 > [ 41.774742] (6:worker@Bourassa) Start execution...
221 > [ 41.774742] (1:master@Tremblay) Send to worker-4 completed
222 > [ 41.774742] (1:master@Tremblay) All tasks have been dispatched. Let's tell everybody the computation is over.
223 > [ 41.774742] (2:worker@Tremblay) I'm done. See you!
224 > [ 41.774742] (9:worker@Jupiter) I'm done. See you!
225 > [ 42.774742] (1:master@Tremblay) Mmh. Got timeouted while speaking to 'worker-2'. Nevermind. Let's keep going!
226 > [ 43.774742] (6:worker@Bourassa) Execution complete.
227 > [ 43.774742] (6:worker@Bourassa) Waiting a message on worker-4
228 > [ 43.774742] (1:master@Tremblay) Mmh. Got timeouted while speaking to 'worker-3'. Nevermind. Let's keep going!
229 > [ 43.774742] (6:worker@Bourassa) I'm done. See you!
230 > [ 43.774742] (1:master@Tremblay) Goodbye now!
231 > [ 43.774742] (0:maestro@) Simulation time 43.7747