Logo AND Algorithmique Numérique Distribuée

Public GIT Repository
simgrid.git
7 years agoreact correctly if an exception does not originates the current process
Martin Quinson [Wed, 6 Aug 2014 23:59:31 +0000 (01:59 +0200)]
react correctly if an exception does not originates the current process

7 years agoensure that the process initializer actually works by using it
Martin Quinson [Wed, 6 Aug 2014 22:54:03 +0000 (00:54 +0200)]
ensure that the process initializer actually works by using it

7 years agomake public the function that can display an exception
Martin Quinson [Wed, 6 Aug 2014 22:53:00 +0000 (00:53 +0200)]
make public the function that can display an exception

7 years agouseless cleanups
Martin Quinson [Wed, 6 Aug 2014 20:04:35 +0000 (22:04 +0200)]
useless cleanups

- one letter variables are harder to read
- remove a useless assert (the system will complain if it's null)
- improve some debug messages and comments

7 years agoignore more generated files
Martin Quinson [Wed, 6 Aug 2014 19:50:35 +0000 (21:50 +0200)]
ignore more generated files

7 years ago[mc] Remove useless code & comment in mc_region_restore_sparse()
Gabriel Corona [Fri, 1 Aug 2014 13:25:08 +0000 (15:25 +0200)]
[mc] Remove useless code & comment in mc_region_restore_sparse()

This is not supposed to happen.

7 years ago[mc] DRY in MC_replay_liveness()
Gabriel Corona [Fri, 1 Aug 2014 13:07:21 +0000 (15:07 +0200)]
[mc] DRY in MC_replay_liveness()

7 years agobegin to add bcast MVAPICH collectives selector..
Augustin Degomme [Fri, 1 Aug 2014 11:51:56 +0000 (13:51 +0200)]
begin to add bcast MVAPICH collectives selector..
still defaults to mpich one for now

7 years agowarning --
Augustin Degomme [Fri, 1 Aug 2014 11:38:56 +0000 (13:38 +0200)]
warning --

7 years agoremove uninitialized warning
Augustin Degomme [Fri, 1 Aug 2014 11:09:49 +0000 (13:09 +0200)]
remove uninitialized warning

7 years agoremove useless (as for now) code
Augustin Degomme [Fri, 1 Aug 2014 10:48:29 +0000 (12:48 +0200)]
remove useless (as for now) code

7 years agofix dist
Augustin Degomme [Fri, 1 Aug 2014 10:38:05 +0000 (12:38 +0200)]
fix dist

7 years agoAdd Scatter SMP collective from MVAPICH2
Augustin Degomme [Fri, 1 Aug 2014 10:35:18 +0000 (12:35 +0200)]
Add Scatter SMP collective from MVAPICH2

7 years agoAdd Reduce SMP collective from MVAPICH2
Augustin Degomme [Fri, 1 Aug 2014 08:50:47 +0000 (10:50 +0200)]
Add Reduce SMP collective from MVAPICH2

7 years agoadd Allreduce SMP collective from MVAPICH2
Augustin Degomme [Fri, 1 Aug 2014 08:50:02 +0000 (10:50 +0200)]
add Allreduce SMP collective from MVAPICH2

7 years agoAdd Allgather SMP collective from MVAPICH2
Augustin Degomme [Fri, 1 Aug 2014 08:49:19 +0000 (10:49 +0200)]
Add Allgather SMP collective from MVAPICH2

7 years agoAdd Gather SMP collective from MVAPICH2
Augustin Degomme [Fri, 1 Aug 2014 08:48:30 +0000 (10:48 +0200)]
Add Gather SMP collective from MVAPICH2

7 years agoprovide support for SMP in MPI communicators.
Augustin Degomme [Fri, 1 Aug 2014 08:45:52 +0000 (10:45 +0200)]
provide support for SMP in MPI communicators.
smpi_comm_init_smp(comm) will create subcommunicators for intra and inter ndoes communications.
This is based on what MVAPICH2 does

7 years agochange hostfile used for mpich testsuite.
Augustin Degomme [Fri, 1 Aug 2014 08:43:45 +0000 (10:43 +0200)]
change hostfile used for mpich testsuite.
Deactivate a fortran part of a test, that made very strong assertions on order of completing calls..

7 years agoMove group_incl code, to allow use by collective algos or internal calls
Augustin Degomme [Fri, 1 Aug 2014 07:36:29 +0000 (09:36 +0200)]
Move group_incl code, to allow use by collective algos or internal calls

7 years agochange SMPI collectives hostfile, to use 4 contiguous processes/node
Augustin Degomme [Thu, 31 Jul 2014 12:53:51 +0000 (14:53 +0200)]
change SMPI collectives hostfile, to use 4 contiguous processes/node

7 years agoFix annoying typo in help message.
Stéphane Castelli [Thu, 31 Jul 2014 09:19:36 +0000 (11:19 +0200)]
Fix annoying typo in help message.

7 years agoSR experiment now working
etortilopez [Wed, 30 Jul 2014 08:22:00 +0000 (10:22 +0200)]
SR experiment now working

7 years agoAdd ignored files to .gitignore
Gabriel Corona [Tue, 29 Jul 2014 09:28:47 +0000 (11:28 +0200)]
Add ignored files to .gitignore

7 years ago[mc] Renable the sparse snpashot tests (after fixing them)
Gabriel Corona [Tue, 29 Jul 2014 08:17:12 +0000 (10:17 +0200)]
[mc] Renable the sparse snpashot tests (after fixing them)

This reverts commit 331c666b6237387356618583cf6708f2f981ea45.

7 years agomissing spaces between flags
Augustin Degomme [Mon, 28 Jul 2014 17:07:05 +0000 (19:07 +0200)]
missing spaces between flags

7 years agoSome old compilers seem to emit bogus warnings on ci
Augustin Degomme [Mon, 28 Jul 2014 17:02:23 +0000 (19:02 +0200)]
Some old compilers seem to emit bogus warnings on ci
See https://gcc.gnu.org/bugzilla/show_bug.cgi?id=45978 .
Avoid failing the build in this case

7 years agoremove warnings
Augustin Degomme [Mon, 28 Jul 2014 16:05:57 +0000 (18:05 +0200)]
remove warnings

7 years agomissing files - adrien
Adrien Lebre [Mon, 28 Jul 2014 16:02:23 +0000 (18:02 +0200)]
missing files - adrien

7 years agoMerge branch 'master' of git+ssh://scm.gforge.inria.fr//gitroot/simgrid/simgrid
Adrien Lebre [Mon, 28 Jul 2014 15:53:58 +0000 (17:53 +0200)]
Merge branch 'master' of git+ssh://scm.gforge.inria.fr//gitroot/simgrid/simgrid

7 years agoFound a bug/feature regarding the cancellation of a sleep (see comments directly...
Adrien Lebre [Mon, 28 Jul 2014 15:53:54 +0000 (17:53 +0200)]
Found a bug/feature regarding the cancellation of a sleep (see comments directly in the code) + add a temporary patch directly at the java level in order to force a hostFailureException - Adrien

7 years agofix dist
Augustin Degomme [Mon, 28 Jul 2014 15:17:11 +0000 (17:17 +0200)]
fix dist

7 years agothese assertions were a bit strong
Augustin Degomme [Mon, 28 Jul 2014 14:44:44 +0000 (16:44 +0200)]
these assertions were a bit strong

7 years agomake some algorithms return when using bad datatypes
Augustin Degomme [Mon, 28 Jul 2014 14:35:58 +0000 (16:35 +0200)]
make some algorithms return when using bad datatypes

7 years agoAdd Intel MPI (impi) selector.
Augustin Degomme [Mon, 28 Jul 2014 14:35:17 +0000 (16:35 +0200)]
Add Intel MPI (impi) selector.

Thresholds were obtained on Stampede cluster, by activating debug output for 1 process/node.
Algorithm list is available in the documentation of Intel MPI, available on their site

problems:
- doesn't take into account SMP for now (selection logic evolves, and thresholds change)
- some algorithms are unavailable (proprietary/undocumented) such as Shumilin's or Plum's. So others are used in these cases... And that's bad.

7 years agosome algos have bad behavior with some inputs, try to avoid that.
Augustin Degomme [Mon, 28 Jul 2014 14:30:35 +0000 (16:30 +0200)]
some algos have bad behavior with some inputs, try to avoid that.

7 years agoactivate a previously commented pairwise alltoall algo with rma comms
Augustin Degomme [Mon, 28 Jul 2014 11:16:22 +0000 (13:16 +0200)]
activate a previously commented pairwise alltoall algo with rma comms

7 years agoAdd Rabenseifner Reduce/Allreduce algorithms
Augustin Degomme [Fri, 25 Jul 2014 13:48:12 +0000 (15:48 +0200)]
Add Rabenseifner Reduce/Allreduce algorithms
See https://fs.hlrs.de/projects/par/mpi//myreduce.html for details

7 years agoFix host_on_off_process tesh
Paul Bédaride [Mon, 28 Jul 2014 14:27:56 +0000 (16:27 +0200)]
Fix host_on_off_process tesh

7 years agoFix tesh host_on_off_processes 4
Paul Bédaride [Mon, 28 Jul 2014 13:09:56 +0000 (15:09 +0200)]
Fix tesh host_on_off_processes 4

7 years agoMerge branch 'master' of git+ssh://scm.gforge.inria.fr//gitroot/simgrid/simgrid
Adrien Lebre [Mon, 28 Jul 2014 12:43:54 +0000 (14:43 +0200)]
Merge branch 'master' of git+ssh://scm.gforge.inria.fr//gitroot/simgrid/simgrid

7 years agofix tesh (add the sleep test output) - adrien
Adrien Lebre [Mon, 28 Jul 2014 12:43:50 +0000 (14:43 +0200)]
fix tesh (add the sleep test output) - adrien

7 years agoShell script to generate a simple tesh file
Gabriel Corona [Mon, 28 Jul 2014 12:39:54 +0000 (14:39 +0200)]
Shell script to generate a simple tesh file

7 years agoMerge branch 'master' of git+ssh://scm.gforge.inria.fr//gitroot/simgrid/simgrid
Adrien Lebre [Mon, 28 Jul 2014 12:19:04 +0000 (14:19 +0200)]
Merge branch 'master' of git+ssh://scm.gforge.inria.fr//gitroot/simgrid/simgrid

7 years agoAdd test sleep for host_on_off + minor changes - Adrien
Adrien Lebre [Mon, 28 Jul 2014 12:18:57 +0000 (14:18 +0200)]
Add test sleep for host_on_off + minor changes - Adrien

7 years ago[mc] Disable sparse snpashot tests
Gabriel Corona [Mon, 28 Jul 2014 09:35:05 +0000 (11:35 +0200)]
[mc] Disable sparse snpashot tests

It seems sparse snapshots are not stable yet.

7 years ago[mc] Remove unused variables
Gabriel Corona [Mon, 28 Jul 2014 09:08:10 +0000 (11:08 +0200)]
[mc] Remove unused variables

Stop the continuous integration server from complaining.

7 years agoMerge branch 'master' of git+ssh://scm.gforge.inria.fr//gitroot/simgrid/simgrid
Adrien Lebre [Mon, 28 Jul 2014 08:54:11 +0000 (10:54 +0200)]
Merge branch 'master' of git+ssh://scm.gforge.inria.fr//gitroot/simgrid/simgrid

7 years agoRollback - the java code should make a THROWF call - adrien
Adrien Lebre [Mon, 28 Jul 2014 08:54:05 +0000 (10:54 +0200)]
Rollback - the java code should make a THROWF call - adrien

7 years ago[mc] Make state/snapshot comparison work with SMPI variable privatisation
Gabriel Corona [Mon, 30 Jun 2014 11:06:12 +0000 (13:06 +0200)]
[mc] Make state/snapshot comparison work with SMPI variable privatisation

Changes :

  * a snapshot region now has a standard address (address in the
    virtual process) and permanent address (address of the
    privatisation region, where the memory is always mapped event when
    another process is active);

  * handle privatised memory regions on snapshot and snapshot restore;

  * compare global vaiable separately for each simulated process;

  * the ID of the simulated must be passed everywhere;

  * mc_snapshot_read() and friends as well as DWARF expression can be
    evaluated in the context of a given simulated process and fetch
    the data in the corresponding privatisation region.

7 years agoMerge branch 'mc'
Gabriel Corona [Fri, 25 Jul 2014 11:28:50 +0000 (13:28 +0200)]
Merge branch 'mc'

Conflicts:
src/mc/mc_compare.cpp

7 years ago[mc] Change the signature of mc_restore_page_snapshot_region
Gabriel Corona [Fri, 25 Jul 2014 10:24:59 +0000 (12:24 +0200)]
[mc] Change the signature of mc_restore_page_snapshot_region

Better separation of the different snapshot layers.

7 years ago[mc] Split mc_region_new_dense out of MC_region_new
Gabriel Corona [Fri, 25 Jul 2014 10:24:22 +0000 (12:24 +0200)]
[mc] Split mc_region_new_dense out of MC_region_new

MC_region_new uses the global SimGrid setting.
mc_region_new_dense uses dense snapshot.

7 years ago[mc] Expand unit test of mc_snapshot.c
Gabriel Corona [Fri, 25 Jul 2014 10:23:00 +0000 (12:23 +0200)]
[mc] Expand unit test of mc_snapshot.c

7 years ago[mc] Remove page_store.tesh (has been changed into a unit test)
Gabriel Corona [Fri, 25 Jul 2014 10:42:42 +0000 (12:42 +0200)]
[mc] Remove page_store.tesh (has been changed into a unit test)

7 years ago(stupid) warning --
Augustin Degomme [Thu, 24 Jul 2014 16:36:52 +0000 (18:36 +0200)]
(stupid) warning --

7 years agoFix msg-host-on-off tesh
Paul Bédaride [Thu, 24 Jul 2014 15:47:19 +0000 (17:47 +0200)]
Fix msg-host-on-off tesh

7 years agoAdd and use knomial reduce algorithm from mvapich
Augustin Degomme [Thu, 24 Jul 2014 15:26:37 +0000 (17:26 +0200)]
Add and use knomial reduce algorithm from mvapich

7 years agoAdd and use mvapich's scatter_dest alltoall algorithm
Augustin Degomme [Thu, 24 Jul 2014 14:41:28 +0000 (16:41 +0200)]
Add and use mvapich's scatter_dest alltoall algorithm

7 years agoMerge remote-tracking branch 'origin/mc-fastsnapshot' into mc
Gabriel Corona [Thu, 24 Jul 2014 14:11:19 +0000 (16:11 +0200)]
Merge remote-tracking branch 'origin/mc-fastsnapshot' into mc

Conflicts:
buildtools/Cmake/Flags.cmake

7 years ago[mc] Remove irrelevant comment
Gabriel Corona [Thu, 24 Jul 2014 14:10:02 +0000 (16:10 +0200)]
[mc] Remove irrelevant comment

7 years agofix daemon typos
Adrien Lebre [Thu, 24 Jul 2014 14:01:33 +0000 (16:01 +0200)]
fix daemon typos

7 years agoMerge branch 'master' of git+ssh://scm.gforge.inria.fr//gitroot/simgrid/simgrid
Adrien Lebre [Thu, 24 Jul 2014 13:49:43 +0000 (15:49 +0200)]
Merge branch 'master' of git+ssh://scm.gforge.inria.fr//gitroot/simgrid/simgrid

7 years agofix minor issues in the management of Java Exceptions + comment process cancelled...
Adrien Lebre [Thu, 24 Jul 2014 13:49:37 +0000 (15:49 +0200)]
fix minor issues in the management of Java Exceptions + comment process cancelled exception call

7 years agofix dist
Augustin Degomme [Thu, 24 Jul 2014 13:39:18 +0000 (15:39 +0200)]
fix dist

7 years agoindent
Augustin Degomme [Thu, 24 Jul 2014 13:22:33 +0000 (15:22 +0200)]
indent

7 years agoprotect these calls against MPI_DATATYPE_NULL errors
Augustin Degomme [Thu, 24 Jul 2014 13:20:49 +0000 (15:20 +0200)]
protect these calls against MPI_DATATYPE_NULL errors

7 years agocleanup a bit the code, ensure tests do pass
Augustin Degomme [Thu, 24 Jul 2014 13:20:25 +0000 (15:20 +0200)]
cleanup a bit the code, ensure tests do pass

7 years agoadd mvapich allreduce rs algorithm
Augustin Degomme [Thu, 24 Jul 2014 09:06:29 +0000 (11:06 +0200)]
add mvapich allreduce rs algorithm

7 years agomanually privatize allred test, to have it work on non-mmap systems also
Augustin Degomme [Thu, 24 Jul 2014 08:41:00 +0000 (10:41 +0200)]
manually privatize allred test, to have it work on non-mmap systems also

7 years agoAdd last collectives from mvapich selector : bcast reduce reduce_scatter scatter
Augustin Degomme [Thu, 24 Jul 2014 00:04:40 +0000 (02:04 +0200)]
Add last collectives from mvapich selector : bcast reduce reduce_scatter scatter

bcast stilll defaults to mpich one, as they need smp support

7 years agoNew collectives for mvapich2 selector : allgatherv, allreduce, alltoallv, barrier
Augustin Degomme [Wed, 23 Jul 2014 15:35:52 +0000 (17:35 +0200)]
New collectives for mvapich2 selector : allgatherv, allreduce, alltoallv, barrier

7 years agoBegin to add a MVAPICH2 collectives selector. Alltoall, Allgather and gather done.
Augustin Degomme [Wed, 23 Jul 2014 12:26:17 +0000 (14:26 +0200)]
Begin to add a MVAPICH2 collectives selector. Alltoall, Allgather and gather done.

Problems :
- code is (for now) quite a mess, with several versions of tuning available at the same time (coll folder from mvapich has currently 145k cloc).
- code is copy-pasted directly ... So, no comments
- only Stampede calibration is imported for now. Some others are available, we should provide a mechanism to switch to another calibration.
- MVAPICH collectives are SMP aware. SMPI is not really ... A mechanism to automatically generate an "internal" communicator for processes sharing a physical node will be needed. Gather actually defaults to mpich one as a result of this.

7 years agoFix mallocator tesh
Paul Bédaride [Thu, 24 Jul 2014 13:16:12 +0000 (15:16 +0200)]
Fix mallocator tesh

7 years ago[mc] Use mc_region_contain where it could be used
Gabriel Corona [Thu, 24 Jul 2014 12:43:13 +0000 (14:43 +0200)]
[mc] Use mc_region_contain where it could be used

7 years ago[mc] Udpate doxygen comments
Gabriel Corona [Thu, 24 Jul 2014 12:09:55 +0000 (14:09 +0200)]
[mc] Udpate doxygen comments

7 years agoAdd test for host on off and some fixes
Paul Bédaride [Thu, 24 Jul 2014 09:53:48 +0000 (11:53 +0200)]
Add test for host on off and some fixes

7 years agoSet 4 tests that fail on Windows and won't ever pass as "expected to fail"
Augustin Degomme [Wed, 23 Jul 2014 09:07:42 +0000 (11:07 +0200)]
Set 4 tests that fail on Windows and won't ever pass as "expected to fail"

This will allow to better discriminate between bugs and wontfix non-issues.

One of them fails because random returns differently on win and generates different colors in the resulting trace. Duh.
The three others are expected to crash, and do crash correctly on Windows, but the return code is different from the one expected (SIGABRT). Meh

7 years ago[mc] Add tests for sparse snapshot
Gabriel Corona [Tue, 22 Jul 2014 13:42:44 +0000 (15:42 +0200)]
[mc] Add tests for sparse snapshot

7 years ago[mc] Disable soft-dirty page tracking by default
Gabriel Corona [Tue, 22 Jul 2014 13:15:17 +0000 (15:15 +0200)]
[mc] Disable soft-dirty page tracking by default

In all tests I ran, it has a negative impact on performance.

7 years agosanitize get/set_name functions for fortran use
Augustin Degomme [Mon, 21 Jul 2014 13:22:50 +0000 (15:22 +0200)]
sanitize get/set_name functions for fortran use

7 years ago[mc] Disable optimisation for xbt when using MC
Gabriel Corona [Mon, 21 Jul 2014 12:36:16 +0000 (14:36 +0200)]
[mc] Disable optimisation for xbt when using MC

In some cases, it breaks the state comparison for some reason. This
was observed with mm.c and dynar.c.

7 years ago[mc] Do not handle mc_model_checker->parent_snapshot unless it is necessary
Gabriel Corona [Mon, 21 Jul 2014 11:38:08 +0000 (13:38 +0200)]
[mc] Do not handle mc_model_checker->parent_snapshot unless it is necessary

7 years agoAdd MPI_Win_get_name and MPI_Win_set_name support
Augustin Degomme [Mon, 21 Jul 2014 09:33:08 +0000 (11:33 +0200)]
Add MPI_Win_get_name and MPI_Win_set_name support

7 years agoreactivate allred test
Augustin Degomme [Mon, 21 Jul 2014 09:32:40 +0000 (11:32 +0200)]
reactivate allred test

But have its compilation flags set to O0 to avoid issues with ci slaves (too long time to compile with optims all the macros used, and too much memory used)

7 years ago[mmalloc] Force metadata update in mmalloc/mrealloc
Gabriel Corona [Thu, 17 Jul 2014 14:30:45 +0000 (16:30 +0200)]
[mmalloc] Force metadata update in mmalloc/mrealloc

7 years ago[mmalloc] Add mmcheck() which checks mmalloc heap consistency
Gabriel Corona [Thu, 17 Jul 2014 08:41:20 +0000 (10:41 +0200)]
[mmalloc] Add mmcheck() which checks mmalloc heap consistency

7 years ago[mmalloc] Add new block type for heapinfo blocks
Gabriel Corona [Thu, 10 Jul 2014 13:48:16 +0000 (15:48 +0200)]
[mmalloc] Add new block type for heapinfo blocks

7 years agorevert changes on allgatherv4, which needed manual privatization to run on freebsd
Augustin Degomme [Thu, 17 Jul 2014 16:38:50 +0000 (18:38 +0200)]
revert changes on allgatherv4, which needed manual privatization to run on freebsd

7 years agoremove warning with mc
Augustin Degomme [Thu, 17 Jul 2014 16:15:02 +0000 (18:15 +0200)]
remove warning with mc

7 years agoremove warning
Augustin Degomme [Thu, 17 Jul 2014 16:00:47 +0000 (18:00 +0200)]
remove warning

7 years agoFinish pulling changes from mpich trunk testsuite
Augustin Degomme [Thu, 17 Jul 2014 15:52:24 +0000 (17:52 +0200)]
Finish pulling changes from mpich trunk testsuite

7 years agoUpdate comm, datatype from mpich trunk
Augustin Degomme [Thu, 17 Jul 2014 14:54:35 +0000 (16:54 +0200)]
Update comm, datatype from mpich trunk

7 years agoenforce a scatter error in some cases
Augustin Degomme [Thu, 17 Jul 2014 13:38:45 +0000 (15:38 +0200)]
enforce a scatter error in some cases

7 years agoupdate collectives teshsuite from mpich git (only minor changes)
Augustin Degomme [Thu, 17 Jul 2014 13:38:28 +0000 (15:38 +0200)]
update collectives teshsuite from mpich git (only minor changes)

7 years agotesh update for fabien's work on surf
Augustin Degomme [Thu, 17 Jul 2014 09:06:53 +0000 (11:06 +0200)]
tesh update for fabien's work on surf

7 years agofabien's work on surf
Augustin Degomme [Thu, 17 Jul 2014 09:06:12 +0000 (11:06 +0200)]
fabien's work on surf

7 years agoPush Takahiro Patch and fix cloud.tesh - Adrien
Adrien Lebre [Wed, 16 Jul 2014 17:39:34 +0000 (19:39 +0200)]
Push Takahiro Patch and fix cloud.tesh - Adrien

7 years agoMerge branch 'master' of git+ssh://scm.gforge.inria.fr//gitroot/simgrid/simgrid
Adrien Lebre [Wed, 16 Jul 2014 16:09:41 +0000 (18:09 +0200)]
Merge branch 'master' of git+ssh://scm.gforge.inria.fr//gitroot/simgrid/simgrid

7 years agoFix a typo in the java-cloud example and add one TODO related to the migration invoca...
Adrien Lebre [Wed, 16 Jul 2014 16:09:37 +0000 (18:09 +0200)]
Fix a typo in the java-cloud example and add one TODO related to the migration invocation in VM.java - adrien