Public GIT Repository
simgrid.git
Adrien Lebre [Mon, 28 Jul 2014 15:53:58 +0000 (17:53 +0200)]
Merge branch 'master' of git+ssh://scm.gforge.inria.fr//gitroot/simgrid/simgrid

Adrien Lebre [Mon, 28 Jul 2014 15:53:54 +0000 (17:53 +0200)]
Found a bug/feature regarding the cancellation of a sleep (see comments directly in the code) + add a temporary patch directly at the java level in order to force a hostFailureException - Adrien

Augustin Degomme [Mon, 28 Jul 2014 15:17:11 +0000 (17:17 +0200)]
fix dist

Augustin Degomme [Mon, 28 Jul 2014 14:44:44 +0000 (16:44 +0200)]
these assertions were a bit strong

Augustin Degomme [Mon, 28 Jul 2014 14:35:58 +0000 (16:35 +0200)]
make some algorithms return when using bad datatypes

Augustin Degomme [Mon, 28 Jul 2014 14:35:17 +0000 (16:35 +0200)]
Add Intel MPI (impi) selector.

Thresholds were obtained on the Stampede cluster, by activating debug output for 1 process/node.
The algorithm list is given in the Intel MPI documentation, available on Intel's site.

problems:
- doesn't take into account SMP for now (selection logic evolves, and thresholds change)
- some algorithms are unavailable (proprietary/undocumented) such as Shumilin's or Plum's. So others are used in these cases... And that's bad.
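
A minimal sketch of what a threshold-based selector of this kind can look like, assuming a table keyed on message size and a split on communicator size. The enum values, table contents and function name are hypothetical placeholders, not the thresholds measured on Stampede and not SimGrid's actual identifiers.

```c
#include <stddef.h>

/* Hypothetical algorithm identifiers, for illustration only. */
typedef enum {
  ALLREDUCE_RECURSIVE_DOUBLING,
  ALLREDUCE_RABENSEIFNER,
  ALLREDUCE_RING
} allreduce_algo_t;

/* One row of a calibration table: the algorithm applies to messages
 * of at most max_bytes bytes. */
typedef struct {
  size_t max_bytes;
  allreduce_algo_t algo;
} threshold_t;

/* Placeholder thresholds (assumed values, not real calibration data).
 * The last row uses the largest size_t value as a "no limit" sentinel. */
static const threshold_t small_comm_table[] = {
  {1024, ALLREDUCE_RECURSIVE_DOUBLING},
  {262144, ALLREDUCE_RABENSEIFNER},
  {(size_t)-1, ALLREDUCE_RING},
};

static const threshold_t large_comm_table[] = {
  {8192, ALLREDUCE_RECURSIVE_DOUBLING},
  {(size_t)-1, ALLREDUCE_RABENSEIFNER},
};

/* Pick a table from the communicator size, then return the first row
 * whose bound is not exceeded by the message size. */
allreduce_algo_t select_allreduce(int comm_size, size_t msg_bytes)
{
  const threshold_t *table =
      comm_size <= 16 ? small_comm_table : large_comm_table;
  size_t i = 0;
  while (msg_bytes > table[i].max_bytes)
    i++; /* terminates: the sentinel row matches every size */
  return table[i].algo;
}
```

When an algorithm named in a vendor's table is unavailable, a selector like this has to map that row to a substitute, which is the second problem listed above.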

Augustin Degomme [Mon, 28 Jul 2014 14:30:35 +0000 (16:30 +0200)]
some algos have bad behavior with some inputs, try to avoid that.

Augustin Degomme [Mon, 28 Jul 2014 11:16:22 +0000 (13:16 +0200)]
activate a previously commented pairwise alltoall algo with rma comms

Augustin Degomme [Fri, 25 Jul 2014 13:48:12 +0000 (15:48 +0200)]
Add Rabenseifner Reduce/Allreduce algorithms
See https://fs.hlrs.de/projects/par/mpi//myreduce.html for details

Paul Bédaride [Mon, 28 Jul 2014 14:27:56 +0000 (16:27 +0200)]
Fix host_on_off_process tesh

Paul Bédaride [Mon, 28 Jul 2014 13:09:56 +0000 (15:09 +0200)]
Fix tesh host_on_off_processes 4

Adrien Lebre [Mon, 28 Jul 2014 12:43:54 +0000 (14:43 +0200)]
Merge branch 'master' of git+ssh://scm.gforge.inria.fr//gitroot/simgrid/simgrid

Adrien Lebre [Mon, 28 Jul 2014 12:43:50 +0000 (14:43 +0200)]
fix tesh (add the sleep test output) - adrien

Gabriel Corona [Mon, 28 Jul 2014 12:39:54 +0000 (14:39 +0200)]
Shell script to generate a simple tesh file

Adrien Lebre [Mon, 28 Jul 2014 12:19:04 +0000 (14:19 +0200)]
Merge branch 'master' of git+ssh://scm.gforge.inria.fr//gitroot/simgrid/simgrid

Adrien Lebre [Mon, 28 Jul 2014 12:18:57 +0000 (14:18 +0200)]
Add test sleep for host_on_off + minor changes - Adrien

Gabriel Corona [Mon, 28 Jul 2014 09:35:05 +0000 (11:35 +0200)]
[mc] Disable sparse snapshot tests

It seems sparse snapshots are not stable yet.

Gabriel Corona [Mon, 28 Jul 2014 09:08:10 +0000 (11:08 +0200)]
[mc] Remove unused variables

Stop the continuous integration server from complaining.

Adrien Lebre [Mon, 28 Jul 2014 08:54:11 +0000 (10:54 +0200)]
Merge branch 'master' of git+ssh://scm.gforge.inria.fr//gitroot/simgrid/simgrid

Adrien Lebre [Mon, 28 Jul 2014 08:54:05 +0000 (10:54 +0200)]
Rollback - the java code should make a THROWF call - adrien

Gabriel Corona [Mon, 30 Jun 2014 11:06:12 +0000 (13:06 +0200)]
[mc] Make state/snapshot comparison work with SMPI variable privatisation

Changes:

  * a snapshot region now has a standard address (address in the
    virtual process) and a permanent address (address of the
    privatisation region, where the memory is always mapped even when
    another process is active);

  * handle privatised memory regions on snapshot and snapshot restore;

  * compare global variables separately for each simulated process;

  * the ID of the simulated process must be passed everywhere;

  * mc_snapshot_read() and friends, as well as DWARF expressions, can be
    evaluated in the context of a given simulated process and fetch
    the data in the corresponding privatisation region.
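
The standard/permanent address split can be pictured with the sketch below. It assumes one privatised region per simulated process, all sharing the same standard address range; the struct, field and function names are hypothetical and are not the ones used in src/mc.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical region descriptor (illustrative field names). */
typedef struct {
  uintptr_t start_addr;     /* standard address, as seen by the simulated process */
  uintptr_t permanent_addr; /* address of the always-mapped privatisation copy */
  size_t size;              /* region length in bytes */
} region_t;

/* Non-zero if addr falls inside the region's standard address range. */
int region_contains(const region_t *r, uintptr_t addr)
{
  return addr >= r->start_addr && addr - r->start_addr < r->size;
}

/* Translate a standard address into the permanent address belonging to
 * the simulated process process_index. Returns 0 when the process index
 * is out of range or the address is outside that process's region. */
uintptr_t resolve_permanent(const region_t *per_process, size_t process_count,
                            size_t process_index, uintptr_t addr)
{
  if (process_index >= process_count)
    return 0;
  const region_t *r = &per_process[process_index];
  if (!region_contains(r, addr))
    return 0;
  return r->permanent_addr + (addr - r->start_addr);
}
```

This is why the process ID has to be passed everywhere: the same standard address resolves to a different permanent address for each simulated process.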

Gabriel Corona [Fri, 25 Jul 2014 11:28:50 +0000 (13:28 +0200)]
Merge branch 'mc'

Conflicts:
src/mc/mc_compare.cpp

Gabriel Corona [Fri, 25 Jul 2014 10:24:59 +0000 (12:24 +0200)]
[mc] Change the signature of mc_restore_page_snapshot_region

Better separation of the different snapshot layers.

Gabriel Corona [Fri, 25 Jul 2014 10:24:22 +0000 (12:24 +0200)]
[mc] Split mc_region_new_dense out of MC_region_new

MC_region_new uses the global SimGrid setting.
mc_region_new_dense uses dense snapshot.

Gabriel Corona [Fri, 25 Jul 2014 10:23:00 +0000 (12:23 +0200)]
[mc] Expand unit test of mc_snapshot.c

Gabriel Corona [Fri, 25 Jul 2014 10:42:42 +0000 (12:42 +0200)]
[mc] Remove page_store.tesh (has been changed into a unit test)

Augustin Degomme [Thu, 24 Jul 2014 16:36:52 +0000 (18:36 +0200)]
(stupid) warning --

Paul Bédaride [Thu, 24 Jul 2014 15:47:19 +0000 (17:47 +0200)]
Fix msg-host-on-off tesh

Augustin Degomme [Thu, 24 Jul 2014 15:26:37 +0000 (17:26 +0200)]
Add and use knomial reduce algorithm from mvapich

Augustin Degomme [Thu, 24 Jul 2014 14:41:28 +0000 (16:41 +0200)]
Add and use mvapich's scatter_dest alltoall algorithm

Gabriel Corona [Thu, 24 Jul 2014 14:11:19 +0000 (16:11 +0200)]
Merge remote-tracking branch 'origin/mc-fastsnapshot' into mc

Conflicts:
buildtools/Cmake/Flags.cmake

Gabriel Corona [Thu, 24 Jul 2014 14:10:02 +0000 (16:10 +0200)]
[mc] Remove irrelevant comment

Adrien Lebre [Thu, 24 Jul 2014 14:01:33 +0000 (16:01 +0200)]
fix daemon typos

Adrien Lebre [Thu, 24 Jul 2014 13:49:43 +0000 (15:49 +0200)]
Merge branch 'master' of git+ssh://scm.gforge.inria.fr//gitroot/simgrid/simgrid

Adrien Lebre [Thu, 24 Jul 2014 13:49:37 +0000 (15:49 +0200)]
fix minor issues in the management of Java Exceptions + comment process cancelled exception call

Augustin Degomme [Thu, 24 Jul 2014 13:39:18 +0000 (15:39 +0200)]
fix dist

Augustin Degomme [Thu, 24 Jul 2014 13:22:33 +0000 (15:22 +0200)]
indent

Augustin Degomme [Thu, 24 Jul 2014 13:20:49 +0000 (15:20 +0200)]
protect these calls against MPI_DATATYPE_NULL errors

Augustin Degomme [Thu, 24 Jul 2014 13:20:25 +0000 (15:20 +0200)]
cleanup a bit the code, ensure tests do pass

Augustin Degomme [Thu, 24 Jul 2014 09:06:29 +0000 (11:06 +0200)]
add mvapich allreduce rs algorithm

Augustin Degomme [Thu, 24 Jul 2014 08:41:00 +0000 (10:41 +0200)]
manually privatize allred test, to have it work on non-mmap systems also

Augustin Degomme [Thu, 24 Jul 2014 00:04:40 +0000 (02:04 +0200)]
Add last collectives from mvapich selector : bcast reduce reduce_scatter scatter

bcast still defaults to the mpich one, as the mvapich versions need SMP support

Augustin Degomme [Wed, 23 Jul 2014 15:35:52 +0000 (17:35 +0200)]
New collectives for mvapich2 selector : allgatherv, allreduce, alltoallv, barrier

Augustin Degomme [Wed, 23 Jul 2014 12:26:17 +0000 (14:26 +0200)]
Begin to add a MVAPICH2 collectives selector. Alltoall, Allgather and gather done.

Problems :
- code is (for now) quite a mess, with several versions of tuning available at the same time (coll folder from mvapich has currently 145k cloc).
- code is copy-pasted directly ... So, no comments
- only Stampede calibration is imported for now. Some others are available, we should provide a mechanism to switch to another calibration.
- MVAPICH collectives are SMP aware. SMPI is not really ... A mechanism to automatically generate an "internal" communicator for processes sharing a physical node will be needed. Gather actually defaults to mpich one as a result of this.
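
As a rough sketch of the grouping that such an "internal" communicator would need, the plain C function below derives, for every rank, its index among the ranks on the same node and the lowest rank on that node. The function name and the node_of input array are assumptions made for illustration; in an MPI-3 program the equivalent grouping is usually obtained with MPI_Comm_split_type and MPI_COMM_TYPE_SHARED.

```c
#include <stddef.h>

/* Hypothetical helper: node_of[r] is the node identifier hosting rank r.
 * On return, local_rank[r] is the position of r among the ranks of its
 * node (in increasing rank order) and node_leader[r] is the lowest rank
 * hosted on the same node. */
void compute_local_ranks(const int *node_of, int nranks,
                         int *local_rank, int *node_leader)
{
  for (int r = 0; r < nranks; r++) {
    int seen = 0;       /* lower ranks found on the same node so far */
    node_leader[r] = r; /* r leads its node unless a lower rank shares it */
    for (int p = 0; p < r; p++) {
      if (node_of[p] == node_of[r]) {
        if (seen == 0)
          node_leader[r] = p; /* first match is the lowest rank */
        seen++;
      }
    }
    local_rank[r] = seen;
  }
}
```

Ranks sharing a leader would form the intra-node communicator, and the leaders alone would form the inter-node one. The quadratic loop keeps the sketch short; a real implementation would likely sort or hash by node.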

Paul Bédaride [Thu, 24 Jul 2014 13:16:12 +0000 (15:16 +0200)]
Fix mallocator tesh

Gabriel Corona [Thu, 24 Jul 2014 12:43:13 +0000 (14:43 +0200)]
[mc] Use mc_region_contain where it could be used

Gabriel Corona [Thu, 24 Jul 2014 12:09:55 +0000 (14:09 +0200)]
[mc] Update doxygen comments

Paul Bédaride [Thu, 24 Jul 2014 09:53:48 +0000 (11:53 +0200)]
Add test for host on off and some fixes

Augustin Degomme [Wed, 23 Jul 2014 09:07:42 +0000 (11:07 +0200)]
Set 4 tests that fail on Windows and won't ever pass as "expected to fail"

This makes it easier to discriminate between real bugs and wontfix non-issues.

One of them fails because random returns differently on win and generates different colors in the resulting trace. Duh.
The three others are expected to crash, and do crash correctly on Windows, but the return code is different from the one expected (SIGABRT). Meh

Gabriel Corona [Tue, 22 Jul 2014 13:42:44 +0000 (15:42 +0200)]
[mc] Add tests for sparse snapshot

Gabriel Corona [Tue, 22 Jul 2014 13:15:17 +0000 (15:15 +0200)]
[mc] Disable soft-dirty page tracking by default

In all tests I ran, it has a negative impact on performance.

Augustin Degomme [Mon, 21 Jul 2014 13:22:50 +0000 (15:22 +0200)]
sanitize get/set_name functions for fortran use

Gabriel Corona [Mon, 21 Jul 2014 12:36:16 +0000 (14:36 +0200)]
[mc] Disable optimisation for xbt when using MC

In some cases, it breaks the state comparison for some reason. This
was observed with mm.c and dynar.c.

Gabriel Corona [Mon, 21 Jul 2014 11:38:08 +0000 (13:38 +0200)]
[mc] Do not handle mc_model_checker->parent_snapshot unless it is necessary

Augustin Degomme [Mon, 21 Jul 2014 09:33:08 +0000 (11:33 +0200)]
Add MPI_Win_get_name and MPI_Win_set_name support

Augustin Degomme [Mon, 21 Jul 2014 09:32:40 +0000 (11:32 +0200)]
reactivate allred test

But have its compilation flags set to -O0 to avoid issues with CI slaves (compiling all the macros it uses with optimisations takes too long and uses too much memory)

Gabriel Corona [Thu, 17 Jul 2014 14:30:45 +0000 (16:30 +0200)]
[mmalloc] Force metadata update in mmalloc/mrealloc

Gabriel Corona [Thu, 17 Jul 2014 08:41:20 +0000 (10:41 +0200)]
[mmalloc] Add mmcheck() which checks mmalloc heap consistency

Gabriel Corona [Thu, 10 Jul 2014 13:48:16 +0000 (15:48 +0200)]
[mmalloc] Add new block type for heapinfo blocks

Augustin Degomme [Thu, 17 Jul 2014 16:38:50 +0000 (18:38 +0200)]
revert changes on allgatherv4, which needed manual privatization to run on freebsd

Augustin Degomme [Thu, 17 Jul 2014 16:15:02 +0000 (18:15 +0200)]
remove warning with mc

Augustin Degomme [Thu, 17 Jul 2014 16:00:47 +0000 (18:00 +0200)]
remove warning

Augustin Degomme [Thu, 17 Jul 2014 15:52:24 +0000 (17:52 +0200)]
Finish pulling changes from mpich trunk testsuite

Augustin Degomme [Thu, 17 Jul 2014 14:54:35 +0000 (16:54 +0200)]
Update comm, datatype from mpich trunk

Augustin Degomme [Thu, 17 Jul 2014 13:38:45 +0000 (15:38 +0200)]
enforce a scatter error in some cases

Augustin Degomme [Thu, 17 Jul 2014 13:38:28 +0000 (15:38 +0200)]
update collectives teshsuite from mpich git (only minor changes)

Augustin Degomme [Thu, 17 Jul 2014 09:06:53 +0000 (11:06 +0200)]
tesh update for fabien's work on surf

Augustin Degomme [Thu, 17 Jul 2014 09:06:12 +0000 (11:06 +0200)]
fabien's work on surf

Adrien Lebre [Wed, 16 Jul 2014 17:39:34 +0000 (19:39 +0200)]
Push Takahiro Patch and fix cloud.tesh - Adrien

Adrien Lebre [Wed, 16 Jul 2014 16:09:41 +0000 (18:09 +0200)]
Merge branch 'master' of git+ssh://scm.gforge.inria.fr//gitroot/simgrid/simgrid

Adrien Lebre [Wed, 16 Jul 2014 16:09:37 +0000 (18:09 +0200)]
Fix a typo in the java-cloud example and add one TODO related to the migration invocation in VM.java - adrien

Augustin Degomme [Wed, 16 Jul 2014 16:00:55 +0000 (18:00 +0200)]
fix dist

Augustin Degomme [Wed, 16 Jul 2014 15:44:00 +0000 (17:44 +0200)]
rename function

Augustin Degomme [Wed, 16 Jul 2014 15:43:52 +0000 (17:43 +0200)]
requalify tesh

Augustin Degomme [Wed, 16 Jul 2014 15:16:50 +0000 (17:16 +0200)]
use global variables to store values that may be used millions of times..

Augustin Degomme [Wed, 16 Jul 2014 14:52:03 +0000 (16:52 +0200)]
add time injection in MPI_Wtime and MPI_Test, to match what was done in iprobe

Augustin Degomme [Wed, 16 Jul 2014 12:49:18 +0000 (14:49 +0200)]
add comment for magic value

Augustin Degomme [Wed, 16 Jul 2014 12:46:37 +0000 (14:46 +0200)]
add F90 rma tests

Takahiro Hirofuchi [Wed, 16 Jul 2014 07:14:23 +0000 (16:14 +0900)]
migration: minor cleanup and update TODO

Use MSG_process_create() instead of MSG_process_create_with_arguments().

Takahiro Hirofuchi [Wed, 16 Jul 2014 07:04:53 +0000 (16:04 +0900)]
migration: fix status check of migration

When a migration of a VM is already ongoing, do not allow
MSG_vm_migrate() for the VM.
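
The guard described here amounts to a per-VM "migration in progress" flag checked before starting a new migration. A minimal sketch follows, with a hypothetical vm_t type and helper names that are not the MSG API:

```c
/* Hypothetical VM descriptor, reduced to what the guard needs. */
typedef struct {
  const char *name;
  int is_migrating; /* non-zero while a migration is ongoing */
} vm_t;

/* Try to start a migration. Returns 0 on success, or -1 if a migration
 * of this VM is already ongoing (the request is then refused). */
int vm_migrate_begin(vm_t *vm)
{
  if (vm->is_migrating)
    return -1;
  vm->is_migrating = 1;
  return 0;
}

/* Mark the migration as finished so that a later one is allowed. */
void vm_migrate_end(vm_t *vm)
{
  vm->is_migrating = 0;
}
```

A caller would pair vm_migrate_begin() with vm_migrate_end() around the actual transfer, and report an error when the former returns -1.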

degomme [Tue, 15 Jul 2014 18:07:44 +0000 (20:07 +0200)]
remove the allred test
ci slaves have trouble building it (not enough memory probably)

degomme [Tue, 15 Jul 2014 16:29:37 +0000 (18:29 +0200)]
allred test needed unsigned char support, which was forgotten.
TODO : This part of the code is ugly and should be replaced by macros asap, as it may lead to nasty bugs

degomme [Tue, 15 Jul 2014 16:28:21 +0000 (18:28 +0200)]
Add MPI_Type_set_name and MPI_Type_get_name and activate tests

degomme [Tue, 15 Jul 2014 14:14:52 +0000 (16:14 +0200)]
activate scatterv test, which needed Carts and Dim

degomme [Mon, 14 Jul 2014 22:32:57 +0000 (00:32 +0200)]
add definitions

degomme [Mon, 14 Jul 2014 21:41:35 +0000 (23:41 +0200)]
fix dist

degomme [Mon, 14 Jul 2014 21:36:43 +0000 (23:36 +0200)]
set default size of Aint to integer*8 ... not ideal, though.
Real mpich testsuite uses autoconf to configure the size

degomme [Sat, 12 Jul 2014 00:00:35 +0000 (02:00 +0200)]
adapt mpif.h for rma

degomme [Fri, 11 Jul 2014 23:52:13 +0000 (01:52 +0200)]
Add f77 RMA tests

degomme [Fri, 11 Jul 2014 20:19:17 +0000 (22:19 +0200)]
mpich testsuite: add f77 topo test

degomme [Fri, 11 Jul 2014 20:18:21 +0000 (22:18 +0200)]
mpich testsuite: activate now working datatype test

Gabriel Corona [Fri, 11 Jul 2014 10:06:47 +0000 (12:06 +0200)]
[mmalloc] Add documentation

Gabriel Corona [Fri, 11 Jul 2014 10:04:34 +0000 (12:04 +0200)]
[mmalloc] Avoid useless memset0

Gabriel Corona [Fri, 11 Jul 2014 08:15:59 +0000 (10:15 +0200)]
[mc] Remove 'previous' variable in mmalloc_compare_heap()

It was not used.

Gabriel Corona [Fri, 11 Jul 2014 08:02:18 +0000 (10:02 +0200)]
[mc] Fix test on type in mc_diff

Gabriel Corona [Thu, 10 Jul 2014 11:32:29 +0000 (13:32 +0200)]
[mc] Fix name of mc_snapshot_memcmp()

Gabriel Corona [Thu, 10 Jul 2014 10:44:22 +0000 (12:44 +0200)]
[mc] Add unit tests for reading/comparing the whole region in mc_snapshot

Gabriel Corona [Thu, 10 Jul 2014 10:30:10 +0000 (12:30 +0200)]
[mc] Test flat snapshots as well

Gabriel Corona [Thu, 10 Jul 2014 09:36:32 +0000 (11:36 +0200)]
[mc] Add unit test for mc_snapshot

Augustin Degomme [Tue, 8 Jul 2014 22:51:01 +0000 (00:51 +0200)]
Fix distcheck