In this mode, your application is actually executed. Every computation
occurs for real while every communication is simulated. In addition,
the executions are automatically benchmarked so that their timings can
be applied within the simulator.
SMPI can also go offline by replaying a trace. :ref:`Trace replay
<SMPI_offline>` is usually much faster than online simulation (because
nothing is actually executed).
- **ompi**: default selection logic of OpenMPI (version 3.1.2)
- **mpich**: default selection logic of MPICH (version 3.3b)
- **mvapich2**: selection logic of MVAPICH2 (version 1.9) tuned
  for the Stampede cluster
- **impi**: preliminary version of an Intel MPI selector (version
  4.1.3, also tuned for the Stampede cluster). Due to the closed-source
  nature of Intel MPI, some of the algorithms described in the
- **default**: legacy algorithms used in the earlier days of
  SimGrid. Do not use for serious performance studies.
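
Whichever selector or per-collective algorithm you pick, the application code
itself is not modified: the choice is made at simulation time, typically
through configuration items such as ``smpi/coll-selector`` or ``smpi/alltoall``
passed to ``smpirun``. As a minimal, purely illustrative sketch, the
``MPI_Alltoall`` call below is simply mapped by SMPI onto whichever algorithm
was selected:

.. code-block:: c

   #include <mpi.h>
   #include <stdlib.h>

   int main(int argc, char *argv[])
   {
     MPI_Init(&argc, &argv);
     int rank, size;
     MPI_Comm_rank(MPI_COMM_WORLD, &rank);
     MPI_Comm_size(MPI_COMM_WORLD, &size);

     int *sendbuf = malloc(size * sizeof(int));
     int *recvbuf = malloc(size * sizeof(int));
     for (int i = 0; i < size; i++)
       sendbuf[i] = rank * size + i;

     /* SMPI implements this call with the algorithm chosen by the selector
      * or by the smpi/alltoall configuration item; the code is unchanged. */
     MPI_Alltoall(sendbuf, 1, MPI_INT, recvbuf, 1, MPI_INT, MPI_COMM_WORLD);

     free(sendbuf);
     free(recvbuf);
     MPI_Finalize();
     return 0;
   }
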
- mpich: use mpich selector for the alltoall operations
- mvapich2: use mvapich2 selector for the alltoall operations
- impi: use intel mpi selector for the alltoall operations
containing two nodes
- pair: pairwise exchange, only works for power of 2 procs, size-1 steps,
each process sends and receives from the same process at each step
- mpich: use mpich selector for the alltoallv operations
- mvapich2: use mvapich2 selector for the alltoallv operations
- impi: use intel mpi selector for the alltoallv operations
- mpich: use mpich selector for the barrier operations
- mvapich2: use mvapich2 selector for the barrier operations
- impi: use intel mpi selector for the barrier operations
- ompi_basic_linear: all processes send to root
- ompi_two_procs: special case for two processes
- ompi_bruck: nsteps = sqrt(size), at each step, exchange data with rank-2^k and rank+2^k
- mpich: use mpich selector for the scatter operations
- mvapich2: use mvapich2 selector for the scatter operations
- impi: use intel mpi selector for the scatter operations
- automatic (experimental): use an automatic self-benchmarking algorithm
- ompi_basic_linear: basic linear scatter
- ompi_binomial: binomial tree scatter
- mvapich2_two_level_direct: SMP aware algorithm, with an intra-node stage (default set to mpich selector), and then a basic linear inter node stage. Use mvapich2 selector to change these to tuned algorithms for Stampede cluster.
- mvapich2_two_level_binomial: SMP aware algorithm, with an intra-node stage (default set to mpich selector), and then a binomial phase. Use mvapich2 selector to change these to tuned algorithms for Stampede cluster.
- mpich: use mpich selector for the reduce operations
- mvapich2: use mvapich2 selector for the reduce operations
- impi: use intel mpi selector for the reduce operations
- arrival_pattern_aware: root exchanges with the first process to arrive
- binomial: uses a binomial tree
- flat_tree: uses a flat tree
0->1, 1->2, 2->3, ..., ->last node: in a pipeline fashion, with segments
of 8192 bytes
- scatter_gather: scatter then gather
- ompi_chain: openmpi reduce algorithms are built on the same basis, but the
topology is generated differently for each flavor
  chain = chain with spacing of size/2, and segment size of 64KB
- ompi_pipeline: same with pipeline (chain with spacing of 1), segment size
  depends on the communicator size and the message size
- ompi_binary: same with binary tree, segment size of 32KB
one in most cases)
- ompi_basic_linear: basic algorithm, each process sends to root
- mvapich2_knomial: k-nomial algorithm. Default factor is 4 (mvapich2 selector adapts it through tuning)
- mvapich2_two_level: SMP-aware reduce, with default set to mpich both for intra and inter communicators. Use mvapich2 selector to change these to tuned algorithms for Stampede cluster.
- mpich: use mpich selector for the allreduce operations
- mvapich2: use mvapich2 selector for the allreduce operations
- impi: use intel mpi selector for the allreduce operations
- lr: logical ring reduce-scatter then logical ring allgather
- rab1: variation of the `Rabenseifner <https://fs.hlrs.de/projects/par/mpi//myreduce.html>`_ algorithm: reduce_scatter then allgather
- rab2: variation of the `Rabenseifner <https://fs.hlrs.de/projects/par/mpi//myreduce.html>`_ algorithm: alltoall then allgather
- rab_rsag: variation of the `Rabenseifner <https://fs.hlrs.de/projects/par/mpi//myreduce.html>`_ algorithm: recursive doubling
  reduce_scatter then recursive doubling allgather
SMP reduce, inter reduce, inter broadcast then intra broadcast
- smp_binomial_pipeline: same with segment size = 4096 bytes
reduce-scatter, logical ring inter:allgather, intra: binomial broadcast
- smp_rsag_rab: intra: binomial allreduce, inter: rab
reduce-scatter, rab inter:allgather, intra: binomial broadcast
- mpich: use mpich selector for the reduce_scatter operations
- mvapich2: use mvapich2 selector for the reduce_scatter operations
- impi: use intel mpi selector for the reduce_scatter operations
- ompi_basic_recursivehalving: recursive halving version from OpenMPI
- ompi_ring: ring version from OpenMPI
- mpich_pair: pairwise exchange version from MPICH
- mpich: use mpich selector for the allgather operations
- mvapich2: use mvapich2 selector for the allgather operations
- impi: use intel mpi selector for the allgather operations
processes/node: 4)
- NTSLR: Non Topology Specific Logical Ring
- NTSLR_NB: Non Topology Specific Logical Ring, Non Blocking operations
- SMP_NTS: gather to root of each SMP, then every root of each SMP node
  post INTER-SMP Sendrecv, then do INTRA-SMP Bcast for each receiving message,
- smp_simple: gather to root of each SMP, then every root of each SMP node
  post INTER-SMP Sendrecv, then do INTRA-SMP Bcast for each receiving message,
using simple algorithm (hardcoded, default processes/SMP: 8)
- spreading_simple: from node i, order of communications is i -> i + 1, i ->
i + 2, ..., i -> (i + p -1) % P
Described by Chen et al. in `Performance Evaluation of Allgather
Algorithms on Terascale Linux Cluster with Fast Ethernet <http://ieeexplore.ieee.org/xpl/articleDetails.jsp?tp=&arnumber=1592302>`_
- mvapich2_smp: SMP aware algorithm, performing intra-node gather, inter-node allgather with one process/node, and bcast intra-node
- mpich: use mpich selector for the allgatherv operations
- mvapich2: use mvapich2 selector for the allgatherv operations
- impi: use intel mpi selector for the allgatherv operations
- GB: Gatherv - Broadcast (uses tuned version if specified, but only for Bcast, gatherv is not tuned)
- pair: see alltoall
- ring: see alltoall
- mpich: use mpich selector for the bcast operations
- mvapich2: use mvapich2 selector for the bcast operations
- impi: use intel mpi selector for the bcast operations
- arrival_pattern_aware: root exchanges with the first process to arrive
- arrival_pattern_aware_wait: same with slight variation
- binomial_tree: binomial tree exchange
- SMP_linear: linear algorithm with 8 cores/SMP
- ompi_split_bintree: binary tree algorithm from OpenMPI, with the message split into 8192-byte pieces
- ompi_pipeline: pipeline algorithm from OpenMPI, with the message split into 128KB pieces
- mvapich2_intra_node: Intra-node default mvapich worker
- mvapich2_knomial_intra_node: k-nomial intra-node default mvapich worker. Default factor is 4.
An automatic version is available for each collective (or even as a selector). This specific
version will loop over all other implemented algorithms for this particular collective, and apply
them while benchmarking the time taken for each process. It will then output the quickest for
each process, and the global quickest. This is still unstable, and a few algorithms which need
and compare collective algorithms, you should set the
``tracing/smpi/internals`` configuration item to 1 instead of 0.
the first one with a ring algorithm, the second with a pairwise one.
.. image:: /img/smpi_simgrid_alltoall_ring_16.png
:align: center
implement absolutely all existing primitives. Currently, we have
almost no support for I/O primitives, but we still pass a very large
fraction of the MPICH coverage tests.
then this macro can dramatically shrink your memory consumption. For example,
that will be very beneficial to a matrix multiplication code, as all blocks will
be stored on the same area. Of course, the resulting computations will be useless.
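
As a sketch of this trick, the three blocks of a naive multiplication can be
allocated with the ``SMPI_SHARED_MALLOC`` macro from ``<smpi/smpi.h>`` so that
they all alias the same physical pages (the sizes and the loop below are
purely illustrative):

.. code-block:: c

   #include <smpi/smpi.h>
   #include <stdlib.h>

   /* Illustrative size only: every matrix is backed by the same shared
    * pages, so the memory footprint stays small even for a large n. The
    * numerical result is meaningless, but the control flow, the amount of
    * computation and the communications of the real code are preserved. */
   static const size_t n = 2048;

   void multiply_blocks(void)
   {
     double *A = SMPI_SHARED_MALLOC(n * n * sizeof(double));
     double *B = SMPI_SHARED_MALLOC(n * n * sizeof(double));
     double *C = SMPI_SHARED_MALLOC(n * n * sizeof(double));

     for (size_t i = 0; i < n; i++)
       for (size_t j = 0; j < n; j++)
         for (size_t k = 0; k < n; k++)
           C[i * n + j] += A[i * n + k] * B[k * n + j];

     SMPI_SHARED_FREE(A);
     SMPI_SHARED_FREE(B);
     SMPI_SHARED_FREE(C);
   }
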
Naturally, this won't work if your code is data-dependent. For example, a Jacobi
iterative computation depends on the result computed by the code to detect
convergence.
SMPI_SAMPLE_GLOBAL. Of course, none of this will work if the execution
times of your loop iterations are not stable.
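
A minimal sketch of such a sampled loop is shown below, assuming that the two
macro parameters give the number of iterations to really benchmark and the
target relative precision; ``compute_kernel`` is a hypothetical kernel with a
stable duration:

.. code-block:: c

   #include <smpi/smpi.h>

   /* Hypothetical kernel whose duration is roughly identical at each call,
    * which is the condition for sampling to give meaningful timings. */
   static void compute_kernel(double *data, int n)
   {
     for (int i = 0; i < n; i++)
       data[i] = data[i] * 0.5 + 1.0;
   }

   void solver_loop(double *data, int n)
   {
     for (int iter = 0; iter < 1000; iter++) {
       /* Only a few iterations are executed and timed for real; the
        * remaining ones are replaced by injecting the measured average. */
       SMPI_SAMPLE_GLOBAL(10, 0.01) {
         compute_kernel(data, n);
       }
     }
   }
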
`examples/smpi/NAS/ep.c <https://framagit.org/simgrid/simgrid/tree/master/examples/smpi/NAS/ep.c>`_
.............................
precious for that). Then, try to modify your model (of the platform,
of the collective operations) to reduce the most prominent differences.
``smpi/host-speed``: reduce it if your simulation runs faster than in
reality. If the error comes from the communication, then you need to
fiddle with your platform file.
Although SMPI is often used for :ref:`online simulation
<SMPI_online>`, where the application is executed for real, you can
SimGrid uses time-independent traces, in which each actor is given a
script of the actions to do sequentially. These trace files can
The produced trace is composed of a file ``LU.A.32`` and a folder
``LU.A.32_files``. The file names don't match with the MPI ranks, but