Replace non-breaking spaces.

[simgrid.git] / docs / source / app_smpi.rst
diff --git a/docs/source/app_smpi.rst b/docs/source/app_smpi.rst

index 306676f..89a8b65 100644 (file)
--- a/docs/source/app_smpi.rst
+++ b/docs/source/app_smpi.rst
@@ -6,11 +6,12 @@ SMPI: Simulate MPI Applications
  
  .. raw:: html
  
-   <object id="TOC" data="graphical-toc.svg" width="100%" type="image/svg+xml"></object>
+   <object id="TOC" data="graphical-toc.svg" type="image/svg+xml"></object>
     <script>
     window.onload=function() { // Wait for the SVG to be loaded before changing it
       var elem=document.querySelector("#TOC").contentDocument.getElementById("SMPIBox")
       elem.style="opacity:0.93999999;fill:#ff0000;fill-opacity:0.1";
+     elem.style="opacity:0.93999999;fill:#ff0000;fill-opacity:0.1;stroke:#000000;stroke-width:0.35277778;stroke-linecap:round;stroke-linejoin:round;stroke-miterlimit:4;stroke-dasharray:none;stroke-dashoffset:0;stroke-opacity:1";
     }
     </script>
     <br/>
@@ -20,8 +21,8 @@ SMPI enables the study of MPI application by emulating them on top of
  the SimGrid simulator. This is particularly interesting to study
  existing MPI applications within the comfort of the simulator.
  
-To get started with SMPI, you should head to `the SMPI tutorial
-<Tutorial_MPI_Applications>`_. You may also want to read the `SMPI reference
+To get started with SMPI, you should head to :ref:`the SMPI tutorial
+<usecase_smpi>`. You may also want to read the `SMPI reference
  article <https://hal.inria.fr/hal-01415484>`_ or these `introductory
  slides <http://simgrid.org/tutorials/simgrid-smpi-101.pdf>`_.  If you
  are new to MPI, you should first take our online `SMPI CourseWare
@@ -69,6 +70,14 @@ the right compilation flags along the way. If your application is in
  C++, Fortran 77 or Fortran 90, use respectively ``smpicxx``,
  ``smpiff`` or ``smpif90``.
  
+If you use cmake, set the variables ``MPI_C_COMPILER``, ``MPI_CXX_COMPILER`` and
+``MPI_Fortran_COMPILER`` to the full path of smpicc, smpicxx and smpiff (or
+smpif90), respectively. Example:
+
+.. code-block:: shell
+
+   cmake -DMPI_C_COMPILER=/opt/simgrid/bin/smpicc -DMPI_CXX_COMPILER=/opt/simgrid/bin/smpicxx -DMPI_Fortran_COMPILER=/opt/simgrid/bin/smpiff .
+
  ....................
  Simulating your Code
  ....................
@@ -169,7 +178,7 @@ means that the selected algorithm will be used
  MPI_Alltoall
  ^^^^^^^^^^^^
  
-Most of these are best described in `STAR-MPI's white paper <www.cs.fsu.edu/~xyuan/paper/06ics.pdf>`_.
+Most of these are best described in `STAR-MPI's white paper <https://doi.org/10.1145/1183401.1183431>`_.
  
   - default: naive one, by default
   - ompi: use openmpi selector for the alltoall operations
@@ -323,7 +332,7 @@ MPI_Allreduce
   - redbcast: reduce then broadcast, using default or tuned algorithms if specified
   - ompi_ring_segmented: ring algorithm used by OpenMPI
   - mvapich2_rs: rdb for small messages, reduce-scatter then allgather else
- - mvapich2_two_level: SMP-aware algorithm, with mpich as intra algoritm, and rdb as inter (Change this behavior by using mvapich2 selector to use tuned values)
+ - mvapich2_two_level: SMP-aware algorithm, with mpich as intra algorithm, and rdb as inter (Change this behavior by using mvapich2 selector to use tuned values)
   - rab: default `Rabenseifner <https://fs.hlrs.de/projects/par/mpi//myreduce.html>`_ implementation
  
  MPI_Reduce_scatter
@@ -532,14 +541,14 @@ privatized the globals through static analysis of the source code. But
  our implementation was not robust enough to be used in production, so
  it was removed at some point. Currently, SMPI comes with two
  privatization mechanisms that you can :ref:`select at runtime
-<cfg=smpi/privatization>`_.  The dlopen approach is used by
+<cfg=smpi/privatization>`.  The dlopen approach is used by
  default as it is much faster and still very robust.  The mmap approach
  is an older approach that proves to be slower.
  
  With the **mmap approach**, SMPI duplicates and dynamically switch the
  ``.data`` and ``.bss`` segments of the ELF process when switching the
  MPI ranks. This allows each ranks to have its own copy of the global
-variables.  No copy actually occures as this mechanism uses ``mmap()``
+variables.  No copy actually occurs as this mechanism uses ``mmap()``
  for efficiency. This mechanism is considered to be very robust on all
  systems supporting ``mmap()`` (Linux and most BSDs). Its performance
  is questionable since each context switch between MPI ranks induces
@@ -558,10 +567,10 @@ link against the SimGrid library itself.
  
  With the **dlopen approach**, SMPI loads several copies of the same
  executable in memory as if it were a library, so that the global
-variables get naturally dupplicated. It first requires the executable
+variables get naturally duplicated. It first requires the executable
  to be compiled as a relocatable binary, which is less common for
  programs than for libraries. But most distributions are now compiled
-this way for security reason as it allows to randomize the address
+this way for security reason as it allows one to randomize the address
  space layout. It should thus be safe to compile most (any?) program
  this way.  The second trick is that the dynamic linker refuses to link
  the exact same file several times, be it a library or a relocatable
@@ -569,12 +578,12 @@ executable. It makes perfectly sense in the general case, but we need
  to circumvent this rule of thumb in our case. To that extend, the
  binary is copied in a temporary file before being re-linked against.
  ``dlmopen()`` cannot be used as it only allows 256 contextes, and as it
-would also dupplicate simgrid itself.
+would also duplicate simgrid itself.
  
  This approach greatly speeds up the context switching, down to about
  40 CPU cycles with our raw contextes, instead of requesting several
  syscalls with the ``mmap()`` approach. Another advantage is that it
-permits to run the SMPI contexts in parallel, which is obviously not
+permits one to run the SMPI contexts in parallel, which is obviously not
  possible with the ``mmap()`` approach. It was tricky to implement, but
  we are not aware of any flaws, so smpirun activates it by default.
  
@@ -614,7 +623,7 @@ Reducing your memory footprint
  
  If you get short on memory (the whole app is executed on a single node when
  simulated), you should have a look at the SMPI_SHARED_MALLOC and
-SMPI_SHARED_FREE macros. It allows to share memory areas between processes: The
+SMPI_SHARED_FREE macros. It allows one to share memory areas between processes: The
  purpose of these macro is that the same line malloc on each process will point
  to the exact same memory area. So if you have a malloc of 2M and you have 16
  processes, this macro will change your memory consumption from 2M*16 to 2M
@@ -636,6 +645,8 @@ SMPI_SHARED_MALLOC macro in this case, sorry.
  This feature is demoed by the example file
  `examples/smpi/NAS/dt.c <https://framagit.org/simgrid/simgrid/tree/master/examples/smpi/NAS/dt.c>`_
  
+.. _SMPI_use_faster:
+
  .........................
  Toward Faster Simulations
  .........................
@@ -730,7 +741,7 @@ fail without ``smpirun``.
  ..............................................
  
  In addition to the previous answers, some projects also need to be
-explicitely told what compiler to use, as follows:
+explicitly told what compiler to use, as follows:
  
  .. code-block:: shell