MSBuild - How to build multiple files AND projects in parallel

10,290

Solution 1

The /MP switch for the compiler (cl.exe) and the /m option for msbuild.exe are indeed the way to go. Here are some relevant documentation extracts:

The /MP option causes the compiler to create one or more copies of itself, each in a separate process. Then these copies simultaneously compile the source files. Optional Argument: The maximum number of processes that the compiler can create. If you omit the processMax argument, the compiler retrieves the number of effective processors on your computer from the operating system, and creates a process for each processor. The /MP option is incompatible with some compiler options and language features. If you use an incompatible compiler option with the /MP option, the compiler issues warning D9030 and ignores the /MP option

Examples suppose you specify the following command line:

cl /MP7 a.cpp b.cpp c.cpp d.cpp e.cpp

In this case the compiler uses five processes because that is the lesser of five source files and a maximum of seven processes. Alternatively, suppose your computer has two effective processors and you specify the following command line:

cl /MP a.cpp b.cpp c.cpp

In this case the operating system reports two processors; therefore, the compiler uses two processes in its calculation. As a result, the compiler will execute the build with two processes because that is the lesser of two processes and three source files.

So as you noticed this was all about the compiler, and no word about msbuild yet, since the compiler is a standalone tool. In other words: suppose you open 2 command windows and in both of them invoke a command like cl /MP3 a.cpp b.cpp c.cpp at the same time, you'd have 6 compiler processes running at the same time. Which is the behaviour you are after and which is also what msbuild can do.

Output from invoking msbuild /?:

/maxcpucount[:n] Specifies the maximum number of concurrent processes to build with. If the switch is not used, the default value used is 1. If the switch is used without a value MSBuild will use up to the number of processors on the computer. (Short form: /m[:n])

Also more relevant information here, specifically the part where it says

BuildInParallel is an optional boolean parameter on a MSBuild task. When BuildInParallel is set to true (its default value), multiple worker processes are generated to build as many projects at the same time as possible. For this to work correctly, the /maxcpucount switch must be set to a value greater than 1, and the system must be at least dual-core or have two or more processors.

Now if you'd just call msbuild /m my.vcxproj on a typical project this won't do anything special because there is only one project to build. However if inside an msbuild file there is an call to the MsBuild task like

<MsBuild Projects="@(ListOfProjects)" BuildInParallel="True"/>

and the /m option is passed to msbuild then msbuild will spawn multiple other msbuild processes per the rules laid out above. And coincidently that is exactly what happens when building a solution with msbuild. Msbuild first converts the solution to an actual msbuild file which calls the MsBuild task with a list of projects in the solution. As such multiple projects can be built in parallel and for that multiple msbuild processes are created. In turn, if these are C++ projects with the /MP option set, each of those projects will create multiple compiler processes yielding a maximum of <max num cl processes> * <max num msbuild instances> parallel compilations. It depends on your build machine what is the optimum, if you want you can play with the options (to do this efficiently each of your projects would import a common property sheet in which you set the /MP option else you'd have to edit all of them seperately) and look at the elapsed time msbuild reports when a build is done. I always just leave the arguments for /m and /MP to the default because I don't want other developpers using my projects to have possible suboptimal configurations, and because it easily maxes out CPU.

Solution 2

You can achieve this on the command line by calling:

MSBuild /m:PARALLEL_PROJECT_COUNT /p:CL_MPCount=PARALLEL_FILE_COUNT

The number of resulting processes is going to be:

MSBuild = PARALLEL_PROJECT_COUNT Cl = PARALLEL_PROJECT_COUNT * PARALLEL_FILE_COUNT

Share:
10,290

Related videos on Youtube

Cloud
Author by

Cloud

Updated on September 14, 2022

Comments

  • Cloud
    Cloud over 1 year

    I have a large Visual Studio 2010 solution file (.sln) that contains a large number of projects (~50). Each project consists of roughly 20 .cpp and .h files. Building the entire project takes roughly 2 hours on a fast Intel i7 machine with all the dependencies cached locally on a fast SSD.

    I am attempting to replicate an existing experiment I found on this topic, and need some clarification on how to:

    1. Get the parallelization I'm seeking to work in Visual Studio (with the GUI/IDE).
    2. Get these changes also working with automated command-line builds via MSBuild.exe.

    First, are the /m and /maxcpucount options the same? I've read an article on the /m option, which seems to build more projects in parallel. This appears to be the same option I set via the following actions in the GUI:

    • Tools
      • Options
      • Projects and Solutions
      • Build and Run
        • Maximum number of parallel project builds

    There is another option, /MP, which I can reach in the GUI via:

    • Projects
      • Properties
      • Configuration Properties
      • C/C++
        • General
        • Multi-processor Compilation

    Please correct me if I'm wrong on this, but this seems to suggest that the project will build multiple .cpp files in parallel, but there's no option to specify how many, unless I have to manually set it in the text box (ie: /MP 4 or /MP4.

    It also seems that in order for /MP to work, I need to disable minimal rebuild (ie: Enabled Minimal Rebuild: No (/GM-)), and it also doesn't work with pre-compiled headers. I've worked around the pre-compiled headers issue by making the precompiled headers be a dedicated project that the solution builds first, before all other projects.

    To the question: How can I implement these options for parallel project builds within the solution, and parallel file (.cpp) builds within the projects, using MSBuild via the command-line, and confirm that they are working as expected? Are my assumptions above also correct? My end goal is to test building the project with up to the total number of cores being uses, or possibly even overloading it if my build process is I/O bound rather than CPU bound. I can do this easily enough in Linux via:

    NUMCPUS=`grep -c '^processor' /proc/cpuinfo`
    NUMJOBS="$(echo "$NUMCPUS*1.5" | bc)"
    NUMJOBS="$(printf '%.0f' $NUMJOBS)"
    alias pmake='time nice make -j$NUMJOBS'
    

    This allows me to, on my 8-core PC, queue up 12 build tasks at once, and make automatically builds up to 12 .cpp files in parallel, handling implicit build rules/dependencies, etc. I'm trying to achieve the same thing in Visual Studio Professional and MSBuild. When my builds are I/O bound, this approach has been the best at keeping all cores at ~100% usage.

    Thank you.

    • stijn
      stijn over 8 years
      Those options are very different - /MP is for C++ compilation, /m is for msbuild. Think of it this way: msbuild can also build C# projects, or custom projects of any language. /MP is only for C++ so even if you do not use msbuild but cl.exe directly it still has effect.
    • Cloud
      Cloud
      @stijn After reading some related articles, your comments have effectively answered my question. If you can re-create your comments as an answer, I will gladly mark it as "Accepted".
    • stijn
      stijn
      Please correct me if I'm wrong on this from a quick glance at what happens here you are correct. With those options both VS and msbuild build multiple projects at the same time (as far as dependencies allow) and spawns multiple compiler processes for each of those projects. Also note the log will contain numbers for each project becuase things happen interleaved now. If you really are concerned with build times you're going to have to try some combinations and see which works best on your particular machine though.