Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

v4-CMake #430

Open
wants to merge 46 commits into
base: v4
Choose a base branch
from
Open

v4-CMake #430

wants to merge 46 commits into from

Conversation

otbrown
Copy link

@otbrown otbrown commented Jul 2, 2024

Hi @TysonRayJones,

Creating a PR to discuss and critique my work on the new build system :) I'll use this as a place to keep notes and thoughts too.

I've been through and identified the following open issues which I'll try to address, as well as my own biases:

Notes:

  • I've bumped from 3.7 to 3.18 (which happens to be the version available in Debian 11 apt) but consider it a placeholder until the build system is complete, at which point I'll try to figure out a real minimum. I'd be very surprised if it's below 3.13, but might be willing to forgo 3.2x features...
  • I personally dislike the build in place user executable functionality that was in the old CMake, but I'll make sure it continues to be supported alongside more a standard library build/install target.
  • I will need support testing against QuESTLink/Windows things!

@TysonRayJones
Copy link
Member

Wew sounds good! I'm happy to cop the Windows testing. I'll setup CI too once the build is runnable (and probably a linter, since I've already slopped in some spaghetti).

I agree it's clumsy/overbearing/amateurish to build the user's executable, though I suspect a good chunk of QuEST's user base have physics backgrounds and lack the requisite skills to make their own build, or integrate QuEST into a bigger stack. I want to make compilation as easy as possible for that group somehow. I'm open to better ideas!

@otbrown
Copy link
Author

otbrown commented Jul 8, 2024

Can confirm that I can build and run a (very simple) example code using the available new API with and without multithreading enabled (though it doesn't deploy with MT at one qubit...), so now I can get started on the fun stuff.

Notes:

  • Include paths need to be rationalised but this goes hand-in-hand with setting up the install targets, since we can use separate generators for the build and install includes.
  • Once I've done that I can also reimplement the user executable functionality.
  • Similarly, I can now look at install prefix and library naming conventions #351.
  • On the other hand I can't make much headway with the GPU build(s) until tests are fixed.

@otbrown
Copy link
Author

otbrown commented Jul 9, 2024

Bump to CMake 3.14 incoming for policy CMP0082.

@TysonRayJones
Copy link
Member

Ahh I'm sorry for the rebasing and conflicting the PR! I hadn't realised the source files were being modified too. I'm intending to rename some files (structures to matrices since it' already quite long; other new data structures will have their own files) - is it okay if I rebase these too and we sort out the conflicts later? 😅

Btw are the relative path includes changed in b87b7f5 essential for CMake building? I was following Kolpackov's project structure (but preserving src and include subdirecs), and enjoyed the explicit separation (e.g. like in validation.cpp) of:

// API
#include "quest/include/modes.h"
#include "quest/include/environment.h"
#include "quest/include/qureg.h"
#include "quest/include/structures.h"

// core
#include "quest/src/core/errors.hpp"
#include "quest/src/core/bitwise.hpp"
#include "quest/src/core/memory.hpp"
#include "quest/src/core/utilities.hpp"
#include "quest/src/comm/comm_config.hpp"
#include "quest/src/cpu/cpu_config.hpp"
#include "quest/src/gpu/gpu_config.hpp"

// external library
#include <iostream>
#include <cstdlib>
#include <vector>
#include <map>

It was also consistent in that all paths were relative to the directory containing quest.

If I understand right, with the new includes, some are relative to include/ (fair enough)

#include "types.h"

and the rest are relative to the source file location (e.g. this in comm/):

#include "../core/errors.hpp"

This seems less elegant and consistent - is it necessary for compilation, or offer another benefit I've missed?

@otbrown
Copy link
Author

otbrown commented Jul 10, 2024

No problem! Any conflicts should be pretty trivial to resolve, as I wouldn't expect to touch anything more than includes, so happy to just keep merging as and when.

Apologies, I'd assumed it was just an artifact of the make build 😅

Advantages of the way I've done it:

  • Install directory has a standard structure: include, libs, bins. Users just need to add -I<installdir>/include to build their programs using the public API.
  • Relative includes don't require a -I flag at all at compile time, and are slightly less sensitive to changes in the project structure.

However, through CMake, all things are possible 😉

  • Build and install interfaces for include directories can be set separately so I can make CMake export whatever we like as as the install include interface, and we can just tell users how to correctly write their includes relative to that.
  • Setting the build includes to anything is trivial and really doesn't affect users at all 🤷‍♂️

So very happy to change it all back, it's just a find/replace job! It's your project, so let me know which you'd prefer.

@otbrown
Copy link
Author

otbrown commented Sep 3, 2024

Library renaming is now available -- the user can either just provide a custom string to produce lib${LIB_NAME}.so, or turn on VERBOSE_LIB_NAME to append to the lib name based on configuration options. This should all play nicely with the install export stuff too, though I haven't tested that.

GPU acceleration and MPI distribution are now also available, but there are two issues:

  • cuQuantum: nvidia don't provide a CMake integration, and I don't realllllly want to write a findcuQuantum for them, though I will if I have to. Need to ponder that one.
  • HIP: It looks as if the HIP version builds just by using the HIP compiler to compile the CUDA which is nice, but may present some issues for CMake. The simplest solution (for me) is to just split ENABLE_GPU_ACCELERATION into ENABLE_CUDA and ENABLE_HIP, but will see if there's a more creative solution! Never mind, there's already a USE_HIP option at play in v3, so I'll just make use of that! 😁

@otbrown
Copy link
Author

otbrown commented Sep 13, 2024

most users to build cuStateVec via CUDA Quantum (which we should expressly not do)

Almost certainly! I will probably just relent and write a simple findCuQuantum. This works fine as long as standard installations of it have a consistent directory structure.

you can choose any name that makes sense / is consistent, like COMPILE_HIP

For the compiler flags COMPILE_CUDA still makes sense as what's written is CUDA whether we compile it with nvcc or hipcc, and there's no point changing the pragmas. On the other hand I may ditch ENABLE_GPU_ACCELERATION in favour of ENABLE_CUDA and ENABLE_HIP a) because I don't see a scenario where someone tries to build both, and if they do they have bigger problems than the CMake, and b) both are shorter than ENABLE_GPU_ACCELERATION. 😃

Note that the GPU code (even when not compiling cuStateVec) should use Thrust,

Good to know! Seems like both Thrust and rocThrust export CMake targets, so should at least be able to make this an explicit dependency. I'll look into it (no point adding it if it breaks builds that actually should work...).

feel free to defer HIP support

I actually looked into it already on the ARCHER2 GPU testbed, and enabling HIP support is super easy with CMake 3.21+ (for context, the latest is 3.30, so we're still well behind the curve). Shouldn't be an issue therefore +/- potential Thrust issues. I can at least get HIP support in and building for now, and then see if it actually runs in the testing/benchmarking phase.

Perfect, thanks for the versions, I'll add them in!

@otbrown
Copy link
Author

otbrown commented Sep 13, 2024

Note: Currently #include <cuda.h> in gpu_config.cpp breaks the HIP build -- I'll include AMD support in the CMake for now and we can always just remove it later!

@otbrown
Copy link
Author

otbrown commented Sep 13, 2024

Eurgh, so although Thrust has a CMake target, but for some reason CUDAToolkit doesn't expose it 🙄 Best option for now to make this an explicit dependency is probably to figure out what version of CUDAToolkit bundled the minimum version of Thrust and then set that as a minimum version of CUDAToolkit... We could insist on explicitly linking Thrust, but I feel it's not worth getting the user to dig through NVIDIA install directories when the code would actually link fine.

otbrown and others added 19 commits September 13, 2024 15:38
which progress toward making the v3 deprecated unit tests run without error.

We also relax exclusivity of multithreading and GPU-acceleration, which was pointless and annoying.
This is a scratch commit and should not be merged into non-scratch branches. The final v3 tests should be committed by first git moving the old tests into v3 subfolder, and only in a subsequent commit should in-file changes be committed (preserving git blame/dif). It is possible too that we will wish to re-use `test_utilities` for the v4 tests.

This commit also includes a hacky unix `compile.sh` script for compiling v3 tests
@otbrown otbrown marked this pull request as ready for review December 31, 2024 12:25
@otbrown
Copy link
Author

otbrown commented Dec 31, 2024

Hi @TysonRayJones!

Hope you're well, and have found some time to enjoy a holiday in amongst the development work...

I've marked this as ready for review now, as I think what we have is enough to be useful for your planned 'developer' release, but naturally it's your call whether you'd like to merge now or later. Note the TODOs listed below. I've added docs/cmake.md to list the new CMake options.

I've got a paper to review, PhD thesis corrections to approve, and I'm getting married next week, so I probably won't be looking at this again before I'm back in the office 😅

Happy new year! 🎊

CMake TODO:

  • Implement CuQuantum support. The CMake config variable exists but doesn't really do anything currently. As mentioned before this may require me to write a FindCuQuantum which should simple enough, but once I'm back in the office! This is the most egregious CMake specific issue now.
  • What version of clang are you using? I stil get compilation failures on the tests with clang 16 and clang 18 I'm afraid. So I still need to test the tests, but all the required CMake is there I think.
  • Test HIP support.
  • Test compilation on Windows.
  • Update compilation documentation, dependent on the tutorial program.

@TysonRayJones
Copy link
Member

Wew amazing! Happy holidays to you, and a massive congratulations on your wedding!!! 🎉 🎉

I've got two items (🤞) left on my backlog before I'll start using your CMake build and test it on as many platforms as I can access.

To your dot points:

  1. I recall having trouble linking cuStateVec with CMake in QuEST v3 because I was trying to support the despairingly old default CMake version on the Oxford cluster (as I lament here). I'm happy for us to force a modern CMake minimum version if it helps.
  2. Oh my bad! I've been using only Clang 15 to run the v3 tests, i.e. test the deprecation API. It is probably that optional hacky header causing the problems - I'll check it out!
  3. I'll have a go at compiling with HIP and on Windows, and can also update the revised tutorial doc - your args page looks brilliant!

Thanks very much for all your fantastic work so far! 🙌

@TysonRayJones
Copy link
Member

Howdy! I've finally had time to use the CMake build. I get it to compile by...

  1. copying over the tests folder from the scratch-v3-tests branch (which has some patches)
  2. retaining your CMakeLists.txt from this PR (otherwise overwritten by above)
  3. replacing the reference to DEAULT_VALIDATION_EPSILON (forgive the misspelling eep) in line 38 of test_utilities.hpp with a constant 1E-10.

Of course I'll fix these shenanigans!

Disabling the tests, I can indeed compile (on my Macbook) with both clang 15.0.0 and gcc 14.2.0 🎉

The tests do not compile (except in clang 15) due to deprecated.h's use of _Generic, like here. I've soberingly realised that this C11 syntax is not adopted by C++17 😢 (test here). Only a handful of compilers (like the Frankenstein's monster that is Clang) will accept it as an extension. Indeed with multithreading disabled (see below), I can successfully compile with clang 15.0.0, though as you've noted, even that may go away with newer clang versions!

So, let's just give up on cross-platform supporting the v3 unit tests, which seek only to test the deprecation API anyway. We'll instead target new, v4, C++17 compatible unit tests, which I'll prepare. The deprecation tests can run on old clang offline, since they're inessential ¯\(ツ)

Some other notes:

  • The doc suggests cmake -Bbuild . from the root dir, in lieu of the standard(?) cmake .. from within build. Not a problem, though we should note users should subsequently call make -C build to avoid confusion.
  • Some additional bull shittery is needed to get OpenMP working with Clang compilers, shown here. We can direct Clang users to install libomp in the doc.

I channeled my _Generic frustrations inspired this meme:

img

@otbrown
Copy link
Author

otbrown commented Jan 22, 2025

Thanks for taking the time to give it a whirl! Glad it's mostly working on someone else's computer.

It's a shame the deprecated API won't work, although reworking the tests probably is better in the long run.

To your notes:

  • Good point! Or indeed using cd build && make. I can add this, no problem.
  • Good spot! I was confused at first but no doubt I installed libomp-dev at some point not long after I last refreshed my VM 😅

It is weird (and a shame) that _Generic support was dropped 😞

I'm back in the office now, so I'll carve out some time around reviewing QCTiP submissions to test on our AMD GPU nodes and look into CuQuantum (finally!).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants