Merge pull request #11 from vtopt/develop

Version 2 release
vtopt · Apr 26, 2024 · fb560bd · fb560bd
2 parents f3ffef0 + c548b7b
commit fb560bd
Show file tree

Hide file tree

Showing 34 changed files with 26,492 additions and 17,098 deletions.
diff --git a/LICENSE b/LICENSE
@@ -1,7 +1,7 @@
 MIT License
 
 Copyright (c) 2020 Tyler H. Chang, Layne T. Watson, Thomas C. H. Lux,
-Ali R. Butt, Kirk W. Cameron, and Yili Hong.
+and other DELAUNAYSPARSE contributors.
 
 Permission is hereby granted, free of charge, to any person obtaining a copy
 of this software and associated documentation files (the "Software"), to deal

diff --git a/README.md b/README.md
@@ -36,10 +36,11 @@ Further detailed user information is documented in the `USAGE` document.
 
 The physical organization is as follows.
 
- * `src` contains the original unmodified Fortran source code, as published
-   in ACM TOMS Algorithm 1012. This includes 2 command line drivers
-   `samples.f90` (serial driver) and `samplep.f90` (parallel driver), which
-   can be used on formatted data files from the command line.
+ * `src` contains version 2 of the ACM TOMS Fortran source code, as published
+   in ACM TOMS Algorithm 1012 and modified in the subsequent remark.
+   This includes 2 command line drivers `samples.f90` (serial driver) and
+   `samplep.f90` (parallel driver), which can be used on formatted data files
+   from the command line.
    Comments at the top of each subroutine document their usage.
    See this directory's internal README for further information on
    building, testing, and usage.
@@ -59,28 +60,56 @@ The physical organization is as follows.
  * DelaunaySparse is shared under the MIT Software License, in the `LICENSE`
    file.
 
+## Version 2
+
+This is version 2 of DELAUNAYSPARSE.
+
+In the latest version, we have switched away from using the SLATEC subroutine
+DWNNLS for computing projections onto the convex hull. Instead, we favor the
+quadratic program solver BQPD of R. Fletcher, which is much more robust for
+large projection problems.
+
+Since this problem was somewhat nontrivial, we have added an additional
+external subroutine ``PROJECT`` to the ``delsparse.f90`` file, for
+external usage.
+
 ## Citation
 
 To cite this work, please use
 
 ```
-@article{TOMSalg1012,
-author = {Chang, Tyler H. and Watson, Layne T. and Lux, Thomas C. H. and Butt, Ali R. and Cameron, Kirk W. and Hong, Yili},
-title = {Algorithm 1012: {DELAUNAYSPARSE}: {I}nterpolation via a Sparse Subset of the {D}elaunay Triangulation in Medium to High Dimensions},
-year = {2020},
-publisher = {Association for Computing Machinery},
-address = {New York, NY, USA},
-volume = {46},
-number = {4},
-doi = {10.1145/3422818},
-journal = {ACM Trans. Math. Softw.},
-month = {nov},
-articleno = {38},
-numpages = {20}
+@article{TOMSalgorithm1012,
+    author = {Chang, Tyler H. and Watson, Layne T. and Lux, Thomas C. H. and Butt, Ali R. and Cameron, Kirk W. and Hong, Yili},
+    title = {Algorithm 1012: {DELAUNAYSPARSE}: {I}nterpolation via a Sparse Subset of the {D}elaunay Triangulation in Medium to High Dimensions},
+    year = {2020},
+    month = {nov},
+    publisher = {Association for Computing Machinery},
+    address = {New York, NY, USA},
+    journal = {ACM Trans. Math. Softw.},
+    volume = {46},
+    number = {4},
+    articleno = {38},
+    numpages = {20},
+    doi = {10.1145/3422818},
 }
 ```
 
+The new ``PROJECT`` subroutine released in Version 2 is described in
+
+```
+@article{TOMSremark1012,
+    author = {Chang, Tyler H. and Watson, Layne T. and Leyffer, Sven and Lux, Thomas C. H. and Almohri, Hussain M. J.},
+    title = {Remark on {Algorithm 1012}: Computing projections with large data sets},
+    year = {2024},
+    publisher = {Association for Computing Machinery},
+    address = {New York, NY, USA},
+    journal = {ACM Trans. Math. Softw.},
+    numpages = {7},
+    doi = {10.1145/3656581},
+    note = {In press}
+```
+
 ## Inquiries
 
 For further inquiries, contact
-Tyler Chang, [email protected].
+Tyler Chang, [email protected].
diff --git a/USAGE.md b/USAGE.md
@@ -172,18 +172,37 @@ On output:
     - 61 : A value that was judged appropriate later caused LAPACK to
            encounter a singularity. Try increasing the value of `EPS`.
 
-    The errors 70--72 were caused by the DWNNLS library from SLATEC, which
-    is only used during extrapolation. Note that there is a known issue
-    with this library, when it is linked against included public-domain
-    copy of BLAS/LAPACK, instead of an installed version
-    (i.e., `-lblas` `-llapack`).
-
-    - 70 : Allocation error for the extrapolation work arrays.
-    - 71 : The SLATEC subroutine `DWNNLS` failed to converge during the
-           projection of an extrapolation point onto the convex hull.
-    - 72 : The SLATEC subroutine `DWNNLS` has reported a usage error.
-
-   The errors 72, 80--83 should never occur, and likely indicate a
+    The errors 70+ can occur during extrapolation outside the convex hull.
+    Errors 71+ are received from the solver BQPD and should never occur.
+    Since extrapolation has a much higher memory overhead than interpolation,
+    if one of these errors occurs, it is likely that your system does not have
+    sufficient memory to support extrapolation for the given problem size.
+
+    - 70 : Allocation error for the extrapolation work arrays. Note that
+           extrapolation has a higher memory overhead than interpolation for
+           the current version.
+    - 71 : Unbounded problem detected. This error should never occur
+           unless there is a user error.
+    - 72 : bl(i) > bu(i) for some i This error should never occur
+           unless there is a user error.
+    - 73 : Infeasible problem detected in Phase 1 (should never occur).
+    - 74 : Incorrect setting of m, n, kmax, mlp, mode or tol. This is
+           extremely unlikely, but could indicate that the problem is
+           too large for usage of BQPD.
+    - 75 : Not enough space in lp. This error should not occur.
+           Double-check your usage, then contact authors if the issue
+           persists.
+    - 76 : Not enough space for reduced Hessian matrix (increase kmax).
+           This error should not occur. Double-check your usage, then
+           contact authors if the issue persists.
+    - 77 : Not enough space for sparse factors. This error should not
+           occur, but may indicate an un-anticipated problem size.
+           Double-check your usage then contact the authors if the issue
+           persists.
+    - 78 : Maximum number of unsuccessful restarts taken. This issue
+           should never occur since the projection problem is convex.
+
+   The errors 80--83 should never occur, and likely indicate a
    compiler bug or hardware failure.
 
     - 80 : The LAPACK subroutine `DGEQP3` has reported an illegal value.
@@ -253,18 +272,21 @@ Optional arguments:
 
  * `EXACT` is a logical input argument that determines whether the exact
    diameter should be computed and whether a check for duplicate data
-   points should be performed in advance. When `EXACT=.FALSE.`, the
-   diameter of `PTS` is approximated by twice the distance from the
-   barycenter of `PTS` to the farthest point in `PTS`, and no check is
-   done to find the closest pair of points, which could result in hard
-   to find bugs later on. When `EXACT=.TRUE.`, the exact diameter is
-   computed and an error is returned whenever PTS contains duplicate
-   values up to the precision `EPS`. By default `EXACT=.TRUE.`, but setting
-   `EXACT=.FALSE.` could result in significant speedup when `N` is large.
-   It is strongly recommended that most users leave `EXACT=.TRUE.`, as
+   points should be performed in advance. These checks are $O(N^2 D)$ time
+   complexity, while `DELAUNAYSPARSE` tends toward $O(N D^4)$ on average.
+   By default, `EXACT=.TRUE.` and the exact diameter is computed and an error
+   is returned whenever `PTS` contains duplicate values up to the precision
+   `EPS`. When `EXACT=.FALSE.`, the diameter of `PTS` is approximated by twice
+   the distance from the barycenter of `PTS` to the farthest point in `PTS`,
+   and no check is done to find the closest pair of points.
+   When `EXACT=.TRUE.`, `DELAUNAYSPARSE` could spend over 90% of runtime
+   calculating these constants, which are not critical to the `DELAUNAYSPARSE`
+   algorithm. In particular, this happens for large values of `N`. However,
    setting `EXACT=.FALSE.` could result in input errors that are difficult
-   to identify. Also, the diameter approximation could be wrong by up to
-   a factor of two.
+   to identify. It is recommended that users verify the input set `PTS`
+   and possibly rescale `PTS` manually while `EXACT=.TRUE.` Then, when
+   100% sure that `PTS` is valid, users may choose to set `EXACT=.FALSE.`
+   in production runs for large values of N to achieve massive speedups.
 
 
 Subroutines and functions directly referenced from BLAS are
@@ -278,16 +300,15 @@ and from LAPACK are
  * `DGETRS`,
  * `DORMQR`.
 
-The SLATEC subroutine
- * `DWNNLS` is also directly referenced.
+The BQPD driver
+ * `BQPD` is also directly referenced.
 
-`DWNNLS` and all its SLATEC dependencies have been slightly edited to
-comply with the Fortran 2008 standard, with all print statements and
-references to stderr being commented out. For a reference to `DWNNLS`,
-see ACM TOMS Algorithm 587 (Hanson and Haskell).
+`BQPD` and all its dependencies have been flattened into a single file `bqpd.f`.
+For a reference to `BQPD`, see
+   Annals of Operations Research, 46 : 307--334 (1993).
 The module `REAL_PRECISION` from HOMPACK90 (ACM TOMS Algorithm 777) is
-used for the real data type. The `REAL_PRECISION` module, `DELAUNAYSPARSES`,
-and `DWNNLS` and its dependencies comply with the Fortran 2008 standard.  
+used for the real data type. The `REAL_PRECISION` module, `DELAUNAYSPARSEP`,
+and `BQPD` and its dependencies comply with the Fortran 2008 standard.  
 
 ## DELAUNAYSPARSEP
 
@@ -415,18 +436,38 @@ On output:
     - 61 : A value that was judged appropriate later caused LAPACK to
            encounter a singularity. Try increasing the value of `EPS`.
 
-    The errors 70--72 were caused by the DWNNLS library from SLATEC, which
-    is only used during extrapolation. Note that there is a known issue
-    with this library, when it is linked against included public-domain
-    copy of BLAS/LAPACK, instead of an installed version
-    (i.e., `-lblas` `-llapack`).
-
-    - 70 : Allocation error for the extrapolation work arrays.
-    - 71 : The SLATEC subroutine `DWNNLS` failed to converge during the
-           projection of an extrapolation point onto the convex hull.
-    - 72 : The SLATEC subroutine `DWNNLS` has reported a usage error.
-
-   The errors 72, 80--83 should never occur, and likely indicate a
+    The errors 70+ can occur during extrapolation outside the convex hull.
+    Errors 71+ are received from the solver BQPD and should never occur.
+    Since extrapolation has a much higher memory overhead than interpolation,
+    if one of these errors occurs, it is likely that your system does not have
+    sufficient memory to support extrapolation for the given problem size.
+
+    - 70 : Allocation error for the extrapolation work arrays. Note that
+           extrapolation has a higher memory overhead than interpolation for
+           the current version.
+    - 71 : Unbounded problem detected. This error should never occur
+           unless there is a user error.
+    - 72 : bl(i) > bu(i) for some i This error should never occur
+           unless there is a user error.
+    - 73 : Infeasible problem detected in Phase 1 (should never occur).
+    - 74 : Incorrect setting of m, n, kmax, mlp, mode or tol. This is
+           extremely unlikely, but could indicate that the problem is
+           too large for usage of BQPD.
+    - 75 : Not enough space in lp. This error should not occur.
+           Double-check your usage, then contact authors if the issue
+           persists.
+    - 76 : Not enough space for reduced Hessian matrix (increase kmax).
+           This error should not occur. Double-check your usage, then
+           contact authors if the issue persists.
+    - 77 : Not enough space for sparse factors. This error should not
+           occur, but may indicate an un-anticipated problem size.
+           Double-check your usage then contact the authors if the issue
+           persists.
+    - 78 : Maximum number of unsuccessful restarts taken. This issue
+           should never occur since the projection problem is convex.
+
+
+   The errors 80--83 should never occur, and likely indicate a
    compiler bug or hardware failure.
 
     - 80 : The LAPACK subroutine `DGEQP3` has reported an illegal value.
@@ -500,18 +541,21 @@ Optional arguments:
 
  * `EXACT` is a logical input argument that determines whether the exact
    diameter should be computed and whether a check for duplicate data
-   points should be performed in advance. When `EXACT=.FALSE.`, the
-   diameter of `PTS` is approximated by twice the distance from the
-   barycenter of `PTS` to the farthest point in `PTS`, and no check is
-   done to find the closest pair of points, which could result in hard
-   to find bugs later on. When `EXACT=.TRUE.`, the exact diameter is
-   computed and an error is returned whenever PTS contains duplicate
-   values up to the precision `EPS`. By default `EXACT=.TRUE.`, but setting
-   `EXACT=.FALSE.` could result in significant speedup when `N` is large.
-   It is strongly recommended that most users leave `EXACT=.TRUE.`, as
+   points should be performed in advance. These checks are $O(N^2 D)$ time
+   complexity, while `DELAUNAYSPARSE` tends toward $O(N D^4)$ on average.
+   By default, `EXACT=.TRUE.` and the exact diameter is computed and an error
+   is returned whenever `PTS` contains duplicate values up to the precision
+   `EPS`. When `EXACT=.FALSE.`, the diameter of `PTS` is approximated by twice
+   the distance from the barycenter of `PTS` to the farthest point in `PTS`,
+   and no check is done to find the closest pair of points.
+   When `EXACT=.TRUE.`, `DELAUNAYSPARSE` could spend over 90% of runtime
+   calculating these constants, which are not critical to the `DELAUNAYSPARSE`
+   algorithm. In particular, this happens for large values of `N`. However,
    setting `EXACT=.FALSE.` could result in input errors that are difficult
-   to identify. Also, the diameter approximation could be wrong by up to
-   a factor of two.
+   to identify. It is recommended that users verify the input set `PTS`
+   and possibly rescale `PTS` manually while `EXACT=.TRUE.` Then, when
+   100% sure that `PTS` is valid, users may choose to set `EXACT=.FALSE.`
+   in production runs for large values of N to achieve massive speedups.
 
  * `PMODE` is an integer specifying the level of parallelism to be exploited.
     - If `PMODE = 1`, then parallelism is exploited at the level of the loop
@@ -537,13 +581,12 @@ and from LAPACK are
  * `DGETRS`,
  * `DORMQR`.
 
-The SLATEC subroutine
- * `DWNNLS` is also directly referenced.
+The BQPD driver
+ * `BQPD` is also directly referenced.
 
-`DWNNLS` and all its SLATEC dependencies have been slightly edited to
-comply with the Fortran 2008 standard, with all print statements and
-references to stderr being commented out. For a reference to `DWNNLS`,
-see ACM TOMS Algorithm 587 (Hanson and Haskell).
+`BQPD` and all its dependencies have been flattened into a single file `bqpd.f`.
+For a reference to `BQPD`, see
+   Annals of Operations Research, 46 : 307--334 (1993).
 The module `REAL_PRECISION` from HOMPACK90 (ACM TOMS Algorithm 777) is
 used for the real data type. The `REAL_PRECISION` module, `DELAUNAYSPARSEP`,
-and `DWNNLS` and its dependencies comply with the Fortran 2008 standard.  
+and `BQPD` and its dependencies comply with the Fortran 2008 standard.  
diff --git a/c_binding/LICENSE b/c_binding/LICENSE
@@ -1,7 +1,7 @@
 MIT License
 
 Copyright (c) 2020 Tyler H. Chang, Layne T. Watson, Thomas C. H. Lux,
-Ali R. Butt, Kirk W. Cameron, and Yili Hong.
+and other DELAUNAYSPARSE contributors.
 
 Permission is hereby granted, free of charge, to any person obtaining a copy
 of this software and associated documentation files (the "Software"), to deal
@@ -20,3 +20,7 @@ AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
 LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
 OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
 SOFTWARE.
+
+
+** Please note that bqpd.f is a dependency of DELAUNAYSPARSE but carries its
+own LICENSE file found in bqpd_min/LICENSE. **
diff --git a/c_binding/Makefile b/c_binding/Makefile
@@ -3,10 +3,9 @@ CC = gcc
 CFLAGS = -c
 OPTS = -fopenmp 
 LIBS = blas.f lapack.f
-LEGACY = -std=legacy
 
-all: test_install.o delsparse_bind_c.o delsparse.o slatec.o delsparse.h
-	$(FORT) $(OPTS) test_install.o delsparse_bind_c.o delsparse.o slatec.o $(LIBS) -o test_install
+all: test_install.o delsparse_bind_c.o delsparse.o bqpd.o delsparse.h
+	$(FORT) $(OPTS) test_install.o delsparse_bind_c.o delsparse.o bqpd.o $(LIBS) -o test_install
 	./test_install
 
 test_install.o: test_install.c
@@ -18,8 +17,8 @@ delsparse_bind_c.o: delsparse_bind_c.f90 delsparse.o
 delsparse.o: delsparse.f90 
 	$(FORT) $(CFLAGS) $(OPTS) delsparse.f90 -o delsparse.o
 
-slatec.o : slatec.f
-	$(FORT) $(CFLAGS) $(OPTS) $(LEGACY) slatec.f -o slatec.o
+bqpd.o : bqpd_min/bqpd.f
+	$(FORT) $(CFLAGS) $(OPTS) $(LEGACY) bqpd_min/bqpd.f -o bqpd.o
 
 clean:
 	rm -f *.o *.mod test_install
diff --git a/c_binding/README b/c_binding/README
@@ -38,5 +38,5 @@ USAGE:
 
 CONTRIBUTORS:
 
-  Tyler Chang, [email protected]
+  Tyler Chang, [email protected]
 
diff --git a/c_binding/bqpd_min/LICENSE b/c_binding/bqpd_min/LICENSE
@@ -0,0 +1,13 @@
+BQPD carries the following copyright notice:
+
+BQPD is copyrighted by the University of Dundee, UK. The library
+was developed by Roger Fletcher and Sven Leyffer. It is made available
+"as is" without express or implied warranty. Permission for distributing
+this library must be obtained from Roger Fletcher at the University of
+Dundee ([email protected]).
+
+
+Via this LICENSE file:
+
+Permission is hereby granted to distribute BQPD for research purposes only.
+For more information, please contact Sven Leyffer.
Original file line number	Diff line number	Diff line change
Expand Up		@@ -38,5 +38,5 @@ USAGE:

		CONTRIBUTORS:

		Tyler Chang, [email protected]
		Tyler Chang, [email protected]