This work was supported by the Mathematical, Information, and Computational Sciences Division subprogram of the Office of Computational and Technology Research, U.S. Department of Energy, under Contract W-31-109-Eng-38.
The Toolkit for Advanced Optimization (TAO) focuses on the development of algorithms and software for the solution of large-scale optimization problems on high-performance architectures. Areas of interest include nonlinear least squares, unconstrained and bound-constrained optimization, and general nonlinear optimization.
The development of TAO was motivated by the scattered support for parallel computations and the lack of reuse of external toolkits in current optimization software. Our aim is to use object-oriented techniques to produce high-quality optimization software for a range of computing environments ranging from serial workstations and laptops to massively parallel high-performance architectures. Our design decisions are strongly motivated by the challenges inherent in the use of large-scale distributed memory architectures and the reality of working with large, often poorly structured legacy codes for specific applications.
This manual describes the use of TAO. Since TAO is still under development, changes in usage and calling sequences may occur. TAO is fully supported; see the the web site http://www.mcs.anl.gov/tao for information on contacting the TAO developers.
The initial development of TAO was funded by the ACTS Toolkit Project in the Office of Advanced Scientific Computing Research, U.S. Department of Energy. We gratefully acknowledge their support.
TAO owes much to the developers of PETSc. We have benefitted from their experience, tools, software, and advice. In many ways, TAO is a natural outcome of the PETSc development. TAO has also benefitted from the work of various researchers who have provided solvers, test problems, and interfaces. In particular, we acknowledge
Finally, we thank all TAO users for their comments, bug reports, and encouragement.
The Toolkit for Advanced Optimization (TAO) focuses on the design and implementation of optimization software for the solution of large-scale optimization applications on high-performance architectures. Our approach is motivated by the scattered support for parallel computations and lack of reuse of linear algebra software in currently available optimization software. The TAO design allows the reuse of toolkits that provide lower-level support (parallel sparse matrix data structures, preconditioners, solvers), and thus we are able to build on top of these toolkits instead of having to redevelop code. The advantages in terms of efficiency and development time are significant.
The TAO design philosophy uses object-oriented techniques of data and state encapsulation, abstract classes, and limited inheritance to create a flexible optimization toolkit. This chapter provides a short introduction to our design philosophy by describing the objects in TAO and the importance of this design. Since a major concern in the TAO project is the performance and scalability of optimization algorithms on large problems, we also present some performance resuls.
The TAO design philosophy place strongs emphasis on the reuse of external tools where appropriate. Our design enables bidirectional connection to lower-level linear algebra support (e.g. parallel sparse matrix data structures) provided in toolkits such as PETSc [3] [4,2] as well as higher-level application frameworks. Our design decisions are strongly motivated by the challenges inherent in the use of large-scale distributed memory architectures and the reality of working with large and often poorly structured legacy codes for specific applications. Figure 1.1 illustrates how the TAO software works with external libraries and application code.
The TAO solvers use four fundamental objects to define and solve optimization problems: vectors, index sets, matrices, and linear solvers. The concepts of vectors and matrices are standard, while an index set refers to a set of integers used to identify particular elements of vectors or matrices. An optimization algorithm is a sequence of well defined operations on these objects. These operations include vector sums, inner products, and matrix-vector multiplication. TAO makes no assumptions about the representation of these objects by passing pointers to data-structure-neutral objects for the execution of these numerical operations.
With sufficiently flexible abstract interfaces, TAO can support a variety of implementations of data structures and algorithms. These abstractions allow us to more easily experiment with a range of algorithmic and data structure options for realistic problems, such as within this case study. Such capabilities are critical for making high-performance optimization software adaptable to the continual evolution of parallel and distributed architectures and the research community's discovery of new algorithms that exploit their features.
Our current TAO implementation uses the parallel system infrastructure and linear algebra objects offered by PETSc, which uses MPI [13] for all interprocessor communication. The PETSc package supports objects for vectors, matrices, index sets, and linear solvers.
The TAO design philosophy eliminates some of the barriers in using independently developed software components by accepting data that is independent of representation and calling sequence written for particular data formats. The user can initialize an application with external frameworks, provide function information to a TAO solver, and call TAO to solve the application problem.
The use of abstractions for matrices and vectors in TAO optimization software also enables us to leverage automatic differentiation technology to facilitate the parallel computation of gradients and Hessians needed within optimization algorithms. We have demonstrated the viability of this approach through preliminary interfacing between TAO solvers and the automatic differentiation tools ADIFOR and ADIC. We are currently working on developing TAO interfaces that use special problem features (for example, partial separability, stencil information) in automatic differentiation computations.
A major concern in the TAO project is the performance and scalability of optimization algorithms on large problems. In this section we focus on the GPCG (gradient projection, conjugate gradient) algorithm for the solution of bound-constrained convex quadratic programming problems. Originally developed by Moré and Toraldo [20], the GPCG algorithm was designed for large-scale problems but had only been implemented for a single processor. GPCG combines the advantages of the identification properties of the gradient projection method with the finite termination properties of the conjugate gradient method. Moreover, the performance of the TAO implementation on large optimization problems is noteworthy.
We illustrate the performance of the GPCG algorithm by
presenting results for a journal bearing problem
with over 2.5 million variables.
The journal bearing problem
is a finite element approximation to a variational problem
over a rectangular two-dimensional grid. A
grid with
points in each direction, for example, is formulated
as a bound constrained quadratic problem with
variables.
The triangulation of the grid results in a matrix that has the
usual five diagonal nonzero structure that arises
from a difference approximation to the Laplacian operator.
The journal bearing problem contains an eccentricity parameter,
, that influences the number of active
variables at the solution and the difficulty in solving it.
Figure 1.2 shows the solution of the journal bearing problem
for
. The steep gradient in the solution
makes this problem a difficult benchmark.
The performance results in Table 1.1 are noteworthy is several
ways. First of all, the number of faces visited by GPCG is remarkably
small. Other strategies can lead to a large number of gradient
projection iterates, but the GPCG algorithm is remarkably efficient.
Another interesting aspect of these results is that due to the low
memory requirements of iterative solvers, we were able to solve these
problems with only
processors. Strategies that rely on
direct solvers are likely to need significantly more storage, and thus
more processors. Finally, these results show that the GPCG
implementation has excellent efficiency. For example, the efficiency
of GPCG with respect to
processors ranges between
and
when
. This sustained efficiency
is remarkable since the GPCG algorithm is solving a sequence of linear
problems with a coefficient matrix set to the submatrix of the Hessian
matrix with respect to the free variables for the current iterate.
Thus, our implementation's repartitioning of submatrices effectively
deals with the load-balancing problem that is inherent in the GPCG
algorithm.
An important aspect of our results that is not apparent from Table 1.1 is that for these results we were able to experiment easily with all the preconditioners offered by PETSc. In particular, we were able to compare the diagonal Jacobi preconditioner with block Jacobi and overlapping additive Schwarz preconditioners that use a zero-fill ILU solver in each block. We also experimented with a parallel zero-fill incomplete Cholesky preconditioner provided by a PETSc interface to the BlockSolve95 [15] package of Jones and Plassmann. Interestingly enough, the diagonal Jacobi preconditioner achieved better performance on this problem.
TAO contains unconstrained minimization, bound constrained minimization, and nonlinear complementarity solvers. The structure of these problems can differ significantly, but TAO has a similar interface to all of its solvers. Routines that most solvers have in common will be discussed in this chapter. A complete list of options can be found by consulting the manual pages. Many of the options can also be set at the command line. These options can also be found in manual pages or by running a program with the -help option.
info = TaoInitialize(int *argc,char ***argv,char *file_name,
char *help_message);
This command initializes TAO, as well as MPI, PETSc, and other packages
to which TAO applications may link (if these have not yet
been initialized elsewhere).
In particular, the arguments argc and
argv are the command line arguments delivered in all C and C++
programs; these arguments initialize the options database.
The argument file_name
optionally indicates an alternative name for an options file, which by
default is called .petscrc and resides in the user's home directory.
One of the last routines that all TAO programs should call is
info = TaoFinalize();This routine finalizes TAO and any other libraries that may have been initialized during the TaoInitialize() phase. For example, TaoFinalize() calls MPI_Finalize() if TaoInitialize() began MPI. If MPI was initiated externally from TAO (by either the user or another software package), then the user is responsible for calling MPI_Finalize().
A TAO solver can be created with the command
info = TaoCreate(MPI_Comm comm,TaoMethod method,TAO_SOLVER *newsolver);The first argument in this routine is an MPI communicator indicating which processes are involved in the solution process. In most cases, this should be set to MPI_COMM_WORLD. The second argument in this creation routine specifies the default method that should be be used to solve the optimization problem. The third argument in TaoCreate() is a pointer to a TAO solver object. This routine creates the object and returns it to the user. The TAO object is then to be used in all TAO routines.
The various types of TAO solvers and the flags that identify them will be discussed in the following chapters. The solution method should be carefully chosen depending upon the problem that is being solved. Some solvers, for instance, are meant for problems with no constraints, while other solvers acknowledge constraints in the problem and solve them accordingly. The user must also be aware of the derivative information that is available. Some solvers require second-order information, while other solvers require only gradient or function information. The TaoMethod can also be set to TAO_NULL in the TaoCreate() routine if the user selects a method at runtime using the options database. The command line option -tao_method followed by an TAO method will override any method specified by the second argument. The command line option -tao_method tao_lmvm, for instance, will specify the limited memory variable metric method for unconstrained optimization. Note that the TaoMethod variable is a string that requires quotation marks in an application program, but quotation marks are not required at the command line. The method that TAO uses to solve an optimization problem can be changed at a later point in the program with the command TaoSetMethod(), whose arguments are a TAO solver and a string that uniquely identifies a method for solving the problem.
Each TAO solver that has been created should also be destroyed using the command
info = TaoDestroy(TAO_SOLVER solver);This routine frees the internal data structures used by the solver.
Although TAO and its solvers set default parameters that are useful for many problems, it may be necessary for the user to modify these parameters to change the behavior and convergence of various algorithms.
One convergence criterion for most algorithms concerns the
of digits of accuracy needed in the solution. In particular,
one convergence test employed by TAO attempts to stop when
the error in the constraints is less than
,
and either
Other stopping criteria include a minimum trust region radius or a maximum number of iterations. These parameters can be set with the routines TaoSetTrustRegionTolerance() and TaoSetMaximumIterates(). Similarly, a maximum number of function evaluations can be set with the command TaoSetMaximumFunctionEvaluations() .
The routine
int TaoSolveApplication(TAO_APPLICATION, TAO_SOLVER);will apply the solver to the application that has been created by the user.
To see parameters and performance statistics for the solver, the routine
int TaoView(TAO_SOLVER);can be used. This routine will display to standard output the number of function evaluations need by the solver and other information specific to the solver.
The progress of the optimization solver can be monitored with the runtime option -tao_monitor. Although monitoring routines can be customized, the default monitoring routine will print out several relevant statistics to the screen.
The user also has access to information about the current solution. The current iteration number, objective function value, gradient norm, infeasibility norm, and step length can be retrieved with the command
int TaoGetSolutionStatus(TAO_SOLVER tao, int* iterate, double* f,
double* gnorm, double *cnorm, double *xdiff,
TaoTerminateReason *reason)
The last argument returns
a code that indicates the reason that the solver terminated. Positive
numbers indicate that a solution has been found, while negative numbers
indicate a failure. A list of reasons can be found in the manual page
for TaoGetTerminationReason().
The user set vectors containing the solution and gradient before solving the problem, but pointers to these vectors can also be retrieved with the commands TaoGetSolution() and TaoGetGradient(). Dual variables and other relevant information are also available. This information can be obtained during user-defined routines such as a function evaluation and customized monitoring routine, or after the solver has terminated.
Each solver has a set of options associated with it that can be set with command line arguments. A brief description of these algorithms and the associated options are discussed in this chapter.
This solver keeps a set of
sorted vectors
and their corresponding
objective function values
. At each iteration,
is removed from
the set and replaced with
where
can be one of
depending upon the values of
each possible
.
The algorithm terminates when the residual
becomes sufficiently small. Because of
the way new vectors can be added to the sorted set,
the minimum function value and/or the residual may not be impacted at each iteration.
There are two options that can be set specifically for the Nelder-Mead algorithm,
-tao_nm_lamda <value> sets the initial set of vectors (
plus
value in each cartesion direction), the default value is
.
tao_nm_mu <value> sets the value of
,
the default is
.
The limited-memory, variable-metric method solves the system of equations
The primary factors determining the behavior of this algorithm are the
number of vectors stored for the Hessian approximation and the scaling matrix
used when computing the direction. The number of vectors stored can be set
with the command line argument -tao_lmm_vectors <int>;
is the
default
value. Increasing the number of vectors results in a better Hessian
approximation and can decrease the number of iterations required to compute
a solution to the optimization problem. However, as the number of vectors
increases, more memory is consumed and each direction calculation takes
longer to compute. Therefore, a trade off must be made between the
quality of the Hessian approximation, the memory requirements, and
the time to compute the direction.
During the computation of the direction, the inverse of an initial
Hessian approximation
is applied. The choice of
has a significant impact on the quality of the direction obtained
and can result in a decrease in the number of function and gradient
evaluations required to solve the optimization problem. However,
the calculation of
at each iteration can have a significant
impact on the time required to update the limited-memory BFGS
approximation and the cost of obtaining the direction. By default,
is a diagonal matrix obtained from the diagonal entries
of a Broyden approximation to the Hessian matrix. The calculation
of
can be modified with the command line argument
-tao_lmm_scale_type <none,scalar,broyden>. Each scaling
method is described below. The scalar and broyden
techniques are inspired by [].
An additional rescaling of the diagonal matrix can be applied to further improve performance when using the broyden scaling method. The rescaling method can be set with the command line argument -tao_lmm_rescale_type <none,scalar,gl>; scalar is the default rescaling method. The rescaling method applied can have a large impact on the number of function and gradient evaluations necessary to compute a solution to the optimization problem, but increases the time required to update the BFGS approximation. Each rescaling method is described below. These techniques are inspired by [].
Finally, a limit can be placed on the difference between the scaling matrix computed at this iteration and the previous value for the scaling matrix. The limiting type can be set with the command line argument -tao_lmm_limit_type <none,average,relative,absolute>; none is the default value. Each of these methods is described below when using the scalar scaling method. The techniques are the same when using the broyden scaling method, but are applied to each entry in the diagonal matrix.
The default values for the scaling are based on many tests using the unconstrained problems from the MINPACK-2 test set. These tests were used to narrow the choices to a few sets of values. These values were then run on the unconstrained problems from the CUTEr test set to obtain the default values supplied.
| Name | Value | Default | Description |
| -tao_lmm_vectors | int | 5 | Number of vectors for Hessian approximation |
| -tao_lmm_scale_type | none, scalar, broyden | broyden | Type of scaling method to use |
| -tao_lmm_scalar_history | int | 5 | Number of vectors to use when scaling |
| -tao_lmm_scalar_alpha | double | 1 | Value of |
| -tao_lmm_broyden_phi | double | 0.125 | Value of |
| -tao_lmm_rescale_type | none, scalar, gl | scalar | Type of rescaling method to use |
| -tao_lmm_rescale_history | int | 5 | Number of vectors to use when rescaling |
| -tao_lmm_rescale_alpha | double | 1 | Value of |
| -tao_lmm_rescale_beta | double | 0.5 | Value of |
| -tao_lmm_limit_type | none, average, relative, absolute | none | Type of limit to impose on scaling matrix |
| -tao_lmm_limit_mu | double | 1 | Value of |
| -tao_lmm_limit_nu | double | 100 | Value of |
The nonlinear conjugate gradient method can be viewed as an extensions of the conjugate gradient method for solving symmetric, positive-definite linear systems of equations. This algorithm requires only function and gradient evaluations as well as a line search. The TAO implementation uses a Moré-Thuente line search to obtain the step length. The nonlinear conjugate gradient method can be selected by using the TaoMethod tao_cg. For the best efficiency, function and gradient evaluations should be performed simultaneously when using this algorithm.
Five variations are currently supported by the TAO implementation: the Fletcher-Reeves method, the Polak-Ribiére method, the Polak-Ribiére-Plus method[25], the Hestenes-Stiefel method, and the Dai-Yuan method. These conjugate gradient methods can be specified by using the command line argument tao_cg_type <fr,pr,prp,hs,dy>, respectively. The default value is prp.
The conjugate gradient method incorporates automatic restarts when successive
gradients are not sufficiently orthogonal. TAO measures the orthogonality by
dividing the inner product of the gradient at the current point and the
gradient at the previous point by the square of the Euclidean norm of
the gradient at the current point. When the absolute value of this
ratio is greater than
, the algorithm restarts using the gradient
direction. The parameter
can be set using the command line argument
-tao_cg_eta <double>; 0.1 is the default value.
The Newton line-search method solves the symmetric system of equations
The system of equations is approximately solved by applying the conjugate gradient method, Steihaug-Toint conjugate gradient method, generalized Lanczos method, or an alternative Krylov subspace method supplied by PETSc. The method used to solve the systems of equations is specified with the command line argument -tao_nls_ksp_type <cg,stcg,gltr,petsc>; cg is the default. When the type is set to petsc, the method set with the PETSc -ksp_type command line argument is used. For example, to use GMRES as the linear system solver, one would use the the command line arguments -tao_nls_ksp_type petsc -ksp_type gmres. Internally, the PETSc implementations for the conjugate gradient methods and the generalized Lanczos method are used. See the PETSc manual for further information on changing the behavior of the linear system solvers.
A good preconditioner reduces the number of iterations required to solve the linear system of equations. For the conjugate gradient methods and generalized Lanczos method, this preconditioner must be symmetric and positive definite. The available options are to use no preconditioner, the absolute value of the diagonal of the Hessian matrix, a limited-memory BFGS approximation to the Hessian matrix, or one of the other preconditioners provided by the PETSc package. These preconditioners are specified by the command line argument -tao_nls_pc_type <none,ahess,bfgs,petsc>, respectively. The default is the bfgs preconditioner. When the preconditioner type is set to petsc, the preconditioner set with the PETSc -pc_type command line argument is used. For example, to use an incomplete Cholesky factorization for the preconditioner, one would use the command line arguments -tao_nls_pc_type petsc -pc_type icc. See the PETSc manual for further information on changing the behavior of the preconditioners.
The choice of scaling matrix can have a significant impact on the quality of the Hessian approximation when using the bfgs preconditioner and affect the number of iterations required by the linear system solver. The choices for scaling matrices are the same as those discussed for the limited-memory, variable-metric algorithm. For Newton methods, however, the option exists to use a scaling matrix based on the true Hessian matrix. In particular, the implementation supports using the absolute value of the diagonal of the Hessian matrix or the absolute value of the diagonal of the perturbed Hessian matrix. The scaling matrix to use with the bfgs preconditioner is set with the command line argument -tao_nls_bfgs_scale_type <bfgs,ahess,phess>; phess is the default. The bfgs scaling matrix is derived from the BFGS options. The ahess scaling matrix is the absolute value of the diagonal of the Hessian matrix. The phess scaling matrix is the absolute value of the diagonal of the perturbed Hessian matrix.
The perturbation
is added when the direction returned by the
Krylov subspace method is either not a descent direction, the Krylov method
diverged due to an indefinite preconditioner or matrix, or a direction of
negative curvature was found. In the two latter cases, if the step returned
is a descent direction, it is used during the line search. Otherwise, a
steepest descent direction is used during the line search. The perturbation
is decreased as long as the Krylov subspace method reports success and
increased if further problems are encountered. There are three cases:
initializing, increasing, and decreasing the perturbation. These cases
are described below.
When using stcg or gltr to solve the linear systems of equation,
a trust-region radius need to be initialized and updated. This trust-region
radius limits the size of the step computed. The method for initializing
the trust-region radius is set with the command line argument
-tao_nls_init_type <constant,direction,interpolation>;
interpolation, which chooses an initial value based on the
interpolation scheme found in [5], is the default. This
scheme performs a number of function and gradient evaluations to determine
a radius such that the reduction predicted by the quadratic model along the
gradient direction coincides with the actual reduction in the nonlinear
function. The iterate obtaining the best objective function value is
used as the starting point for the main line-search algorithm. The
constant method initializes the trust-region radius by using
the value specified with the -tao_trust0 <double> command line
argument, where the default value is 100. The direction technique
solves the first quadratic optimization problem by using a standard
conjugate gradient method and initializes the trust-region to
.
Finally, the method for updating the trust-region radius is set with the
command line argument
-tao_nls_update_type <step,reduction,interpolation>; step
is the default. The step method updates the trust-region
radius based on the value of
. In particular,
| Name | Value | Default | Description |
| -tao_nls_ksp_type | cg, stcg, gltr, petsc | cg | Type of Krylov subspace method to use when solving linear system |
| -tao_nls_pc_type | none, ahess, bfgs, petsc | bfgs | Type of preconditioner to use when solving linear system |
| -tao_nls_bfgs_scale_type | ahess, phess, bfgs | phess | Type of scaling matrix to use with BFGS preconditioner |
| -tao_nls_sval | double | Initial perturbation value | |
| -tao_nls_imin | double | Minimum initial perturbation value | |
| -tao_nls_imax | double | Maximum initial perturbation value | |
| -tao_nls_imfac | double | Factor applied to norm of gradient when initializing perturbation | |
| -tao_nls_pmax | double | Maximum perturbation when increasing value | |
| -tao_nls_pgfac | double | Growth factor applied to perturbation when increasing value | |
| -tao_nls_pmgfac | double | Factor applied to norm of gradient when increasing perturbation | |
| -tao_nls_pmin | double | Minimum perturbation when decreasing value; smaller values set to zero | |
| -tao_nls_psfac | double | Shrink factor applied to perturbation when decreasing value | |
| -tao_nls_pmsfac | double | Factor applied to norm of gradient when decreasing perturbation |
| Name | Value | Default | Description |
| -tao_nls_init_type | constant, direction, interpolation | interpolation | Method used to initialize trust-region radius when using stcg or gltr |
| -tao_nls_mu1_i | double | 0.35 | |
| -tao_nls_mu2_i | double | 0.50 | |
| -tao_nls_gamma1_i | double | 0.0625 | |
| -tao_nls_gamma2_i | double | 0.50 | |
| -tao_nls_gamma3_i | double | 2.00 | |
| -tao_nls_gamma4_i | double | 5.00 | |
| -tao_nls_theta_i | double | 0.25 | |
| -tao_nls_update_type | step, reduction, interpolation | step | Method used to update trust-region radius when using stcg or gltr |
| -tao_nls_nu1 | double | 0.25 | |
| -tao_nls_nu2 | double | 0.50 | |
| -tao_nls_nu3 | double | 1.00 | |
| -tao_nls_nu4 | double | 1.25 | |
| -tao_nls_omega1 | double | 0.25 | |
| -tao_nls_omega2 | double | 0.50 | |
| -tao_nls_omega3 | double | 1.00 | |
| -tao_nls_omega4 | double | 2.00 | |
| -tao_nls_omega5 | double | 4.00 | |
| -tao_nls_eta1 | double | ||
| -tao_nls_eta2 | double | 0.25 | |
| -tao_nls_eta3 | double | 0.50 | |
| -tao_nls_eta4 | double | 0.90 | |
| -tao_nls_alpha1 | double | 0.25 | |
| -tao_nls_alpha2 | double | 0.50 | |
| -tao_nls_alpha3 | double | 1.00 | |
| -tao_nls_alpha4 | double | 2.00 | |
| -tao_nls_alpha5 | double | 4.00 | |
| -tao_nls_mu1 | double | 0.10 | |
| -tao_nls_mu2 | double | 0.50 | |
| -tao_nls_gamma1 | double | 0.25 | |
| -tao_nls_gamma2 | double | 0.50 | |
| -tao_nls_gamma3 | double | 2.00 | |
| -tao_nls_gamma4 | double | 4.00 | |
| -tao_nls_theta | double | 0.05 |
The Newton trust-region method solves the constrained quadratic programming
problem
The quadratic optimization problem is approximately solved by applying
the Steihaug-Toint conjugate gradient method or generalized Lanczos
method to the symmetric system of equations
. The method
used to solve the system of equations is specified with the command line
argument -tao_ntr_ksp_type <stcg,gltr>; stcg is the default.
Internally, the PETSc implementations for the Steihaug-Toint method and the
generalized Lanczos method are used. See the PETSc manual for further
information on changing the behavior of these linear system solvers.
A good preconditioner reduces the number of iterations required to compute the direction. For the Steihaug-Toint conjugate gradient method and generalized Lanczos method, this preconditioner must be symmetric and positive definite. The available options are to use no preconditioner, the absolute value of the diagonal of the Hessian matrix, a limited-memory BFGS approximation to the Hessian matrix, or one of the other preconditioners provided by the PETSc package. These preconditioners are specified by the the command line argument -tao_ntr_pc_type <none,ahess,bfgs,petsc>, respectively. The default is the bfgs preconditioner. When the preconditioner type is set the to petsc, the preconditioner set with the PETSc -pc_type command line argument is used. For example, to use an incomplete Cholesky factorization for the preconditioner, one would use the command line arguments -tao_ntr_pc_type petsc -pc_type icc. See the PETSc manual for further information on changing the behavior of the preconditioners.
The choice of scaling matrix can have a significant impact on the quality of the Hessian approximation when using the bfgs preconditioner and affect the number of iterations required by the linear system solver. The choices for scaling matrices are the same as those discussed for the limited-memory, variable-metric algorithm. For Newton methods, however, the option exists to use a scaling matrix based on the true Hessian matrix. In particular, the implementation supports using the absolute value of the diagonal of the Hessian matrix. The scaling matrix to use with the bfgs preconditioner is set with the command line argument -tao_ntr_bfgs_scale_type <ahess,bfgs>; ahess is the default. The bfgs scaling matrix is derived from the BFGS options. The ahess scaling matrix is the absolute value of the diagonal of the Hessian matrix.
The method for computing an initial trust-region radius is set with the
command line argument -tao_ntr_init_type <constant,direction,interpolation>;
interpolation, which chooses an initial value based on the
interpolation scheme found in [5], is the default. This
scheme performs a number of function and gradient evaluations to determine
a radius such that the reduction predicted by the quadratic model along the
gradient direction coincides with the actual reduction in the nonlinear
function. The iterate obtaining the best objective function value is
used as the starting point for the main line-search algorithm. The
constant method initializes the trust-region radius by using
the value specified with the -tao_trust0 <double> command line
argument, where the default value is 100. The direction technique
solves the first quadratic optimization problem by using a standard
conjugate gradient method and initializes the trust-region to
.
Finally, the method for updating the trust-region radius is set with the
command line argument
-tao_ntr_update_type <reduction,interpolation>; reduction
is the default. The reduction method computes the ratio of the
actual reduction in the objective function to the reduction predicted
by the quadratic model for the full step,
, where
is the quadratic model. The radius is then updated as:
| Name | Value | Default | Description |
| -tao_ntr_ksp_type | stcg, gltr | stcg | Type of Krylov subspace method to use when solving linear system |
| -tao_ntr_pc_type | none, ahess, bfgs, petsc | bfgs | Type of preconditioner to use when solving linear system |
| -tao_ntr_bfgs_scale_type | ahess, bfgs | ahess | Type of scaling matrix to use with BFGS preconditioner |
| -tao_ntr_init_type | constant, direction, interpolation | interpolation | Method used to initialize trust-region radius |
| -tao_ntr_mu1_i | double | 0.35 | |
| -tao_ntr_mu2_i | double | 0.50 | |
| -tao_ntr_gamma1_i | double | 0.0625 | |
| -tao_ntr_gamma2_i | double | 0.50 | |
| -tao_ntr_gamma3_i | double | 2.00 | |
| -tao_ntr_gamma4_i | double | 5.00 | |
| -tao_ntr_theta_i | double | 0.25 | |
| -tao_ntr_update_type | reduction, interpolation | reduction | Method used to update trust-region radius |
| -tao_ntr_eta1 | double | ||
| -tao_ntr_eta2 | double | 0.25 | |
| -tao_ntr_eta3 | double | 0.50 | |
| -tao_ntr_eta4 | double | 0.90 | |
| -tao_ntr_alpha1 | double | 0.25 | |
| -tao_ntr_alpha2 | double | 0.50 | |
| -tao_ntr_alpha3 | double | 1.00 | |
| -tao_ntr_alpha4 | double | 2.00 | |
| -tao_ntr_alpha5 | double | 4.00 | |
| -tao_ntr_mu1 | double | 0.10 | |
| -tao_ntr_mu2 | double | 0.50 | |
| -tao_ntr_gamma1 | double | 0.25 | |
| -tao_ntr_gamma2 | double | 0.50 | |
| -tao_ntr_gamma3 | double | 2.00 | |
| -tao_ntr_gamma4 | double | 4.00 | |
| -tao_ntr_theta | double | 0.05 |
Bound constrained optimization algorithms
minimize
, subject to upper or
lower bounds on some of the variables.
These solvers also bounds on the variables as well as objective
function, gradient, and possibly Hessian information.
The TRON algorithm solves a reduced linear system defined by the rows and columns corresponding to the variables that lie between the upper and lower bounds. When running in parallel, these rows can either remain on their current processor or be redistributed evenly over all of the processors with the command TaoSelectSubset(). The TRON algorithm applies a trust region to the conjugate gradients to ensure convergence. The initial trust region can be set using the command TaoSetTrustRegionRadius() and the current trust region size can be found using the command TaoGetTrustRegionRadius(). The initial trust region can significantly alter the rate of convergence for the algorithm and should be tuned and adjusted for optimal performance.
This method is the bound constrained variant of the LMVM method for unconstrained optimization. It uses projected gradients to approximate the Hessian - eliminating the need for Hessian evaluations. The method can be set using TaoMethod tao_blmvm. The command TaoLMVMSetSize(), which sets the number of vectors to be used in the Hessian approximation, also applies to this method.
This method calculates points satisfying the first-order necessary optimality conditions. The method uses the mixed complementarity problem solvers from Section 4.3 to calculate the solutions. The choice of complementarity solver is specified with the runtime option -tao_kt_method with the default being the tao_ssils method.
Mixed complementarity problems, or box-constrained variational inequalities,
are related to nonlinear systems of equations. They are defined by a
continuously differentiable function,
, and bounds,
and
, on the variables such that
. Given this information,
is a solution to
MCP(
,
,
) if for each
we have at
least one of the following:

Simple complementarity conditions arise from the first-order optimality
conditions from optimization [16,17].
In the simple bound constrained optimization case, these conditions
correspond to MCP(
,
,
), where
is the objective function. In a one-dimensional setting these conditions
are intuitive. If the solution is at the lower bound, then the function must
be increasing and
. However, if the solution is at the
upper bound, then the function must be decreasing and
.
Finally, if the solution
is strictly between the bounds, we must be at a stationary point and
. Other complementarity problems arise in economics and
engineering [9], game
theory [23], and finance [14].
Evaluation routines for
and its Jacobian must be supplied prior
to solving the application.
The bounds,
, on the variables must also be
provided.
If no starting point is supplied, a default starting point of all zeros
is used.
TAO has two implementations of semismooth algorithms [22,7,8] for solving mixed complementarity problems. Both are based upon a reformulation of the mixed complementarity problem as a nonsmooth system of equations using the Fischer-Burmeister function [10]. A nonsmooth Newton method is applied to the reformulated system to calculate a solution. The theoretical properties of such methods are detailed in the aforementioned references.
The Fischer-Burmeister function,
, is defined as,

The two semismooth TAO solvers both solve the system
by applying
a nonsmooth newton method with a line-search. We calculate a direction,
,
by solving the system
where
is an element of the
-subdifferential [27] of
at
. If the
direction calculated does not satisfy a suitable descent condition, then
we use the negative gradient of the merit function,
, as
the search direction. A standard Armijo search [1] is
used to find the new iteration. Non-monotone searches
[11] are also available by setting
appropriate run-time options. See Section 6.2 for further
details.
The first semismooth algorithm available in TAO is not guaranteed to
remain feasible with respect to the bounds,
, and is termed
an infeasible semismooth method. This method can be specified using the
TaoMethod tao_ssils. In this case, the descent test used is
that