The unconstrained optimization problem is central to the development of optimization
software. Constrained optimization algorithms are often extensions of unconstrained
algorithms, while nonlinear least squares and nonlinear equation algorithms tend to be
specializations. In the unconstrained optimization problem, we seek a local
minimizer of a real-valued function, f(x), where x is a vector of
real variables. In other words, we
seek a vector, x*, such that
f(x*) <= f(x) for all x close to x*.
Global optimization algorithms try to find an x* that minimizes f over all possible vectors x. This is a much harder problem to solve. We do not discuss it here because, at present, no efficient algorithm is known for performing this task. For many applications, local minima are good enough, particularly when the user can draw on his/her own experience and provide a good starting point for the algorithm.
Newton's method gives rise to a wide and important class of algorithms that require
computation of the gradient vector

and the Hessian matrix,
![]()
Although the computation or approximation of the Hessian can be a time-consuming
operation, there are many problems for which this computation is justified. We describe
algorithms in which the user supplies the Hessian explicitly before moving on to a
discussion of algorithms that don't require the Hessian.
Newton's method forms a quadratic model of the objective function around the current
iterate
. The model function is
defined by
.
In the basic Newton method, the next iterate is obtained from the minimizer of
. When the Hessian matrix,
, is positive definite, the quadratic
model has a unique minimizer that can be obtained by solving the symmetric
linear system:
The next iterate is then
Convergence is guaranteed if the starting point is sufficiently close to a local minimizer x* at which the Hessian is positive definite. Moreover, the rate of convergence is quadratic, that is,
![]()
for some positive constant
.
In most circumstances, however, the basic Newton method has to be modified to achieve convergence.
Versions of Newton's method are implemented in the following software packages:
BTN, GAUSS, IMSL, LANCELOT, NAG, OPTIMA, PORT 3, PROC NLP, TENMIN, TN, TNPACK, UNCMIN, and VE08.
The NEOS Server also has an unconstrained minimization facility to solve these problems remotely over the Internet.
These codes obtain convergence when the starting point is not close to a minimizer by using either a line-search or a trust-region approach.
Line-search and trust-region techniques are suitable if the number of variables
is not too
large, because the cost per iteration is of order
. Codes for problems with a large number of variables
tend to use truncated Newton methods , which usually
settle for an approximate minimizer of the quadratic model.
So far, we have assumed that the Hessian matrix is available, but the algorithms are unchanged if the Hessian matrix is replaced by a reasonable approximation. Two kinds of methods use approximate Hessians in place of the real thing:
Finally, we mention two other approaches for unconstrained problems that are not so closely related to Newton's method:
Up To:
Down To:
Nondifferentiable Optimization
![]()
[ OTC Home Page | NEOS Guide | NEOS Server | Optimization Tree ]
Updated 28 March 1996