Degrees of Freedom for RBF Neural Network Design

  • Follow


I am using a MATLAB routine (NEWRB) for designing a particular type
of  Radial Basis Function Neural Network (RBFNN) regression model.
With this type of model each of the O outputs is a linear combination
of H Gaussian functions with equal scalar covariance matrices (i.e., a
single variance times the unit matrix). The squared distance in each
Gaussian is that between the current input vector of dimension I and
the
I-dimensional Gaussian center.

A more general type of RBFNN would use a different variance for each
Gaussian. A typical general design would use N input/output-target
pairs to obtain Neq = N*O equations which would be solved by
minimizing the mean squared error (MSE) between the O design output
target values and the RBFNN output. The unknown parameters to be
estimated are:

 H        variances
 H*I     Gaussian center compponents
 H*O   Gaussian to output weights
O         output bias weights

The total number is

Np = (I+1)*H + (H+1)*O  in general
Np = 1 + I*H + (H+1)*O  equal variances

The MSE is is obtained by dividing the sum squared error (SSE) by the
number of equations.

MSE = SSE/(N*O)

However, this error estimate is biased because it is obtained by using
the design data. One approach is to "adjust" the estimate by replacing
N by the N-Np the degrees of freedom available after N meaurements are
used to estimate Np parameters:

MSEa = SSE/(O*(N-Np))

The MATLAB design contains constraints whos effect I hope to quantify
by using Npeff, an "effective" number of parameters. Then the
estimation bias can be mitigated by using

MSEa = SSE/(O*(N-Npeff))

The MATLAB design:

1. Specified:
     a.  Single variance
     b.  Maximum allowable number of Gaussians, Hmax
     c.  SSEgoal
2. Starts with one Gaussian (H=1) center equal to the first
     input vector
3. Sequentially chooses additional centers by choosing the
    input vector yielding the highest output error (H = H+1).
4. Stops when either SSE <= SSEgoal and/or H = Hmax

My questions:

1. How is the I*H term in Np modified when each center component is
not determined independently. Instead, it is one of the I components
of a chosen input vector?
2. Without changing the algorithm or using nondesign data, is there a
better way to estimate an unbiased MSE?

Thanks in advance.

Greg
0
Reply heath (3875) 8/21/2011 5:54:24 AM


0 Replies
26 Views

(page loaded in 0.072 seconds)


Reply: