Neural Network Vectorization

Revision as of 18:42, 29 April 2011 (view source)

Ang (Talk | contribs)

Revision as of 18:56, 29 April 2011 (view source)

Ang (Talk | contribs)

Line 117:

== Sparse autoencoder ==

-

The [~~http://ufldl.stanford.edu/wiki/index.php/~~Autoencoders_and_Sparsity sparse autoencoder] neural network has an additional sparsity penalty that constrains neurons' average firing rate to be close to some target activation <math>\rho</math>. ~~We take~~ into the account the sparsity penalty by computing the following:

+

The [[Autoencoders_and_Sparsity|sparse autoencoder]] neural network has an additional sparsity penalty that constrains neurons' average firing rate to be close to some target activation <math>\rho</math>. When performing backpropagation on a single training example, we took the sparsity penalty into account by computing the following:

:<math>\begin{align}

Line 125:

\end{align}</math>

-

In the ''unvectorized'' case, this is computed as:

+

In the ''unvectorized'' case, this was computed as:

Line 137:

</syntaxhighlight>

-

~~Recall~~ that ~~when we vectorizing~~ the ~~gradient computations~~, <tt>delta2</tt> is now a matrix with <math>m</math> columns corresponding to the <math>m</math> training examples. ~~Furthermore~~, notice that the <tt>sparsity_delta</tt> term is the same regardless of ~~the~~ example we are processing. This suggests that vectorizing the computation above can be done by simply adding the same value to to each column when constructing the <tt>delta2</tt> matrix. Thus, to vectorize the above ~~computations~~, we can simply add <tt>sparsity_delta</tt> (e.g., using <tt>repmat</tt>) to <tt>delta2</tt>.

+

The code above still had a <tt>for</tt> loop over the training set, and <tt>delta2</tt> was a column vector.

+

In contrast, recall that in the vectorized case, <tt>delta2</tt> is now a matrix with <math>m</math> columns corresponding to the <math>m</math> training examples. Now, notice that the <tt>sparsity_delta</tt> term is the same regardless of which example we are processing. This suggests that vectorizing the computation above can be done by simply adding the same value to each column when constructing the <tt>delta2</tt> matrix. Thus, to vectorize the above computation, we can simply add <tt>sparsity_delta</tt> (e.g., using <tt>repmat</tt>) to each column of <tt>delta2</tt>.

From Ufldl


@@ Line 117: / Line 117: @@
 == Sparse autoencoder ==
-The [http://ufldl.stanford.edu/wiki/index.php/Autoencoders_and_Sparsity sparse autoencoder] neural network has an additional sparsity penalty that constrains neurons' average firing rate to be close to some target activation <math>\rho</math>. We take into the account the sparsity penalty by computing the following:
+The [[Autoencoders_and_Sparsity|sparse autoencoder]] neural network has an additional sparsity penalty that constrains neurons' average firing rate to be close to some target activation <math>\rho</math>.  When performing backpropagation on a single training example, we took the sparsity penalty into account by computing the following:
 :<math>\begin{align}
@@ Line 125: / Line 125: @@
 \end{align}</math>
-In the ''unvectorized'' case, this is computed as:
+In the ''unvectorized'' case, this was computed as:
 <syntaxhighlight>
@@ Line 137: / Line 137: @@
 </syntaxhighlight>
-Recall that when we vectorizing the gradient computations, <tt>delta2</tt> is now a matrix with <math>m</math> columns corresponding to the <math>m</math> training examples.  Furthermore, notice that the <tt>sparsity_delta</tt> term is the same regardless of the example we are processing.  This suggests that vectorizing the computation above can be done by simply adding the same value to to each column when constructing the <tt>delta2</tt> matrix. Thus, to vectorize the above computations, we can simply add <tt>sparsity_delta</tt> (e.g., using <tt>repmat</tt>) to <tt>delta2</tt>.
+The code above still had a <tt>for</tt> loop over the training set, and <tt>delta2</tt> was a column vector.
+In contrast, recall that in the vectorized case, <tt>delta2</tt> is now a matrix with <math>m</math> columns corresponding to the <math>m</math> training examples.  Now, notice that the <tt>sparsity_delta</tt> term is the same regardless of which example we are processing.  This suggests that vectorizing the computation above can be done by simply adding the same value to each column when constructing the <tt>delta2</tt> matrix. Thus, to vectorize the above computation, we can simply add <tt>sparsity_delta</tt> (e.g., using <tt>repmat</tt>) to each column of <tt>delta2</tt>.
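
The vectorization described in the revised paragraph can be illustrated with a minimal NumPy sketch. It is a translation for illustration, not the wiki's own code. Only <tt>delta2</tt> and <tt>sparsity_delta</tt> come from the text; the names <tt>W2</tt>, <tt>delta3</tt>, <tt>a2</tt>, <tt>beta</tt>, <tt>rho</tt> and <tt>rho_hat</tt>, the sigmoid activation, and the random test data are assumptions. <tt>np.tile</tt> stands in for <tt>repmat</tt>.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def delta2_loop(W2, delta3, a2, beta, sparsity_delta):
    """Unvectorized: a for loop over examples, one column vector at a time."""
    hidden, m = a2.shape
    delta2 = np.empty((hidden, m))
    for i in range(m):
        # Derivative of the (assumed) sigmoid activation for example i.
        fprime = a2[:, i] * (1.0 - a2[:, i])
        delta2[:, i] = (W2.T @ delta3[:, i] + beta * sparsity_delta) * fprime
    return delta2


def delta2_vectorized(W2, delta3, a2, beta, sparsity_delta):
    """Vectorized: delta2 is a matrix with one column per training example."""
    m = a2.shape[1]
    # Like repmat: copy the sparsity_delta column vector m times.
    sparsity_term = np.tile(sparsity_delta[:, None], (1, m))
    return (W2.T @ delta3 + beta * sparsity_term) * (a2 * (1.0 - a2))


# Hypothetical sizes and data, chosen only to exercise the two versions.
rng = np.random.default_rng(0)
hidden, output, m = 3, 4, 5
W2 = rng.standard_normal((output, hidden))
delta3 = rng.standard_normal((output, m))
a2 = sigmoid(rng.standard_normal((hidden, m)))

rho, beta = 0.05, 3.0
rho_hat = a2.mean(axis=1)  # average activation of each hidden unit
sparsity_delta = -rho / rho_hat + (1.0 - rho) / (1.0 - rho_hat)

loop_result = delta2_loop(W2, delta3, a2, beta, sparsity_delta)
vec_result = delta2_vectorized(W2, delta3, a2, beta, sparsity_delta)
print(np.allclose(loop_result, vec_result))
```

In NumPy the explicit tiling is optional, because broadcasting <tt>sparsity_delta[:, None]</tt> against the matrix adds the same column to every example. The tiled form is kept here because it mirrors the <tt>repmat</tt> approach in the text.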