• Examples

Very Simple
2D Classification (neural network)
Classification - standard algorithms
Back-Propagation vs Cascade-Correlation
Time-sequence prediction
QSVD-ICA preprocessing

Real-life applications:
nt charge-current interactions selection (ICARUS experiment)
aLL estimation (COMPASS experiment)

QSVD-ICA preprocessing (Description still in preparation...)

Download example (...kB) - project and data files.

Short introduction:

QSVD (
quotient singular value decomposition) and ICA (independent component analysis) are techniques used for input data preprocessing. They give a possibility of better representation of data (network may learn more easily on transformed data). Also dimmensionality of data may be reduced by recognizing and eliminating noisy components. This example shows how to take advantages of these transformation.

QSVD
ICA:
   - part I
   - part II

QSVD:

This is an ortoghonal transformation from the original space to the space where axes point the directions of maximum values of
ssig/sbkg (where ssig and sbkg are the standard deviations of classes being separated, so each event should be accompanied with information about the class it belongs to). In other words this is just a rotation that shows the data from the most interesting angle.

Transformation matrix is calculated as:

A = UT,

where U is taken from SVD decomposition:

Cbkg-1Csig = USVT,

where Cbkg and Csig are the covariance matrices of background and signal data respectively;

transformed event feature vectors are calculated as:

y = Ax.

Rows of the matrix A are sorted so less informative rows with corresponding singular values close to 1.0 are placed at the lower rows. Calculated singular values (that are also values of ssig/sbkg) may be displayed through the menu Edit/Project Tree where the Transform block node should be expanded. Yellow marked values placed at the end of the list correspond to the rows of matrix A that are safe to remove (more). If value is on a red label - you have already excluded corresponding row from calculations.
Example of basic operations with QSVD transformation is contained in
svd_ica_basics.NetPrj project file. Signal and background classes in this example are 3D, but only 2 dimmensions are class-specific. Third dimmension is noise generated in the same way for both classes. Directions of axes of the original feature space has been chosen randomly. Scatter plots of each combination of feature vector components are on the images below. We will try to discover 2 informative dimmensions with QSVD transformation.





Push Go! button in the Setup dialog window of the QSVD transform block. This will release calculations of transformation matrix. Then events from the 3D input space block are transformed into 2D events and sent to the 2D reduced space data block. Directions of extreme values of ssig/sbkg are discovered properly and one redundant dimmension is removed. Check the QSVD/mode node in the Project Tree - there are two singular values (sValues) corredsponding to transformed space axes and one sValue on a red label (ssig/sbkg=1.0786). The image below shows events in the transformed sapace.

--top--   --up--

ICA

In many cases simple rotation is not enough to find an "interesting" point of view. ICA may be more helpful in these situations. It is a linear, but non-ortoghonal transformation, where directions of most non-gaussian data probability distribution functions (
pdf) are searched for. It is assumed that the more gaussian is the pdf, the more likeli it is just noise in this direction. Additionaly, pdf-s in each direction are uncorrelated to each other (more).
Following examples show how to separate mixed signals (part I) and how to expose structures that may not be visible in a original space (part II).

ICA - part I

This is most often published example of ICA application. Its goal is to separate four source signals basing on four different linear combinations of them. Example is contained in
svd_ica_basics.NetPrj project file. Mixed signals that are the input to the transformation are presented in the images below. Also the histograms of each of these combinations are shown - both, the signals and theirs histograms look similar, like a noise with gaussian distribution. ICA requires that input data has to be centered and normalized. This condition is met (not precisely, but enough for the computations in this case...) - values of m and s are shown in the legend in the image with histograms.









text









--top--   --up--

ICA - part II

text



text

--top--   --up--