L1 Regularized Logistic Regression


1. Methods
Learn basis elements by L1 regularized logistic regression with MAX1 scores after sigmoid transformation. After the regression, each feature will get a regression coefficient. The coefficient describe the size of the contribution of that feature. A positive regression coefficient means the element increases the probability of the outcome, while a negative coefficient means the element decreases the probability of that outcome. In other words, the learning method divides the learned template to two groups. After dimension reduction by deleating the basises with 0 pooled MAX1 scores, the computation is fast for discriminative methods.

2. Classification Experiments
2.1 Dataset. Number of training negatives 160, testing negatives 160.  5 numbers of positive training examples (5, 10, 20, 40, 80) are tested. We tried 3 different number of tuning psrameters: C = 1, 10, and 100.

2.2 Results of testing AUC.
  L1 regularized Logistic regression performs better when training positive sample size increases. The tuning parameters didn't affect the result very much, but when the value was bigger, the regression performs better.



2.2 Results of number of learning features.
  After L1 Regularized Logistic Regression, some coefficients of the features will be set to 0. Therefore, the features with non-zero coefficients will be the features learned/selected by the method. The plot shows the average number of selected features(includes features with positive or negative coefficient). Compared with the 80 features learned by active basis, L1 Regularized Logistic Regression choose more features, and especially when learning with large training positives.

2.3 Results of number of learning features with positive coefficient.  Although features with positive coefficient(positive features) are usually less than features with negative coefficient(negative features), the ratio between each other didn't change much when the traning positives increase.  The features with positive coefficient(positive features) can be viewed as the sketch of the  object.


2.3 Compare templates.

Number of training positive is 80; number of elements is 80; Length of Gabor = 17 pixels; Local normalization or not = 1; Range of displacement = 3 pixels; Subsample rate = 2 pixel; Image height and width = 85 by 127 pixels

Active Basis:


Number of sketches in the above active basis templates are, respectively, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80

L1 regularized logistic regression: number of training negative is 160; C =100





Number of sketches in the above active basis templates are, respectively, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60


=========================================================================================================================

Number of training positive is 80; number of elements is 80; Length of Gabor = 17 pixels; Local normalization or not = 1; Range of displacement = 3 pixels; Subsample rate = 2 pixel; Image height and width = 85 by 127 pixels

Active Basis:


Number of sketches in the above active basis templates are, respectively, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80

L1 regularized logistic regression: number of training negative is 80; C =100





Number of sketches in the above active basis templates are, respectively, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65