You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
or the log-linear classifier. In this model, the probabilities describing the possible outcomes of a single trial are modeled using a `logistic function <http://en.wikipedia.org/wiki/Logistic_function>`_.
686
+
logit regression, maximum-entropy classification (MaxEnt) or the log-linear
687
+
classifier. In this model, the probabilities describing the possible outcomes
688
+
of a single trial are modeled using a `logistic function
For large dataset, you may also consider using :class:`SGDClassifier` with 'log' loss.
745
743
746
744
.. topic:: Examples:
@@ -770,18 +768,19 @@ For large dataset, you may also consider using :class:`SGDClassifier` with 'log'
770
768
thus be used to perform feature selection, as detailed in
771
769
:ref:`l1_feature_selection`.
772
770
773
-
:class:`LogisticRegressionCV` implements Logistic Regression with
774
-
builtin cross-validation to find out the optimal C parameter.
775
-
"newton-cg", "sag" and "lbfgs" solvers are found to be faster
776
-
for high-dimensional dense data, due to warm-starting.
777
-
For the multiclass case, if `multi_class`
778
-
option is set to "ovr", an optimal C is obtained for each class and if
779
-
the `multi_class` option is set to "multinomial", an optimal C is
780
-
obtained that minimizes the cross-entropy loss.
771
+
:class:`LogisticRegressionCV` implements Logistic Regression with builtin
772
+
cross-validation to find out the optimal C parameter. "newton-cg", "sag" and
773
+
"lbfgs" solvers are found to be faster for high-dimensional dense data, due to
774
+
warm-starting. For the multiclass case, if `multi_class` option is set to
775
+
"ovr", an optimal C is obtained for each class and if the `multi_class` option
776
+
is set to "multinomial", an optimal C is obtained by minimizing the cross-
777
+
entropy loss.
781
778
782
779
.. topic:: References:
783
780
784
-
.. [3] Mark Schmidt, Nicolas Le Roux, and Francis Bach: `Minimizing Finite Sums with the Stochastic Average Gradient. <http://hal.inria.fr/hal-00860051/PDF/sag_journal.pdf>`_
781
+
.. [3] Christopher M. Bishop: Pattern Recognition and Machine Learning, Chapter 4.3.4
782
+
783
+
.. [4] Mark Schmidt, Nicolas Le Roux, and Francis Bach: `Minimizing Finite Sums with the Stochastic Average Gradient. <http://hal.inria.fr/hal-00860051/PDF/sag_journal.pdf>`_
0 commit comments