smile.classification.AbstractClassifier<int[]>

smile.classification.Maxent

All Implemented Interfaces:: Serializable, ToDoubleFunction<int[]>, ToIntFunction<int[]>, Classifier<int[]>

Direct Known Subclasses:: Maxent.Binomial, Maxent.Multinomial

public abstract class Maxent extends AbstractClassifier<int[]>

Maximum Entropy Classifier. Maximum entropy is a technique for learning probability distributions from data. In maximum entropy models, the observed data itself is assumed to be the testable information. Maximum entropy models don't assume anything about the probability distribution other than what have been observed and always choose the most uniform distribution subject to the observed constraints.

Basically, maximum entropy classifier is another name of multinomial logistic regression applied to categorical independent variables, which are converted to binary dummy variables. Maximum entropy models are widely used in natural language processing. Here, we provide an implementation which assumes that binary features are stored in a sparse array, of which entries are the indices of nonzero features.

See Also:

Nested Class Summary

Nested Classes

Modifier and Type

Class

Description

static class

Maxent.Binomial

Binomial maximum entropy classifier.

static class

Maxent.Multinomial

Multinomial maximum entropy classifier.

Nested classes/interfaces inherited from interface smile.classification.Classifier
Classifier.Trainer<T,M extends Classifier<T>>
Field Summary

Fields inherited from class smile.classification.AbstractClassifier
classes
Constructor Summary

Constructors

Constructor

Description

Maxent(int p, double L, double lambda, IntSet labels)

Constructor.
Method Summary

Modifier and Type

Method

Description

double

AIC()

Returns the AIC score.

static Maxent.Binomial

binomial(int p, int[][] x, int[] y)

Fits maximum entropy classifier.

static Maxent.Binomial

binomial(int p, int[][] x, int[] y, double lambda, double tol, int maxIter)

Fits maximum entropy classifier.

static Maxent.Binomial

binomial(int p, int[][] x, int[] y, Properties params)

Fits maximum entropy classifier.

int

dimension()

Returns the dimension of input space.

static Maxent

fit(int p, int[][] x, int[] y)

Fits maximum entropy classifier.

static Maxent

fit(int p, int[][] x, int[] y, double lambda, double tol, int maxIter)

Fits maximum entropy classifier.

static Maxent

fit(int p, int[][] x, int[] y, Properties params)

Fits maximum entropy classifier.

double

getLearningRate()

Returns the learning rate of stochastic gradient descent.

double

loglikelihood()

Returns the log-likelihood of model.

static Maxent.Multinomial

multinomial(int p, int[][] x, int[] y)

Fits maximum entropy classifier.

static Maxent.Multinomial

multinomial(int p, int[][] x, int[] y, double lambda, double tol, int maxIter)

Fits maximum entropy classifier.

static Maxent.Multinomial

multinomial(int p, int[][] x, int[] y, Properties params)

Fits maximum entropy classifier.

boolean

online()

Returns true if this is an online learner.

void

setLearningRate(double rate)

Sets the learning rate of stochastic gradient descent.

boolean

soft()

Returns true if this is a soft classifier that can estimate the posteriori probabilities of classification.

Methods inherited from class smile.classification.AbstractClassifier
classes, numClasses

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

Methods inherited from interface smile.classification.Classifier
applyAsDouble, applyAsInt, predict, predict, predict, predict, predict, predict, predict, predict, score, update, update, update

Constructor Details
- Maxent
  
  public Maxent(int p, double L, double lambda, IntSet labels)
  
  Constructor.
  
  Parameters:
  
  p - the dimension of input data.
  
  L - the log-likelihood of learned model.
  
  lambda - lambda > 0 gives a "regularized" estimate of linear weights which often has superior generalization performance, especially when the dimensionality is high.
  
  labels - the class label encoder.
Method Details
- fit
  
  public static Maxent fit(int p, int[][] x, int[] y)
  
  Fits maximum entropy classifier.
  
  Parameters:
  
  p - the dimension of feature space.
  
  x - training samples. Each sample is represented by a set of sparse binary features. The features are stored in an integer array, of which are the indices of nonzero features.
  
  y - training labels in [0, k), where k is the number of classes.
  
  Returns:
  
  the model.
- fit
  
  public static Maxent fit(int p, int[][] x, int[] y, Properties params)
  
  Fits maximum entropy classifier.
  
  Parameters:
  
  p - the dimension of feature space.
  
  x - training samples. Each sample is represented by a set of sparse binary features. The features are stored in an integer array, of which are the indices of nonzero features.
  
  y - training labels in [0, k), where k is the number of classes.
  
  params - the hyperparameters.
  
  Returns:
  
  the model.
- fit
  
  public static Maxent fit(int p, int[][] x, int[] y, double lambda, double tol, int maxIter)
  
  Fits maximum entropy classifier.
  
  Parameters:
  
  p - the dimension of feature space.
  
  x - training samples. Each sample is represented by a set of sparse binary features. The features are stored in an integer array, of which are the indices of nonzero features.
  
  y - training labels in [0, k), where k is the number of classes.
  
  lambda - lambda > 0 gives a "regularized" estimate of linear weights which often has superior generalization performance, especially when the dimensionality is high.
  
  tol - the tolerance for stopping iterations.
  
  maxIter - maximum number of iterations.
  
  Returns:
  
  the model.
- binomial
  
  public static Maxent.Binomial binomial(int p, int[][] x, int[] y)
  
  Fits maximum entropy classifier.
  
  Parameters:
  
  p - the dimension of feature space.
  
  x - training samples. Each sample is represented by a set of sparse binary features. The features are stored in an integer array, of which are the indices of nonzero features.
  
  y - training labels in [0, k), where k is the number of classes.
  
  Returns:
  
  the model.
- binomial
  
  public static Maxent.Binomial binomial(int p, int[][] x, int[] y, Properties params)
  
  Fits maximum entropy classifier.
  
  Parameters:
  
  p - the dimension of feature space.
  
  x - training samples. Each sample is represented by a set of sparse binary features. The features are stored in an integer array, of which are the indices of nonzero features.
  
  y - training labels in [0, k), where k is the number of classes.
  
  params - the hyperparameters.
  
  Returns:
  
  the model.
- binomial
  
  public static Maxent.Binomial binomial(int p, int[][] x, int[] y, double lambda, double tol, int maxIter)
  
  Fits maximum entropy classifier.
  
  Parameters:
  
  p - the dimension of feature space.
  
  x - training samples. Each sample is represented by a set of sparse binary features. The features are stored in an integer array, of which are the indices of nonzero features.
  
  y - training labels in [0, k), where k is the number of classes.
  
  lambda - lambda > 0 gives a "regularized" estimate of linear weights which often has superior generalization performance, especially when the dimensionality is high.
  
  tol - the tolerance for stopping iterations.
  
  maxIter - maximum number of iterations.
  
  Returns:
  
  the model.
- multinomial
  
  public static Maxent.Multinomial multinomial(int p, int[][] x, int[] y)
  
  Fits maximum entropy classifier.
  
  Parameters:
  
  p - the dimension of feature space.
  
  x - training samples. Each sample is represented by a set of sparse binary features. The features are stored in an integer array, of which are the indices of nonzero features.
  
  y - training labels in [0, k), where k is the number of classes.
  
  Returns:
  
  the model.
- multinomial
  
  public static Maxent.Multinomial multinomial(int p, int[][] x, int[] y, Properties params)
  
  Fits maximum entropy classifier.
  
  Parameters:
  
  p - the dimension of feature space.
  
  x - training samples. Each sample is represented by a set of sparse binary features. The features are stored in an integer array, of which are the indices of nonzero features.
  
  y - training labels in [0, k), where k is the number of classes.
  
  params - the hyperparameters.
  
  Returns:
  
  the model.
- multinomial
  
  public static Maxent.Multinomial multinomial(int p, int[][] x, int[] y, double lambda, double tol, int maxIter)
  
  Fits maximum entropy classifier.
  
  Parameters:
  
  p - the dimension of feature space.
  
  x - training samples. Each sample is represented by a set of sparse binary features. The features are stored in an integer array, of which are the indices of nonzero features.
  
  y - training labels in [0, k), where k is the number of classes.
  
  lambda - lambda > 0 gives a "regularized" estimate of linear weights which often has superior generalization performance, especially when the dimensionality is high.
  
  tol - the tolerance for stopping iterations.
  
  maxIter - maximum number of iterations.
  
  Returns:
  
  the model.
- dimension
  
  public int dimension()
  
  Returns the dimension of input space.
  
  Returns:
  
  the dimension of input space.
- soft
  
  public boolean soft()
  
  Description copied from interface: Classifier
  
  Returns true if this is a soft classifier that can estimate the posteriori probabilities of classification.
  
  Returns:
  
  true if soft classifier.
- online
  
  public boolean online()
  
  Description copied from interface: Classifier
  
  Returns true if this is an online learner.
  
  Returns:
  
  true if online learner.
- setLearningRate
  
  public void setLearningRate(double rate)
  
  Sets the learning rate of stochastic gradient descent. It is a good practice to adapt the learning rate for different data sizes. For example, it is typical to set the learning rate to eta/n, where eta is in [0.1, 0.3] and n is the size of the training data.
  
  Parameters:
  
  rate - the learning rate.
- getLearningRate
  
  public double getLearningRate()
  
  Returns the learning rate of stochastic gradient descent.
  
  Returns:
  
  the learning rate of stochastic gradient descent.
- loglikelihood
  
  public double loglikelihood()
  
  Returns the log-likelihood of model.
  
  Returns:
  
  the log-likelihood of model.
- AIC
  
  public double AIC()
  
  Returns the AIC score.
  
  Returns:
  
  the AIC score.

Class Maxent

References

Nested Class Summary

Nested classes/interfaces inherited from interface smile.classification.Classifier

Field Summary

Fields inherited from class smile.classification.AbstractClassifier

Constructor Summary

Method Summary

Methods inherited from class smile.classification.AbstractClassifier

Methods inherited from class java.lang.Object

Methods inherited from interface smile.classification.Classifier

Constructor Details

Maxent

Method Details

fit

fit

fit

binomial

binomial

binomial

multinomial

multinomial

multinomial

dimension

soft

online

setLearningRate

getLearningRate

loglikelihood

AIC