Defining the training set and training the sentiment analysis model

In the Sample construction step, split the data into the set that is used to train the model and the set that is used to test the model's accuracy.

  1. If you want to keep the split between the training and testing data as defined in the file that you uploaded, in the Construct training and test sets using field, select User-defined sampling based on "Type" column.
  2. If you want to ignore the split that is defined in the file and customize that split according to your business needs, perform the following actions:
    1. Select Uniform sampling.
    2. In the Training set field, specify the percentage of records that is randomly assigned to the training sample.
  3. Click Next.
  4. In the Model creation step, make sure that the Maximum Entropy check box is selected.
  5. Click Next.
    The model training and testing process starts.