Wizard for creating a Machine Learning Job

When we create a Machine Learning Job, we must know what we want it for and what (base) model we are going to use.

These means that is recommended that we create a Job for each specific task. for example, a job to train/ infer for document data extraction (ForTokenClassification),

another job to train/ infer for document classification (ForSequenceClassification) and so on.

In this example we are going to create a Job that is intended to fine-tune a new model for data extraction and infer from it.

1. Create a Machine Learning Job

On the main window of ChronoScan "Scan/ Input" tab > click "New Batch" > "Job Wizard" > enter desired Job name and click "Next".
On the next step select the "Generate an app for Machine Learning"

Now we can configure our Machine Learning Job in the following dialog:

Wizard for creating a Machine Learning Job

Base model

Select the pre-trained base model here

Top

Architecture

Select the desired architecture here
* ForTokenClassification (Data extraction)

* ForSequenceClassification (Document classification)

Top

Processor model

This is a list with already Fined-Tuned and saved models from ChronoScan trainings.
If we select a model from here, it means we are going to train over an already trained model.
In that case the below grid will populate with the entities (labels) contained in the models configuration and we can choose from them.
If we want to train from the base model, we leave this empty, and will create our labels from job fields, e-tags or manual annotations.

* Leave empty for training from the pre_trained base models (microsoft/layoutlmv3-base, -large...).

* Select a saved pre-trained model here to train from it.

Top

Name of the Fine-Tuned model
resulting from training

This is the resulting model from our training.
When the trainings are done, they get saved under this models name path.
Then we can activate the saved trained model that we desire, and in that moment they become a processor model, which is a model that can be used to infer and can also be a model to train from.

Top

List of entities (labels)

List of entities (labels) that an already trained model has configured.

Top

Once we configure our Job, we click OK and then we can create batches.