ML.NET Tutorial - Get started in 10 minutes

Intro

Purpose

Use ML.NET Model Builder in Visual Studio to train and use your first machine learning model with ML.NET.

Prerequisites

None.

Time to Complete

10 minutes + download/installation time

Scenario

An app that can predict whether the text from customer reviews is negative or positive sentiment.

Download and install

Download and install Visual Studio 2022.

Download Visual Studio 2022

During installation, the .NET desktop development workload should be selected along with the optional ML.NET Model Builder component. Using the link above should preselect all the prerequisites correctly, as shown on the following image:

Already have Visual Studio 2022?

If you already have Visual Studio 2022, ensure it's up to date and has the required workload installed:

Select the Windows key, type Visual Studio Installer, and press Enter.
If prompted, allow the installer to update itself.
If an update for Visual Studio 2022 is available, an Update button will be shown. Select it to update before modifying the installation. We recommend using the latest Visual Studio 2022 version for this tutorial.
Find your Visual Studio 2022 installation and select Modify.
Select .NET desktop development and make sure ML.NET Model Builder is selected on the right pane. Select the Modify button.

Upgrade to the latest version of Model Builder

Once you've enabled ML.NET Model Builder in Visual Studio, download and install the latest version.

Download the latest version of Model Builder

After downloading, install the extension by double clicking the .vsix file.

Check for Visual Studio updates

This tutorial is optimized for the latest version of Visual Studio. If you already have Visual Studio 2022, you can check for updates:

Select the Windows key, type Visual Studio Installer, and press Enter.
If prompted, allow the installer to update itself.
If an update is available, your Visual Studio 2022 installation will have an Update button. Select it to update.

Install .NET SDK

To build .NET apps, you need to download and install the .NET 8 SDK (Software Development Kit).

Install ML.NET CLI

The ML.NET command-line interface (CLI), provides tools for building machine learning models with ML.NET.

Note: Currently, ML.NET CLI is in Preview and only supports the latest LTS version of the .NET SDK (.NET 8).

FOR x64 MACHINES - Run the following command:

Note: If you're using a console other than Bash (for example, zsh, which is the new default for macOS), then you'll need to give mlnet executable permissions and include mlnet to the system path. Instructions on how to do this should appear in the terminal when you install mlnet (or any global tool). In general, the following command should work for most systems: chmod +x [PATH-TO-MLNET-CLI-EXECUTABLE]

Alternatively, you can try using the following command to run the mlnet tool:

Command prompt

~/.dotnet/tools/mlnet

If the command still gives you an error, use the I ran into an issue button below to report the issue and get help fixing the problem.

Create your app

Open Visual Studio and create a new .NET console app:

Select Create a new project from the Visual Studio 2022 start window.
Select the C# Console App project template.

Change the project name to myMLApp.
Make sure Place solution and project in the same directory is unchecked.

Select the Next button.
Select .NET 8.0 (Long Term support) as the Framework.
Select the Create button. Visual Studio creates your project and loads the Program.cs file.

Add machine learning

Right-click on the myMLApp project in Solution Explorer and select Add > Machine Learning Model.
In the Add New Item dialog, make sure Machine Learning Model (ML.NET) is selected.
Change the Name field to SentimentModel.mbconfig and select the Add button.

A new file named SentimentModel.mbconfig is added to your solution and the Model Builder UI opens in a new docked tool window in Visual Studio. The mbconfig file is simply a JSON file that keeps track of the state of the UI.

Model Builder will guide you through the process of building a machine learning model in the following steps.

Pick a scenario

To generate your model, you first need to select your machine learning scenario. Model Builder supports several scenarios:

Note: If the tutorial screenshots don't match with what you see, you may need to update your version of Model Builder. Go to Extensions > Manage Extensions to make sure that there are no available updates for Model Builder. The version used in this tutorial is 17.18.2.

In this case, you'll predict sentiment based on the content (text) of customer reviews.

In the Model Builder Scenario screen, select the Data classification scenario, since you're predicting which category a comment falls into (positive or negative).
After selecting the Data classification scenario, you must choose your training environment. While some scenarios support training in Azure, Classification currently only supports local training, so keep the Local environment selected and move on to the Data step.

Download and add data

Download the Sentiment Labelled Sentences datasets from the UCI Machine Learning Repository. Unzip sentiment labelled sentences.zip and save the yelp_labelled.txt file to the myMLApp directory.

Your Solution Explorer should look like the following:

Each row in yelp_labelled.txt represents a different review of a restaurant left by a user on Yelp. The first column represents the comment left by the user, and the second column represents the sentiment of the text (0 is negative, 1 is positive). The columns are separated by tabs, and the dataset has no header. The data looks like the following:

yelp_labelled.txt

Wow... Loved this place.	        1
Crust is not good.	        0
Not tasty and the texture was just nasty.	        0

Add data

In Model Builder, you can add data from a local file or connect to a SQL Server database. In this case, you'll add yelp_labelled.txt from a file.

Select File as the input data source type.
Browse for yelp_labelled.txt. Once you select your dataset, a preview of your data appears in the Data Preview section. Since your dataset does not have a header, headers are auto-generated ("col0" and "col1").
Under Column to predict (Label), select "col1". The Label is what you're predicting, which in this case is the sentiment found in the second column ("col1") of the dataset.
The columns that are used to help predict the Label are called Features. All of the columns in the dataset besides the Label are automatically selected as Features. In this case, the review comment column ("col0") is the Feature column. You can update the Feature columns and modify other data loading options in Advanced data options, but it is not necessary for this example.

After adding your data, go to the Train step.

Train your model

Now, you'll train your model with the yelp_labelled.txt dataset.

Model Builder evaluates many models with varying algorithms and settings based on the amount of training time given to build the best performing model.

Change the Time to train, which is the amount of time you'd like Model Builder to explore various models, to 60 seconds (you can try increasing this number if no models are found after training) . Note that for larger datasets, the training time will be longer. Model Builder automatically adjusts the training time based on the dataset size.
You can update the optimization metric and algorithms used in Advanced training options, but it is not necessary for this example.
Select Start training to start the training process. Once training starts, you can see the time remaining.

Training results

Once training is done, you can see a summary of the training results.

Best MacroAccuracy - This shows you the accuracy of the best model that Model Builder found. Higher accuracy means the model predicted more correctly on test data.
Best model - This shows you which algorithm performed the best during Model Builder's exploration.
Training time - This shows you the total amount of time that was spent training / exploring models.
Models explored (total) - This shows you the total number of models explored by Model Builder in the given amount of time.
Generated code-behind - This shows you the names of the files generated to help consume the model or train a new model.

If you want, you can view more information about the training session in the Machine Learning Output window.

After model training finishes, go to the Evaluate step.

In your terminal, run the following command (in your myMLApp folder):

Command prompt

mlnet classification --dataset "yelp_labelled.txt" --label-col 1 --has-header false --name SentimentModel  --train-time 60

What do these commands mean?

The mlnet classification command runs ML.NET with AutoML to explore many iterations of classification models in the given amount of train time with varying combinations of data transformations, algorithms, and algorithm options and then chooses the highest performing model.

--dataset: You chose yelp_labelled.txt as the dataset (internally, the CLI will split the one dataset into training and testing datasets).
--label-col: You must specify the target column you want to predict (or the Label). In this case, you want to predict the sentiment in the second column (zero-indexed columns means this is column "1").
--has-header: Use this option to specify if the dataset has a header. In this case, the dataset doesn't have a header, so it's false.
--name: Use this option to provide a name for your machine learning model and related assets. In this case, all assets associated with this machine learning model will have SentimentModel in the name.
--train-time: You must also specify the amount of time you'd like the ML.NET CLI to explore different models. In this case, 60 seconds (you can try increasing this number if no models are found after training). Note that for larger datasets, you should set a longer training time.

Progress

While the ML.NET CLI is exploring different models, it displays the following data:

Start training - This section shows each model iteration, including the trainer (algorithm) used and evaluation metrics for that iteration.
Time left - This and the progress bar will indicate how much time is left in the training process in seconds.
Best algorithm - This shows you which algorithm has performed the best so far.
Best score - This shows you the performance of the best model so far. Higher accuracy means the model predicted more correctly on test data.

If you want, you can view more information about the training session in the log file generated by the CLI.

Evaluate your model

The Evaluate step shows you the best-performing algorithm and the best accuracy and lets you try out the model in the UI.

Try out your model

You can make predictions on sample input in the Try your model section. The textbox is pre-filled with the first line of data from your dataset, but you can change the input and select the Predict button to try out different sentiment predictions.

In this case, 0 means negative sentiment and 1 means positive sentiment.

Note: If your model is not performing well (for example, if the Accuracy is low or if the model only predicts '1' values), you can try adding more time and training again. This is a sample using a very small dataset; for production-level models, you'd want to add a lot more data and training time.

After evaluating and trying out your model, move on to the Consume step.

Generate code

After training is completed, four files are automatically added as code-behind to the SentimentModel.mbconfig:

SentimentModel.consumption.cs: This file contains the model input and output classes and a Predict method that can be used for model consumption.
SentimentModel.evaluate.cs: This file contains a CalculatePFI method that uses the Permutation Feature Importance (PFI) technique to evaluate which features contribute most to the model predictions.
SentimentModel.mlnet: This file is the trained ML.NET model, which is a serialized zip file.
SentimentModel.training.cs: This file contains the code to understand the importance input columns have on your model predictions.

In the Consume step in Model Builder, a code snippet is provided which creates sample input for the model and uses the model to make a prediction on that input.

Model Builder also offers Project templates that you can optionally add to your solution. There are two project templates (a console app and a web API), both which consume the trained model.

Consume your model

The last step is to consume your trained model in the end-user application.

Replace the Program.cs code in your myMLApp project with the following code:

Program.cs

using MyMLApp;
// Add input data
var sampleData = new SentimentModel.ModelInput()
{
    Col0 = "This restaurant was wonderful."
};

// Load model and predict output of sample data
var result = SentimentModel.Predict(sampleData);

// If Prediction is 1, sentiment is "Positive"; otherwise, sentiment is "Negative"
var sentiment = result.PredictedLabel == 1 ? "Positive" : "Negative";
Console.WriteLine($"Text: {sampleData.Col0}\nSentiment: {sentiment}");

Run myMLApp (select Ctrl+F5 or Debug > Start Without Debugging). You should see the following output, predicting whether the input statement is positive or negative.

The ML.NET CLI has generated the trained model and code for you, so you can now use the model in .NET applications (for example, your SentimentModel console app) by following these steps:

In the command line, navigate to the consumeModelApp directory.
Command prompt
```
cd SentimentModel
```

Open the Program.cs in any code editor and inspect the code. The code should look similar to the following:

Program.cs

using System;

namespace SentimentModel.ConsoleApp
{
    class Program
    {
        static void Main(string[] args)
        {
            // Add input data
            SentimentModel.ModelInput sampleData = new SentimentModel.ModelInput()
            {
              Col0 = @"Wow... Loved this place."
            };

            // Make a single prediction on the sample data and print results
            var predictionResult = SentimentModel.Predict(sampleData);

            Console.WriteLine("Using model to make single prediction -- Comparing actual Col1 with predicted Col1 from sample data...\n\n");


            Console.WriteLine($"Col0: @{"Wow... Loved this place."}");
            Console.WriteLine($"Col1: {1F}");


            Console.WriteLine($"\n\nPredicted Col1: {predictionResult.PredictedLabel}\n\n");
            Console.WriteLine("=============== End of process, hit any key to finish ===============");
            Console.ReadKey();
        }
    }
}

Run your SentimentModel.ConsoleApp. You can do this by running the following command in the terminal (make sure you are in the SentimentModel directory):

Command prompt

dotnet run

The output should look something like this:

Command prompt

Using model to make single prediction -- Comparing actual Col1 with predicted Col1 from sample data...


Col0: Wow... Loved this place.
Col1: 1
Class                          Score
-----                          -----
1                              0.9651076
0                              0.034892436
=============== End of process, hit any key to finish ===============

Next steps

Congratulations, you've built your first machine learning model with ML.NET Model Builder!

Now that you've got the basics, continue to with this with self-guided learning module on Microsoft Learn, where you'll use sensor data to detect whether a manufacturing device is broken.

Microsoft Learn: Train a predictive maintenance model

ML.NET for Beginners

Let Luis introduce you to the concepts of machine learning & AI, explain what you can do with it, and guide you on how to get started with OpenAI, Azure AI Services, and ML.NET:

Model Builder guide

Learn more about ML.NET Model Builder

ML.NET samples

Explore the ML.NET samples on GitHub

Developer docs

Dig deeper with the documentation for ML.NET

ML.NET Tutorial - Get started in 10 minutes

Intro

Purpose

Prerequisites

Time to Complete

Scenario

Download and install

Already have Visual Studio 2022?

Upgrade to the latest version of Model Builder

Check for Visual Studio updates

Install .NET SDK

Install ML.NET CLI

Create your app

Add machine learning

Pick a scenario

Download and add data

Add data

Train your model

Training results

What do these commands mean?

Progress

Evaluate your model

Try out your model

Top models

Generate code

Consume your model

Next steps

ML.NET for Beginners

Model Builder guide

ML.NET samples

Developer docs

ML.NET for Beginners

ML.NET CLI Docs

ML.NET samples

Developer docs

Report an issue

Provide feedback