
Model conversion is an integrated development environment designed to help developers and AI engineers convert, quantize, optimize, and evaluate pre-built machine learning models on your local Windows platform. It offers a streamlined, end-to-end experience for models from sources like Hugging Face, optimizing them and enabling inference on local devices powered by NPUs, GPUs, and CPUs.

Prerequisites

To use model conversion, you need VS Code with the AI Toolkit extension installed.

Create project

Creating a project in model conversion is the first step toward converting, optimizing, quantizing and evaluating machine learning models.

  1. Open the AI Toolkit view, and select Models > Conversion to launch model conversion

  2. Start a new project by selecting New Model Project

    Screenshot that shows view for creating model project, including Primary Side Bar and create project button.

  3. Choose a base model

    • Hugging Face Model: choose the base model with predefined recipes from the supported model list.
    • Model Template: if the model is not included in the base model list, select an empty template for your customized recipes (advanced scenario).

    Screenshot that shows model list, such as bert, resnet, llama and so on.

  4. Enter project details: a unique Project Folder and a Project Name.

    A new folder with the specified project name is created in the location you selected for storing the project files.

Note

The first time you create a model project, it might take a while to set up the environment.

A README.md file is included in each project. If you close it, you can reopen it via the workspace. Screenshot that shows model readme.

Supported models

Model Conversion currently supports a growing list of models, including top Hugging Face models in PyTorch format.

LLM models

| Model Name | Hugging Face Path |
| --- | --- |
| Qwen2.5 1.5B Instruct | Qwen/Qwen2.5-1.5B-Instruct |
| DeepSeek R1 Distill Qwen 1.5B | deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B |
| Meta LLaMA 3.2 1B Instruct | meta-llama/Llama-3.2-1B-Instruct |
| Phi-3.5 Mini Instruct | microsoft/Phi-3.5-mini-instruct |

Non-LLM models

| Model Name | Hugging Face Path |
| --- | --- |
| Intel BERT Base Uncased (MRPC) | Intel/bert-base-uncased-mrpc |
| BERT Multilingual Cased | google-bert/bert-base-multilingual-cased |
| ViT Base Patch16-224 | google/vit-base-patch16-224 |
| ResNet-50 | microsoft/resnet-50 |
| CLIP ViT-B-32 (LAION) | laion/CLIP-ViT-B-32-laion2B-s34B-b79K |
| CLIP ViT Base Patch16 | openai/clip-vit-base-patch16 |
| CLIP ViT Base Patch32 | openai/clip-vit-base-patch32 |

(Optional) Add a model to an existing project

  1. Open the model project

  2. Select Models > Conversion, and then select Add Models on the right panel.

    Screenshot that shows how to add model. It contains a button to add models.

  3. Choose a base model or template, and then select Add.

    A folder containing the new model files is created in the current project folder.

(Optional) Create a new model project

  1. Open the model project

  2. Select Models > Conversion, and then select New Project on the right panel.

    Screenshot that shows how to create a new project. It contains a button to create a new project.

  3. Alternatively, close the current model project and create a new project from scratch.

Run workflow

Running a workflow in model conversion is the core step that transforms a pre-built ML model into an optimized and quantized ONNX model.

  1. Select File > Open Folder in VS Code to open the model project folder.

  2. Review the workflow configuration

    1. Select Models > Conversion
    2. Select the workflow template to view the conversion recipe.

    Screenshot that shows running a workflow. There is a workflow configuration section containing Conversion, Quantization and Evaluation.

    Conversion

    The workflow always executes the conversion step, which transforms the model into ONNX format. This step can't be disabled.

    Quantization

    This section enables you to configure the parameters for quantization.

    Important

    Hugging Face compliance alerts: Quantization requires calibration datasets. You may be prompted to accept license terms before proceeding. If you miss the notification, the run is paused while it waits for your input, so make sure notifications are enabled and that you accept the required licenses. Screenshot that shows disclaimer.

    • Activation Type: this is the data type used to represent the intermediate outputs (activations) of each layer in the neural network.

    • Weight Type: this is the data type used to represent the learned parameters (weights) of the model.

    • Quantization Dataset: calibration dataset used for quantization.

      If your workflow uses a dataset that requires license agreement approval on Hugging Face (e.g., ImageNet-1k), you’ll be prompted to accept the terms on the dataset page before proceeding. This is required for legal compliance.

      1. Select the HuggingFace Access Token button to get your Hugging Face Access Token.

        Screenshot that shows input token step 1: start to get Hugging Face Access Token.

      2. Select Open to open the Hugging Face website.

        Screenshot that shows input token step 2: open Hugging Face websites.

      3. Get your token on the Hugging Face portal, paste it into the Quick Pick, and press Enter.

        Screenshot that shows input token step 3: input token on dropdown textbox.

    • Quantization Dataset Split: the dataset split to use; datasets can have different splits, such as validation, train, and test.

    • Quantization Dataset Size: the number of samples used to quantize the model.

    For more information about activation and weight type, please see Data type selection.

    You can also disable this section. In that case, the workflow only converts the model to ONNX format without quantizing it.
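To make the Activation Type / Weight Type choice concrete: 8-bit quantization maps a floating-point range (estimated from the calibration dataset) onto integers via a scale and zero-point. Below is a pure-Python sketch of that affine mapping; the function names and hard-coded calibration range are illustrative, not part of the toolkit's API.

```python
# Minimal sketch of affine (asymmetric) uint8 quantization.
# The calibration range would normally come from running the
# calibration dataset through the model; here it's hard-coded.

def compute_qparams(rmin, rmax, qmin=0, qmax=255):
    """Scale and zero-point mapping [rmin, rmax] onto [qmin, qmax]."""
    rmin, rmax = min(rmin, 0.0), max(rmax, 0.0)  # range must include zero
    scale = (rmax - rmin) / (qmax - qmin)
    zero_point = round(qmin - rmin / scale)
    return scale, zero_point

def quantize(x, scale, zero_point, qmin=0, qmax=255):
    q = round(x / scale) + zero_point
    return max(qmin, min(qmax, q))               # clamp to the integer range

def dequantize(q, scale, zero_point):
    return (q - zero_point) * scale

scale, zp = compute_qparams(-1.0, 3.0)           # illustrative calibration range
q = quantize(0.5, scale, zp)
approx = dequantize(q, scale, zp)                # close to 0.5, within one scale step
```

Lower-precision types shrink the model and speed up NPU inference at the cost of this rounding error, which is why the calibration dataset matters: it determines the range the quantizer must cover.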

    Evaluation

    In this section, select the execution provider (EP) you want to use for evaluation, regardless of the platform on which the model was converted.

    • Evaluate on: the target device on which to evaluate the model. Possible values are:
      • Qualcomm NPU: requires a compatible Qualcomm device.
      • AMD NPU: requires a device with a supported AMD NPU.
      • Intel NPU: requires a device with a supported Intel NPU.
      • CPU: any CPU works.
    • Evaluation Dataset: the dataset used for evaluation.
    • Evaluation Dataset Split: the dataset split to use, such as validation, train, or test.
    • Evaluation Dataset Size: the number of samples used to evaluate the model.

    You can also disable this section. In that case, the workflow only converts the model to ONNX format without evaluating it.
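Conceptually, evaluation runs the configured number of samples from the chosen split through the model and aggregates metrics such as the accuracy and latency shown later under Metrics. A simplified sketch of that bookkeeping follows; the `predict` callable is a hypothetical stand-in for ONNX inference on the selected EP.

```python
# Sketch of metric aggregation during evaluation.
# `predict` stands in for ONNX model inference on the chosen EP.
import time

def evaluate(predict, samples):
    """Return accuracy and average per-sample latency in milliseconds."""
    correct, total_ms = 0, 0.0
    for inputs, label in samples:
        start = time.perf_counter()
        prediction = predict(inputs)
        total_ms += (time.perf_counter() - start) * 1000
        correct += int(prediction == label)
    n = len(samples)
    return {"accuracy": correct / n, "avg_latency_ms": total_ms / n}

# Toy usage: a "model" that predicts the parity of a sum.
dataset = [((1, 1), 0), ((1, 2), 1), ((2, 2), 0), ((0, 1), 1)]
metrics = evaluate(lambda x: sum(x) % 2, dataset)
```

The Evaluation Dataset Size setting simply bounds how many `samples` this loop sees, trading evaluation time against metric stability.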

  3. Run the workflow by selecting Run

    A default job name is generated using the workflow name and timestamp (e.g., bert_qdq_2025-08-04_20-45-00) for easy tracking.
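The naming convention can be reproduced in a few lines of Python; note that the exact format string below is inferred from the example name, not taken from the toolkit's source.

```python
# Sketch of the default job-name convention, e.g. bert_qdq_2025-08-04_20-45-00.
# The format string is inferred from the example name, not the toolkit's source.
from datetime import datetime

def default_job_name(workflow, now=None):
    stamp = (now or datetime.now()).strftime("%Y-%m-%d_%H-%M-%S")
    return f"{workflow}_{stamp}"

name = default_job_name("bert_qdq", datetime(2025, 8, 4, 20, 45, 0))
# → "bert_qdq_2025-08-04_20-45-00"
```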

    While the job is running, you can cancel it by selecting the status indicator or the three-dot menu under Action on the History board and then selecting Stop Running.


Note

Model conversion and quantization: you can run the workflow on any device, except for LLM models. The Quantization configuration is optimized for NPUs only; it's recommended to uncheck this step if the target system is not an NPU.

LLM model quantization: to quantize LLM models, an NVIDIA GPU is required.

If you want to quantize the model on another device with a GPU, you can set up the environment yourself; refer to ManualConversionOnGPU. Note that only the Quantization step needs the GPU. After quantization, you can evaluate the model on an NPU or CPU.

Tips for re-evaluation

After a model has been converted successfully, you can use the re-evaluate function to run evaluation again without repeating the conversion.

Go to the History board and find the model run job. Select the three-dot menu under Action to Re-evaluate the model.

You can choose different EPs or datasets for re-evaluation.

Screenshot that shows re-evaluation. It contains configurations such as name, system and datasets settings.

Tips for failed jobs

If your job is canceled or fails, you can select the job name to adjust the workflow and run the job again. To avoid accidental overwrites, each execution creates a new history folder with its own configuration and results.

View results

The History Board in Conversion is your central dashboard for tracking, reviewing, and managing all workflow runs. Each time you run a model conversion and evaluation, a new entry is created in the History Board—ensuring full traceability and reproducibility.

  • Find the workflow run that you want to review. Each run is listed with a status indicator (e.g., Succeeded, Cancelled).
  • Select the run name to view the conversion configurations.
  • Select the logs under the status indicator to view logs and detailed execution results.
  • Once the model is converted successfully, you can view the evaluation results under Metrics. Metrics such as accuracy, latency, and throughput are displayed alongside each run.

Screenshot that shows history, including name, time, parameters and so on.

Use sample notebook for model inference

  • Go to the History board. Select the three-dot menu under Action.

    Select Inference in Samples from the dropdown.

    Screenshot that shows actions, including inference, copy model path and re-evaluate.

  • Choose the Python environment

    • You'll be prompted to select a Python virtual environment. The default runtime is: C:\Users\{user_name}\.aitk\bin\model_lab_runtime\Python-WCR-win32-x64-3.12.9.
    • The default runtime contains everything needed; if you use a different environment, manually install the dependencies from requirements.txt.
  • The sample will launch in a Jupyter Notebook. You can customize the input data or parameters to test different scenarios.

Tip

Model compatibility: Ensure the converted model supports the EPs specified in the inference samples.

Sample location: Inference samples are stored alongside the run artifacts in the history folder.

Export and share with others

Go to the History board. Select Export to share the model project with others; this copies the model project without the history folders. If you want to share models with others, select the corresponding jobs instead; this copies the selected history folders containing the models and their configurations.

What you learned

In this article, you learned how to:

  • Create a model conversion project in AI Toolkit for VS Code.
  • Configure the conversion workflow, including quantization and evaluation settings.
  • Run the conversion workflow to transform a pre-built model into an optimized ONNX model.
  • View the results of the conversion, including metrics and logs.
  • Use the sample notebook for model inference and testing.
  • Export and share the model project with others.
  • Re-evaluate a model using different execution providers or datasets.
  • Handle failed jobs and adjust configurations for re-runs.
  • Understand the supported models and their requirements for conversion and quantization.
