model.engine speed is slower than model.pt #12653

Open
1 task done
shaimaahamam opened this issue May 13, 2024 · 3 comments
Labels
question Further information is requested

Comments

@shaimaahamam

Search before asking

Question

When I convert the model from model.pt to model.engine, the model file size increases and the prediction step takes more time.

Additional

No response

@shaimaahamam shaimaahamam added the question Further information is requested label May 13, 2024
@glenn-jocher
Member

Hello! It sounds like you're experiencing slower performance with the .engine format compared to the .pt format. This could be due to several factors including the complexity of your model, the configuration of the TensorRT optimization, or the specific hardware you are running the model on.

To potentially improve the performance, you might want to look into the following:

  • Ensure your TensorRT version is fully compatible with your hardware.
  • Experiment with different workspace sizes in the export command to optimize memory usage, which could influence speed.
  • Check that the input size (imgsz) used during export matches the one used during inference, as mismatches can lead to inefficiencies.

Here's an example command to adjust the workspace size during export:

yolo export model=path/to/model.pt format=engine workspace=8

If adjustments to these areas do not improve the performance, it might be helpful to profile both executions to understand where the bottleneck occurs.
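
To get you started on that profiling, here's a minimal sketch using the Ultralytics Python API that times both formats back to back (the file paths and run count are placeholders; adjust imgsz to match your export settings):

import time

from ultralytics import YOLO

# Placeholder paths -- substitute your own model and test image.
PT_WEIGHTS = "path/to/model.pt"
ENGINE_WEIGHTS = "path/to/model.engine"
IMAGE = "path/to/image.jpg"
RUNS = 100  # timed inference passes per model

for weights in (PT_WEIGHTS, ENGINE_WEIGHTS):
    model = YOLO(weights)
    model.predict(IMAGE, imgsz=640, verbose=False)  # warmup pass, excluded from timing
    start = time.perf_counter()
    for _ in range(RUNS):
        model.predict(IMAGE, imgsz=640, verbose=False)
    avg_ms = (time.perf_counter() - start) / RUNS * 1000
    print(f"{weights}: {avg_ms:.2f} ms/image averaged over {RUNS} runs")

Averaging over many runs after a warmup pass matters here, since the first TensorRT inference includes one-time setup cost that would otherwise skew the comparison.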

@shaimaahamam
Author

I have an RTX 3060 Ti:
Driver version: 545.29.06
GPU memory: 8 GB
CUDA version: 12.3
OS: Ubuntu 22.04
RAM: 98 GB
CPU: x86-64 AMD Ryzen 9 5900X 12-core processor

Which Python and TensorRT versions are compatible with this setup, and what workspace and batch sizes should I use?

@glenn-jocher
Member

Hello! For your setup with an RTX 3060 Ti, here’s a quick guide to get you started:

  • Python Version: Python 3.8 or newer should work well.
  • TensorRT Version: With CUDA 12.3, you should use TensorRT 8.x. Make sure to download the version compatible with CUDA 12.3 from the NVIDIA website.
  • Workspace Size: Starting with a workspace size of 4 GB is generally a good balance. You can adjust this if needed:
    yolo export model=yolov8n.pt format=engine workspace=4
  • Batch Size: If you're not facing memory issues, you can start with a batch size of 2 or more depending on your specific use case. Remember, larger batch sizes might increase throughput but also memory usage:
    yolo export model=yolov8n.pt format=engine batch=2

Feel free to tweak these settings based on your performance and accuracy needs! 🚀
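
If you'd like to verify what's actually installed before exporting, a quick check along these lines (assuming torch and tensorrt are both importable in your environment) can confirm the Python, CUDA, and TensorRT versions in play:

import sys

import tensorrt
import torch

# Report the versions that the TensorRT export will actually use.
print(f"Python:   {sys.version.split()[0]}")
print(f"PyTorch:  {torch.__version__} (built for CUDA {torch.version.cuda})")
print(f"TensorRT: {tensorrt.__version__}")
print(f"GPU:      {torch.cuda.get_device_name(0)}")

If the CUDA version PyTorch was built against differs from your system's CUDA 12.3, that mismatch is worth ruling out first.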
