SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
Updated May 29, 2024 · Python
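The low-bit quantization these toolkits implement maps floating-point tensors to narrow integer ranges via a scale factor. A minimal sketch of symmetric per-tensor INT8 quantization in plain Python (an illustration of the idea only, not any specific library's scheme; `quantize_int8` and `dequantize` are hypothetical helper names):

```python
def quantize_int8(values):
    """Symmetric per-tensor INT8 quantization: map floats into [-127, 127]
    using a single scale derived from the largest absolute value."""
    scale = max(abs(v) for v in values) / 127.0
    q = [max(-127, min(127, round(v / scale))) for v in values]
    return q, scale

def dequantize(q, scale):
    """Recover approximate float values from the quantized integers."""
    return [qi * scale for qi in q]

q, s = quantize_int8([0.5, -1.27, 0.0, 1.0])
# q is [50, -127, 0, 100]; dequantize(q, s) recovers the inputs
# up to rounding error in the 8-bit grid.
```

Real toolkits add per-channel scales, zero points for asymmetric ranges, and calibration over activation statistics, but the scale-and-round core is the same.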
TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.
🤗 Optimum Intel: Accelerate inference with Intel optimization tools
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
Neural Network Compression Framework for enhanced OpenVINO™ inference
Code for the paper "FOCIL: Finetune-and-Freeze for Online Class-Incremental Learning by Training Randomly Pruned Sparse Experts"
Characterization study repository for model compression method: pruning
Architecture for pruning methods analysis using pytorch prune module
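The unstructured magnitude pruning that PyTorch's prune module (and several repos above) build on can be sketched without any framework: rank weights by absolute value and zero out the smallest fraction. `magnitude_prune` below is a hypothetical illustrative helper, not the module's actual API:

```python
def magnitude_prune(weights, sparsity):
    """Zero out the smallest-magnitude weights until `sparsity` fraction
    of them are zero (the idea behind L1 unstructured pruning)."""
    k = int(len(weights) * sparsity)  # number of weights to remove
    if k == 0:
        return list(weights)
    # Threshold = k-th smallest absolute value.
    threshold = sorted(abs(w) for w in weights)[k - 1]
    pruned, removed = [], 0
    for w in weights:
        if abs(w) <= threshold and removed < k:
            pruned.append(0.0)  # masked weight
            removed += 1
        else:
            pruned.append(w)    # surviving weight
    return pruned

print(magnitude_prune([0.9, -0.05, 0.4, -0.8, 0.01, 0.3], 0.5))
# → [0.9, 0.0, 0.4, -0.8, 0.0, 0.0]
```

Structural pruning (as in the CVPR 2023 work listed below) removes whole channels or heads instead of individual weights, which is what actually speeds up dense hardware.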
Hung-yi Lee's Deep Learning Tutorial (recommended by Professor Hung-yi Lee 👍), PDF download: https://github.com/datawhalechina/leedl-tutorial/releases
Official Pytorch Implementation of "Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity"
Chess engine
This is the official implementation of "LLM-QBench: A Benchmark Towards the Best Practice for Post-training Quantization of Large Language Models", and it is also an efficient LLM compression tool with various advanced compression methods, supporting multiple inference backends.
Code for CPAL-2024 paper "Continual Learning with Dynamic Sparse Training: Exploring Algorithms for Effective Model Updates"
Sparsity-aware deep learning inference runtime for CPUs
[CVPR 2023] Towards Any Structural Pruning; LLMs / SAM / Diffusion / Transformers / YOLOv8 / CNNs
Config-driven, easy-to-use backup CLI for restic.
AIMET GitHub pages documentation