Lei Mao's Log Book
  • Tags
  • Quantization

TensorRT Implicit Weight Quantization

 04-29-2025 04-29-2025 blog 8 minutes read (About 1265 words)
TensorRT Implicit Weight Quantization Caveats and Tricks

 
Deep Learning, Mathematics, TensorRT, Quantization

AWQ: Activation-Aware Weight Quantization

 01-01-2025 01-01-2025 blog 18 minutes read (About 2738 words)
Same Performance as Group-Wise Weight-Only Quantization But with Better Accuracy

 
Deep Learning, Mathematics, CUDA, Accelerated Computing, Quantization

PyTorch Eager Mode Quantization TensorRT Acceleration

 05-24-2024 05-24-2024 blog 7 minutes read (About 1051 words)
TensorRT Acceleration for PyTorch Native Eager Mode Quantization Models

 
Deep Learning, Python, Inference, PyTorch, Accelerated Computing, GPU, TensorRT, NVIDIA, Quantization

Quantization Unit Test

 03-25-2024 03-25-2024 blog 5 minutes read (About 748 words)
How To Unit Test Quantization Implementation

 
Deep Learning, Mathematics, Software Engineering, Quantization, Unit Test

Function Approximation Using Lookup Table and Interpolation

 09-22-2023 09-22-2023 blog 7 minutes read (About 1001 words)
Using Motorola CPU32 as an Example

 
Deep Learning, Computer Architecture, Quantization

PyTorch Quantization Aware Training

 12-06-2020 04-29-2021 blog 17 minutes read (About 2475 words)
PyTorch Inference Optimized Training Using Fake Quantization

 
PyTorch, Quantization, CNN

PyTorch Static Quantization

 11-28-2020 04-29-2021 blog 29 minutes read (About 4408 words)
PyTorch Static Quantization for Convolutional Neural Networks

 
PyTorch, Quantization, CNN

PyTorch Dynamic Quantization

 11-14-2020 04-29-2021 blog 8 minutes read (About 1193 words)
PyTorch Dynamic Quantization for Transformers

 
PyTorch, Transformer, Quantization, HuggingFace

Quantization for Neural Networks

 05-17-2020 02-09-2023 article an hour read (About 6957 words)
Mathematical Foundations to Neural Network Quantization

 
Machine Learning, Deep Learning, Mathematics, Neural Network, Quantization, Matrix Multiplication
© 2017-2026 Lei Mao  Powered by Hexo & Icarus