Custom & Auto Quantization, Job Cancellation, and Project Folder Creation
November 14th, 2024
🚀 New Features
Custom & Auto Quantization Support
- Introduced flexible quantization configuration options:
  - Users can manually define bitwidth, symmetry, per-channel settings, and rounding behavior.
  - Alternatively, automatic quantization selects optimal settings based on calibration data.
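To illustrate what the manual options control, here is a minimal, generic sketch of uniform quantization in plain Python. It is not PyNetsPresso's API: the `QuantConfig` class, its field names, and the `quantize` function are hypothetical and exist only to show how bitwidth, symmetry, per-channel scaling, and rounding interact.

```python
import math
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class QuantConfig:
    # Hypothetical container; field names are illustrative, not PyNetsPresso's API.
    bitwidth: int = 8
    symmetric: bool = True
    per_channel: bool = False
    rounding: str = "nearest"  # "nearest" (Python's round, half-to-even) or "floor"


def _quantize_vector(values: List[float], cfg: QuantConfig) -> Tuple[List[int], float, int]:
    """Quantize one list of floats; return (integers, scale, zero_point)."""
    if cfg.symmetric:
        # Symmetric: range is centered on zero, so the zero point is always 0.
        qmax = 2 ** (cfg.bitwidth - 1) - 1
        qmin = -qmax
        max_abs = max(abs(v) for v in values) or 1.0
        scale = max_abs / qmax
        zero_point = 0
    else:
        # Asymmetric: map [lo, hi] (extended to include 0.0) onto [0, 2^b - 1].
        qmin, qmax = 0, 2 ** cfg.bitwidth - 1
        lo, hi = min(min(values), 0.0), max(max(values), 0.0)
        scale = (hi - lo) / (qmax - qmin) or 1.0
        zero_point = round(qmin - lo / scale)
    rnd = math.floor if cfg.rounding == "floor" else round
    # Scale, round, shift by the zero point, then clamp to the integer range.
    q = [min(qmax, max(qmin, rnd(v / scale) + zero_point)) for v in values]
    return q, scale, zero_point


def quantize(weights: List[List[float]], cfg: QuantConfig):
    """Quantize a 2D weight list; each row is treated as one output channel."""
    if cfg.per_channel:
        # Each channel gets its own scale and zero point.
        return [_quantize_vector(row, cfg) for row in weights]
    # Per-tensor: one scale and zero point shared by every channel.
    flat = [v for row in weights for v in row]
    q, scale, zp = _quantize_vector(flat, cfg)
    out, i = [], 0
    for row in weights:
        out.append((q[i:i + len(row)], scale, zp))
        i += len(row)
    return out
```

With per-tensor scaling, a channel with small weights shares the scale of the largest channel and uses only a few integer levels. Per-channel scaling gives that channel its own scale, which is one reason the option is commonly exposed.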
Quantizer Module Added
- Added a new `quantizer` module and corresponding launcher client API, enabling quantization workflows to be run through PyNetsPresso's unified interface.
Benchmark & Conversion Task Cancellation
- Users can now cancel benchmark and conversion tasks in progress, offering greater control during long-running operations.
Automatic Project Folder Creation
- When launching tasks, a project folder is now created automatically if one doesn’t exist, organizing output files consistently.
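The behavior described above corresponds to a common "create if missing" pattern. The sketch below uses only the standard library. The function name `ensure_project_folder` and its parameters are hypothetical, not the names used inside PyNetsPresso.

```python
from pathlib import Path


def ensure_project_folder(output_dir: str, project_name: str) -> Path:
    """Return the project folder, creating it (and any parents) if needed."""
    folder = Path(output_dir) / project_name
    # exist_ok=True makes the call a no-op when the folder already exists.
    folder.mkdir(parents=True, exist_ok=True)
    return folder
```

Because the call is idempotent, it can run at the start of every task launch without first checking whether the folder exists.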
JWT Expiration Handling Improvement
- A 60-second buffer has been added to JWT expiration checks to prevent unintentional session timeouts.
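One plausible reading of this change is that a token is treated as expired 60 seconds before its actual `exp` time, so it is refreshed before a request can fail mid-flight. The sketch below illustrates that check. The function name, its parameters, and the exact comparison are assumptions, not the library's implementation.

```python
import time
from typing import Optional

EXPIRY_BUFFER_SECONDS = 60  # buffer size taken from the release note


def is_token_expired(
    exp: float,
    now: Optional[float] = None,
    buffer: int = EXPIRY_BUFFER_SECONDS,
) -> bool:
    """Return True if the token expires within `buffer` seconds of `now`.

    `exp` is the JWT "exp" claim as a Unix timestamp in seconds.
    """
    if now is None:
        now = time.time()
    # Treat the token as expired slightly early so it is refreshed in time.
    return now >= exp - buffer
```

For example, a token with `exp = 1000` is still considered valid at `now = 939` and is considered expired from `now = 940` onward.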
🐞 Bug Fixes
- Added `InternalServerError` exception to clearly distinguish server-side failures from user errors.
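As a rough illustration of the distinction, the sketch below maps HTTP status codes to separate exception types. Only the name `InternalServerError` comes from this release note. The class definitions, the `ClientRequestError` type, and the `raise_for_status` helper are hypothetical and may differ from how PyNetsPresso defines or raises its exceptions.

```python
class ClientRequestError(Exception):
    """Hypothetical: the request itself was invalid (HTTP 4xx)."""


class InternalServerError(Exception):
    """Stand-in definition: the server failed to process a valid request (HTTP 5xx)."""


def raise_for_status(status_code: int, detail: str = "") -> None:
    """Raise an exception type that reflects which side caused the failure."""
    if 500 <= status_code < 600:
        raise InternalServerError(f"Server error {status_code}: {detail}")
    if 400 <= status_code < 500:
        raise ClientRequestError(f"Request error {status_code}: {detail}")
    # Any other status code is treated as success here and raises nothing.
```

Calling code can then catch `InternalServerError` to retry or report an outage, and handle user errors separately by correcting the input.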
🧠 Why these matter
- These updates significantly improve the flexibility and robustness of the optimization pipeline.
- Quantization is now easier to configure or automate depending on user preference and hardware constraints.
- Automatic folder creation and cancellation support improve user experience, especially during iterative workflows.
- Better token handling and error classification reduce friction and simplify debugging.