added
Structured Neuron-level Pruning and Upload Progress Bar
July 24th, 2024
New Features
Structured Neuron-level Pruning (SNP)
- Added a new structured pruning method targeting neuron-level granularity.
- Allows finer-grained control over model size and latency during optimization.
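To illustrate what pruning at neuron granularity means, here is a minimal sketch for a two-layer fully connected network. It is a generic magnitude-based example, not the SNP algorithm shipped in this release: the function name `prune_hidden_neurons`, the L2-norm ranking, and the `keep_ratio` parameter are all assumptions made for illustration.

```python
import math


def prune_hidden_neurons(w1, b1, w2, keep_ratio):
    """Remove whole hidden neurons, ranked by incoming-weight L2 norm.

    w1: list of rows, one row of input weights per hidden neuron
    b1: list of hidden biases, one per hidden neuron
    w2: list of rows, one column per hidden neuron
    keep_ratio: fraction of hidden neurons to keep (hypothetical knob)
    """
    n_hidden = len(w1)
    n_keep = max(1, round(n_hidden * keep_ratio))

    # Score each neuron by the L2 norm of its incoming weights.
    norms = [math.sqrt(sum(v * v for v in row)) for row in w1]

    # Pick the highest-scoring neurons, then restore their original order.
    ranked = sorted(range(n_hidden), key=lambda i: norms[i], reverse=True)
    keep = sorted(ranked[:n_keep])

    # Structured pruning: drop the neuron's row in w1, its bias,
    # and the matching column in w2, so the layers really shrink.
    new_w1 = [w1[i] for i in keep]
    new_b1 = [b1[i] for i in keep]
    new_w2 = [[row[i] for i in keep] for row in w2]
    return new_w1, new_b1, new_w2, keep
```

Because entire rows and columns are removed, the resulting layers are smaller dense matrices, which is why structured pruning tends to reduce latency without needing sparse kernels.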
Upload Progress Bar
- Introduced a progress bar display when uploading large models or datasets via the SDK.
- Helps users track upload status more clearly and improves the user experience.
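As a rough picture of how a chunked upload can report progress, the sketch below reads a stream in fixed-size chunks and redraws a text bar after each one. It does not reflect the SDK's actual implementation: `render_bar`, `upload_with_progress`, and the `send_chunk` callback are hypothetical names used only for this example.

```python
import sys


def render_bar(sent, total, width=20):
    """Return a text progress bar such as '[#####-----] 50%'."""
    filled = width * sent // total if total else width
    percent = 100 * sent // total if total else 100
    return "[" + "#" * filled + "-" * (width - filled) + f"] {percent}%"


def upload_with_progress(stream, total, send_chunk, chunk_size=1024, out=sys.stderr):
    """Read `stream` in chunks, pass each to `send_chunk`, and redraw the bar.

    `send_chunk` stands in for whatever actually transmits the bytes.
    """
    sent = 0
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:
            break
        send_chunk(chunk)
        sent += len(chunk)
        # Carriage return moves the cursor back so the bar updates in place.
        out.write("\r" + render_bar(sent, total))
        out.flush()
    out.write("\n")
    return sent
```

Writing the bar to `stderr` by default keeps it out of any output that a script might pipe from `stdout`.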
Model Name Standardization & Deprecation Notices
- Unified naming convention for preloaded model names.
- Added deprecation warnings to guide users toward supported configurations.
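A deprecation notice of this kind is commonly implemented with Python's standard `warnings` module. The sketch below assumes a lookup table from legacy names to standardized ones. The table contents and the `resolve_model_name` helper are placeholders, not names taken from the SDK.

```python
import warnings

# Hypothetical mapping from a legacy model name to its standardized name.
_DEPRECATED_NAMES = {
    "legacy-model-name": "standard_model_name",
}


def resolve_model_name(name):
    """Return the standardized name, warning if a legacy name was used."""
    canonical = _DEPRECATED_NAMES.get(name)
    if canonical is None:
        return name  # already a supported name
    warnings.warn(
        f"Model name '{name}' is deprecated; use '{canonical}' instead.",
        DeprecationWarning,
        stacklevel=2,  # point the warning at the caller, not this helper
    )
    return canonical
```

Legacy names keep working during the transition, while the warning tells users which name to switch to.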
Environment Variable Configuration
- Refactored internal configuration to support `HOST` and `PORT` via environment variables.
- Simplifies deployment in containerized or cloud environments.
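Reading `HOST` and `PORT` from the environment could look roughly like the sketch below. Only the two variable names come from this release note. The `load_endpoint` helper, the default values, and the validation are assumptions for illustration.

```python
import os


def load_endpoint(env=None, default_host="localhost", default_port=8000):
    """Return (host, port) from HOST and PORT, falling back to defaults.

    The defaults here are illustrative, not the SDK's real defaults.
    """
    env = os.environ if env is None else env
    host = env.get("HOST", default_host)
    raw_port = env.get("PORT", str(default_port))
    try:
        port = int(raw_port)
    except ValueError:
        raise ValueError(f"PORT must be an integer, got {raw_port!r}") from None
    if not 0 < port < 65536:
        raise ValueError(f"PORT out of range: {port}")
    return host, port
```

In a container, the same image can then be pointed at a different endpoint by setting the two variables at launch, for example `HOST=0.0.0.0 PORT=9000`.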
Bug Fixes
- Updated file-matching logic to support both `*best.pt` and `*best_fx.pt` patterns during model search.
- Fixed attribute reference from `model.name` to `model_name` to prevent runtime errors.
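The file-matching fix can be pictured with shell-style patterns from Python's standard `fnmatch` module. The two patterns are the ones named in the first fix above. The `find_checkpoints` helper and the sample file names are hypothetical, and the SDK's real search logic may differ.

```python
from fnmatch import fnmatchcase

# The two patterns accepted during model search.
CHECKPOINT_PATTERNS = ("*best.pt", "*best_fx.pt")


def find_checkpoints(filenames):
    """Return the file names that match any accepted checkpoint pattern."""
    return [
        name
        for name in filenames
        if any(fnmatchcase(name, pattern) for pattern in CHECKPOINT_PATTERNS)
    ]
```

Note that `*best.pt` alone does not match a name ending in `best_fx.pt`, which is why the second pattern has to be listed explicitly.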
Why these matter
- The addition of Structured Neuron-level Pruning enables more granular and efficient model optimization, especially for edge deployment scenarios.
- Visual upload progress improves transparency and confidence during long upload operations.
- Unified naming and configuration approaches reduce user confusion and improve maintainability.
- Bug fixes ensure more reliable model handling and deployment flow.