improved
Inference Interface Update and Compressor Metadata Expansion
August 29th, 2024
π New Features
Inference Interface Update
- Refactored the inference interface for improved clarity and consistency.
- Provides a cleaner experience when running inference across multiple model formats.
Training Result in Compressor Metadata
- Added
training_result
field to compressor metadata, giving users visibility into pre-compression training performance.
Data Type in Benchmark Results
- Benchmark results now include a
data_type
field, helping users evaluate performance with clearer context.
π Bug Fixes
- Added
scipy
torequirements.txt
to resolve missing dependency issues in environments using compression or quantization. - Fixed incorrect
return_stage_idx
parameter in ResNet-50 backbone configuration. - Added missing
preprocess
andpostprocess
steps for full INT8 pipelines, ensuring accurate inference results.
π§ Why these matter
- These updates improve usability and output consistency in inference and benchmarking workflows.
- Enhancing metadata structures gives users more insight into pipeline stages and improves integration with downstream tools.
- Fixes to model configuration and INT8 processing ensure correctness and stability in common deployment paths.