ONE - On-device Neural Engine
Pass to quantize weights with the GPTQ algorithm.
#include <QuantizeDequantizeWeightsWithGPTQPass.h>
Data Structures
  struct Context

Public Member Functions
  QuantizeDequantizeWeightsWithGPTQPass (std::unique_ptr< Context > &&ctx, HessianMap *hessian_map)
  virtual const char * name (void) const
  bool run (loco::Graph *graph)
    Run the pass.

Public Member Functions inherited from logo::Pass
  virtual ~Pass ()=default

Pass to quantize weights with the GPTQ algorithm.
Definition at line 36 of file QuantizeDequantizeWeightsWithGPTQPass.h.
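
QuantizeDequantizeWeightsWithGPTQPass (std::unique_ptr< Context > &&ctx, HessianMap *hessian_map) [inline]
Definition at line 48 of file QuantizeDequantizeWeightsWithGPTQPass.h.

virtual const char * name (void) const [inline, virtual]
Reimplemented from logo::Pass.
Definition at line 53 of file QuantizeDequantizeWeightsWithGPTQPass.h.

bool run (loco::Graph *graph) [virtual]
Run the pass.
Implements logo::Pass.
Definition at line 149 of file QuantizeDequantizeWeightsWithGPTQPass.cpp.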
References loco::active_nodes(), INFO, luci::layer_info_map(), LOGGER, luci::CircleNode::name(), and loco::output_nodes().
Referenced by package.infer.session::inference().
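
The member list above describes a pass that takes ownership of a Context, borrows a HessianMap through a raw pointer, and overrides name() and run() from its base class. The following is a minimal sketch of that shape only, and it does not use the real ONE headers. Every type in the `sketch` namespace (`Graph`, `Pass`, `HessianMap`, `GPTQLikePass`) is a hypothetical stand-in. The `bits` field of `Context`, the `visited()` accessor, and the rule that run() returns true when at least one node has a Hessian entry are all assumptions made for illustration, and no GPTQ arithmetic is performed.

```cpp
#include <cstddef>
#include <map>
#include <memory>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-ins; none of these are the real loco/logo/luci types.
namespace sketch
{

// Stand-in for a graph: just a list of node names.
struct Graph
{
  std::vector<std::string> node_names;
};

// Mirrors the listed base-class shape: a defaulted virtual destructor,
// an overridable name(), and a pure virtual run().
struct Pass
{
  virtual ~Pass() = default;
  virtual const char *name(void) const { return "sketch::Pass"; }
  virtual bool run(Graph *graph) = 0;
};

// Assumption: a Hessian is stored per node name as a flat float buffer.
using HessianMap = std::map<std::string, std::vector<float>>;

class GPTQLikePass final : public Pass
{
public:
  struct Context
  {
    int bits = 8; // assumed field, not taken from the real Context
  };

  // Same parameter shape as the documented constructor: the pass owns
  // the context and only borrows the Hessian map.
  GPTQLikePass(std::unique_ptr<Context> &&ctx, HessianMap *hessian_map)
    : _ctx{std::move(ctx)}, _hessian_map{hessian_map}
  {
  }

  const char *name(void) const override { return "sketch::GPTQLikePass"; }

  // Counts nodes that have a Hessian entry; a real pass would rewrite
  // their weights here. Returns true if any such node was found.
  bool run(Graph *graph) override
  {
    _visited = 0;
    for (const auto &node_name : graph->node_names)
    {
      if (_hessian_map != nullptr && _hessian_map->count(node_name) > 0)
        ++_visited;
    }
    return _visited > 0;
  }

  std::size_t visited() const { return _visited; }

private:
  std::unique_ptr<Context> _ctx;
  HessianMap *_hessian_map = nullptr;
  std::size_t _visited = 0;
};

} // namespace sketch
```

In this sketch a caller would build a `Context` with `std::make_unique`, move it into the pass, and keep the `HessianMap` alive for as long as the pass may run, since the pass holds only a non-owning pointer to it.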