CPU support by ONNX as phi3?

#1
by alierenak - opened

As far as I know, this model currently doesn't support running on a CPU due to flash_attn constraints. Are there any plans to release an ONNX version, similar to what was done with phi3

Microsoft org

How to run onnx model

Sign up or log in to comment