
Releases: microsoft/onnxruntime-genai

v0.3.0

21 Jun 21:23
964eb65

Release Notes

  • Added Phi-3 Vision model support for the DirectML (DML) execution provider (EP).
  • Addressed DML memory leak issue and crashes on long prompts.
  • Addressed crashes and slowness in GQA on the CPU EP with long prompts, caused by integer overflow.
  • Added the import library for the Windows C API package.
  • Addressed a bug with get_output('logits') so that it returns the logits for the entire prompt and not for the last generated token.
  • Addressed a bug that caused a crash when querying the device type of the model.
  • Added .NET Standard 2.0 compatibility.
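
As an illustration of the integer overflow class of bug mentioned above, here is a minimal Python sketch of how a buffer-size calculation can wrap around once the prompt gets long. It is not the actual ONNX Runtime code: the function names, the buffer shape, and the head count are all hypothetical, and 32-bit arithmetic is simulated because Python integers do not overflow.

```python
def wrap_int32(value: int) -> int:
    """Return the value a signed 32-bit integer would hold after wraparound."""
    value &= 0xFFFFFFFF
    return value - 0x100000000 if value >= 0x80000000 else value


def score_buffer_elements(num_heads: int, seq_len: int) -> int:
    """Element count of a hypothetical (num_heads, seq_len, seq_len) buffer."""
    return num_heads * seq_len * seq_len


NUM_HEADS = 32  # hypothetical head count, chosen only for the illustration

for seq_len in (2048, 8192, 16384):
    exact = score_buffer_elements(NUM_HEADS, seq_len)
    # A 32-bit index silently wraps once the exact count exceeds 2**31 - 1.
    print(f"seq_len={seq_len}: exact={exact}, as int32={wrap_int32(exact)}")
```

With these assumed sizes, the short prompt fits in 32 bits while the longer ones wrap to a negative or zero count, which is the kind of result that can lead to bad allocations or out-of-range indexing. Using a 64-bit type for size and offset arithmetic avoids the wraparound.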

ONNX Runtime GenAI v0.3.0-rc2

30 May 17:24
d536387
Pre-release

Release Notes

  • Added support for the Phi-3-Vision model.
  • Added support for the Phi-3-Small model.
  • Removed usage of std::filesystem to avoid runtime issues when loading incompatible symbols from stdc++ and stdc++fs.