
Add vLLM to local-apps #744

Open · wants to merge 5 commits into main
Conversation

mgoin (Contributor) commented Jun 7, 2024

vLLM is a high-throughput and memory-efficient open-source serving engine for LLMs.

vLLM is fast with:

  • State-of-the-art serving throughput
  • Efficient management of attention key and value memory with PagedAttention
  • Continuous batching of incoming requests
  • Fast model execution with CUDA/HIP graphs
  • Quantization: FP8, GPTQ, AWQ, SqueezeLLM, FP8 KV Cache
  • Optimized CUDA kernels
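
For reviewers unfamiliar with vLLM, here is a minimal, illustrative sketch of its offline inference API; the model ID is a placeholder, not something this PR pins:

```python
# Minimal sketch of vLLM offline inference (model ID is illustrative).
from vllm import LLM, SamplingParams

# Load a model from the Hugging Face Hub (weights download on first run).
llm = LLM(model="mistralai/Mistral-7B-Instruct-v0.2")

# Sampling settings for generation.
params = SamplingParams(temperature=0.8, max_tokens=64)

# Prompts are batched and scheduled with continuous batching under the hood.
outputs = llm.generate(["What is PagedAttention?"], params)
for output in outputs:
    print(output.outputs[0].text)
```

vLLM also ships an OpenAI-compatible HTTP server (`python -m vllm.entrypoints.openai.api_server --model <model_id>`), which is presumably the kind of command a local-apps snippet would surface.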

Transparent logo (attached image): vllm-logo-text-light
