Issues: ModelTC/lightllm
question about fp8 version of context_flashattention_nopad.py
#479 (bug), opened Jul 30, 2024 by changyuanzhangchina
from lightllm_ppl_int8kv_flashdecoding_kernel import group8_int8kv_flashdecoding_stage1
#475 (bug), opened Jul 24, 2024 by AlvL1225
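The title of #475 is itself the failing Python import. A minimal sketch of treating that kernel as an optional dependency, assuming it is only present when the PPL int8 KV-cache kernels are installed (the try/except guard is an assumption for illustration, not lightllm's actual handling):

    # Sketch only: the import line is taken verbatim from issue #475; the
    # optional-import guard around it is an assumption, not lightllm's code.
    try:
        from lightllm_ppl_int8kv_flashdecoding_kernel import (
            group8_int8kv_flashdecoding_stage1,
        )
        HAS_PPL_INT8KV_KERNEL = True
    except ImportError:
        group8_int8kv_flashdecoding_stage1 = None
        HAS_PPL_INT8KV_KERNEL = False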
[Feature]: Support for InternVL-Chat-V1-5
#462 (bug), opened Jul 10, 2024 by JingofXin
[BUG] Ask about Qwen models with weight quantization
#408 (bug), opened May 15, 2024 by Cesilina
[BUG] There already is a lightllm in pypi
#380 (bug), opened Mar 26, 2024 by rlippmann
Qwen-14B-INT8 faces the issue: 'QwenTransformerLayerWeight' object has no attribute 'q_weight_'
#333 (bug), opened Feb 20, 2024 by wangr0031
Inconsistent Output between LightLLM and Transformers Inference Library
#309 (bug), opened Jan 19, 2024 by Lvjinhong