Skip to content

[Feature Request]: Can MInference support chunked prefill? #186

@zongy17

Description

@zongy17

MInference requires vLLM to turn off chunked prefill currently. But vLLM v1 will force the enablement of chunked prefill, which makes MInference not usable with vLLM v1. It would be more practical if it were compatible with vLLM v1.

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions