MInference currently requires chunked prefill to be turned off in vLLM. However, vLLM v1 forces chunked prefill to be enabled, which makes MInference unusable with vLLM v1. It would be more practical if MInference were compatible with vLLM v1.
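
For context, here is a minimal sketch of the configuration this constraint implies today. The helper names `minference_vllm_kwargs` and `request_v0_engine` are hypothetical and only for illustration. `enable_chunked_prefill` is vLLM's engine argument for this feature. Whether `VLLM_USE_V1=0` still selects the older engine depends on the installed vLLM version, so treat that part as an assumption.

```python
def minference_vllm_kwargs(model_name: str) -> dict:
    """Hypothetical helper: keyword arguments for vllm.LLM with chunked prefill off."""
    return {
        "model": model_name,
        # MInference currently needs chunked prefill disabled.
        "enable_chunked_prefill": False,
    }


def request_v0_engine(env: dict) -> dict:
    """Return a copy of `env` that asks vLLM for the pre-v1 engine.

    Assumption: the installed vLLM version still honors VLLM_USE_V1=0.
    """
    updated = dict(env)
    updated["VLLM_USE_V1"] = "0"
    return updated


# Intended usage (requires vLLM to be installed, so it is left commented out):
#   import os
#   os.environ.update(request_v0_engine({}))
#   from vllm import LLM
#   llm = LLM(**minference_vllm_kwargs("<model name>"))
```

On vLLM v1 this workaround is not available, because chunked prefill is forced on there, which is the incompatibility this issue is about.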