forked from abetlen/llama-cpp-python
Hey, thanks for maintaining! I recently tried Next and Nemotron-3-Nano-30B-A3B-Q4_K_M. Both failed after the second completion (the first one always works) with something like:
```
init: the tokens of sequence 0 in the input batch have inconsistent sequence positions:
 - the last position stored in the memory module of the context (i.e. the KV cache) for sequence 0 is X = 16
 - the tokens for sequence 0 in the input batch have a starting position of Y = 2
it is required that the sequence positions remain consecutive: Y = X + 1
decode: failed to initialize batch
llama_decode: failed to decode, ret = -1
```
Is it just something on my setup? Can you run these newer models successfully?
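For context, the check the log is complaining about is a KV-cache position invariant: the first token of a new batch for a sequence must continue directly from the last position already stored in the cache for that sequence (Y = X + 1). A minimal Python sketch of that invariant — an illustration of the rule, not llama.cpp's actual code:

```python
def check_batch_positions(last_kv_pos: int, batch_start_pos: int) -> None:
    """Model the consecutive-position check from the log above:
    the batch must start at Y = X + 1, where X is the last position
    already stored in the KV cache for the sequence."""
    expected = last_kv_pos + 1
    if batch_start_pos != expected:
        raise ValueError(
            f"inconsistent sequence positions: last KV position X = {last_kv_pos}, "
            f"batch starts at Y = {batch_start_pos}; required Y = X + 1"
        )

# The failing case reported here: the cache holds positions up to X = 16,
# but the second completion's batch starts at Y = 2.
try:
    check_batch_positions(last_kv_pos=16, batch_start_pos=2)
except ValueError as e:
    print(e)
```

This suggests the second completion is decoding with stale KV-cache state from the first one (e.g. the cache was only partially cleared, or positions were not reset between calls), rather than anything model-specific.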