Purpose: Research for proposing new ONNX operators to support Qwen3.5/Qwen3-Next linear attention.
Date: 2026-02-27
Sources:
- HuggingFace transformers: src/transformers/models/qwen3_next/modular_qwen3_next.py
- HuggingFace transformers: src/transformers/models/qwen3_5/modular_qwen3_5.py
- flash-linear-attention library: fla/ops/gated_delta_rule/