LLM的长文本处理为何频频“走神”?MuDAF给出了新答案。
查看图片
[CL]《MuDAF: Long-Context Multi-Document Attention Focusing through Contrastive Learning on Attention Heads》W Liu, N Wu, S Yang, W Ding... [Microsoft] (2025)
网页链接