EntropyLong: Effective Long-Context Training via Predictive Uncertainty

(arxiv.org)

15 points | by PaulHoule a day ago
