Full Attention Strikes Back: Transferring Full Attention into Sparse within Hundred Training Steps Paper โข 2605.16928 โข Published May 16 โข 95