Efficiently Harnessing Parameter Importance for Better TrainingTianjian Li, Haoran Xu, Philipp Koehn, Kenton MurrayLast updated on October 2023PDFRelatedError Norm Truncation: Robust Training in the Presence of Data Noise for Text Generation Models