"Hyperparameter Power Impact in Transformer Language Model Training."

Lucas Høyberg Puvis de Chavannes et al. (2021)


DOI: 10.18653/V1/2021.SUSTAINLP-1.12

access: open

type: Conference or Workshop Paper

metadata version: 2022-08-01

a service of Schloss Dagstuhl - Leibniz Center for Informatics