togethercomputer/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
Stars: 2Language: Python
Give AlbumentationsX a star on GitHub — it powers this leaderboard
Star on GitHubAn implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.