SpeedLoader: An I/O efficient scheme for heterogeneous and distributed LLM operation

Yiqi Zhang, Yang You. SpeedLoader: An I/O efficient scheme for heterogeneous and distributed LLM operation. In Amir Globersons, Lester Mackey, Danielle Belgrave, Angela Fan, Ulrich Paquet, Jakub M. Tomczak, Cheng Zhang 0005, editors, Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, NeurIPS 2024, Vancouver, BC, Canada, December 10 - 15, 2024. 2024. [doi]

@inproceedings{ZhangY24-62,
  title = {SpeedLoader: An I/O efficient scheme for heterogeneous and distributed LLM operation},
  author = {Yiqi Zhang and Yang You},
  year = {2024},
  url = {http://papers.nips.cc/paper_files/paper/2024/hash/3d3a9e085540c65dd3e5731361f9320e-Abstract-Conference.html},
  researchr = {https://researchr.org/publication/ZhangY24-62},
  cites = {0},
  citedby = {0},
  booktitle = {Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, NeurIPS 2024, Vancouver, BC, Canada, December 10 - 15, 2024},
  editor = {Amir Globersons and Lester Mackey and Danielle Belgrave and Angela Fan and Ulrich Paquet and Jakub M. Tomczak and Cheng Zhang 0005},
}