Publications of Torsten Hoefler
Copyright Notice:

The documents distributed by this server have been provided by the contributing authors as a means to ensure timely dissemination of scholarly and technical work on a noncommercial basis. Copyright and all rights therein are maintained by the authors or by other copyright holders, notwithstanding that they have offered their works here electronically. It is understood that all persons copying this information will adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.



Bibtex entry:

@inproceedings{,
  author={Saleh Ashkboos and Ilia Markov and Elias Frantar and Tingxuan Zhong and Xincheng Wang and Jie Ren and Torsten Hoefler and Dan Alistarh},
  title={{QUIK: Towards End-to-End 4-Bit Inference on Generative Large Language Models}},
  year={2024},
  month={Nov.},
  pages={3355--3371},
  booktitle={Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP'24)},
  location={Miami, FL, USA},
  publisher={Association for Computational Linguistics},
  source={http://www.unixer.de/~htor/publications/},
}


© Torsten Hoefler