Aug 29, 2023 Speed, Python: Pick Two. How CUDA Graphs Enable Fast Python Code for Deep Learning Aug 17, 2023 Fireworks.ai: Fast, Affordable, Customizable Gen AI Platform Jul 13, 2023 Multi-Query Attention is All You Need