titanml |awq |decoding |inference In the Fast Lane! Speculative Decoding - 10x Larger Model, No Extra Cost Rockayyy Posted on October 5, 2023