Announcement_2024 08 19

Released MagicDec: Breaking Throughput-Latency Trade-off for Long Context Generation with Speculative Decoding. Blog