Until now, AI services based on Large Language Models (LLMs) have mostly relied on expensive data center GPUs. This has ...