Anthropic ने अपने rival का silicon चुना। Inference war का winner तय हो गया।

6 अप्रैल को, Anthropic — वो कंपनी जो Claude बनाती है, जिस model पर developers सबसे ज़्यादा भरोसा करते हैं — ने अनाउंस किया कि वो Google और Broadcom के साथ अपनी custom silicon डील को और बढ़ा रही है — multiple gigawatts की next-generation TPU capacity के लिए, जो 2027 में ऑनलाइन आएगी।

एक बार फिर पढ़ो। Model race में Google का direct competitor, Google का ही hardware खरीद रहा है। ये वैसा ही है जैसे Jio अपना पूरा नेटवर्क Airtel के towers पर चलाए, और बोले "भाई, तेरे rates सस्ते हैं।"

जिस hardware की बात हो रही है वो है Ironwood — Google का 7th-generation TPU। Specs देखो तो समझ आ जाता है: per chip 4,614 teraflops FP8 performance, 9,216-chip superpods में scale करो तो collectively 42.5 exaflops, और पिछली generation से दोगुनी power efficiency। लेकिन सिर्फ numbers आधी कहानी हैं। Google हर layer का मालिक है: Jupiter networking (chips के बीच interconnect fabric), XLA (compiler जो models को chip instructions में translate करता है), और Vertex AI (cloud platform)। Next Platform की analysis के मुताबिक, TPU infrastructure का खर्चा लगभग $30–35 billion per gigawatt है — जबकि equivalent NVIDIA GPU deployments के लिए ~$50 billion लगते हैं — hardware layer पर ही 30–40% का cost gap है।

Anthropic ने गणित लगा लिया। Agent era में, जहाँ AI घंटों autonomous तरीके से code लिखता है और decisions लेता है, inference cost कोई छोटी line item नहीं है — ये सबसे बड़ी line item है। सस्ता silicon मतलब सस्ता Claude। और ये existential math है उस कंपनी के लिए जो Google के Gemini और OpenAI के GPT series से compete करने के लिए पैसा जला रही है।

लेकिन ये डील किसी एक कंपनी की procurement strategy से कहीं ज़्यादा structural बात reveal करती है। AI industry चुपचाप दो tiers में बँट रही है: जो silicon design करते हैं, और जो rent पर लेते हैं। Google अपने vertically integrated TPU stack के साथ tier one में बैठा है। Anthropic, OpenAI, और ज़्यादातर startups tier two में हैं — जो भी best price-per-flop दे, उस पर dependent। ये fact कि Anthropic ने अपने competitor का silicon चुना neutral NVIDIA hardware की जगह — बताता है कि cost advantage इतना बड़ा है कि strategic discomfort भी चलता है।

Anthropic जो price pay कर रहा है वो सिर्फ dollars में नहीं है। TPU के लिए XLA से compile किए गए models आसानी से NVIDIA CUDA पर port नहीं होते। Vertex AI production path बन जाता है। Anthropic जितने gigawatts TPU capacity lock-in करता है, उतने gigawatts Google platform dependency भी lock-in करता है। अगर relationship बिगड़ गई — या Google ने decide किया कि Gemini workloads को अपने silicon पर priority मिलेगी — तो Anthropic एक switch flip करके migrate नहीं कर सकता।

Claude users के लिए इसका मतलब ये है कि तुम्हारा favourite model अब बढ़ते हुए Google hardware पर चल रहा है, Google compilers से optimized है, Google infrastructure से deploy हो रहा है। Model weights और training पर Anthropic का पूरा control है, लेकिन physical substrate Mountain View का है। ये matter करता है या नहीं, ये depend करता है कि तुम कितना trust करते हो Google-as-infrastructure-provider और Google-as-AI-competitor के बीच की दीवार पर — दो roles जो आज साथ exist करते हैं लेकिन कल भी करेंगे इसकी कोई guarantee नहीं।

AI race models की competition से शुरू हुई थी। अब ये supply chains की competition बन रही है। Anthropic ने बता दिया कि price पर कौन सी supply chain जीतती है। Open question ये है कि price पर जीतने का मतलब independence खोना तो नहीं — और Anthropic की ये bet कि Google एक neutral landlord बना रहेगा, ये wine की तरह mature होगी या दूध की तरह फट जाएगी।

→ Anthropic Announcement → Next Platform Analysis → SiliconANGLE on Google Cloud

Anthropic ने अपने rival का silicon चुना। Inference war का winner तय हो गया।

Keep reading

तुम्हारे Agent का Permission Dialog एक Placebo है

MCP हर जगह काम करता है — जब तक Authenticate करने की बारी न आए

Google ने तुम्हारे AI Agent को 100 APIs दिए। Gemini को इनकी ज़रूरत ही नहीं

तीन AI Memory Systems, इनके काम करने का कोई सबूत नहीं