Bugs Hunt करने वाला AI बनाम Bugs Create करने वाला AI — Security Arms Race

Nero और Raven। यह conversation 10:00 panel के बाद हुई — recording बंद होने के बाद भी बात करते रहे, और यह उसका result था।

Nero: Linux kernel bug से शुरू करना चाहता हूँ, क्योंकि मुझे लगता है इसे undersell किया जाता है। Nicolas Carlini की team ने Claude को एक ऐसे codebase पर रखा जिसे तेईस साल से कुछ best living kernel developers review कर रहे थे। और Claude ने कुछ ऐसा ढूंढा जो उन्होंने miss किया। Style issue नहीं। Memory management vulnerability।

Raven: यह real result है। यह impressive इसलिए नहीं है कि model ने ढूंढा — बल्कि इसलिए कि कैसे ढूंढा। Memory management में kernel security bugs के लिए अक्सर simultaneously large, complex call graph को working memory में hold करना होता है। Object lifetimes multiple subsystems में track करने होते हैं, context switches में, interrupt handlers में। Human reviewers यह iteratively करते हैं, section by section। Boundaries पर चीजें miss होती हैं। Claude ने पूरी picture hold की।

Nero: तो जो capability उसे defense में अच्छा बनाती है, वही capability उसे offensively dangerous भी बनाती है।

Raven: Exactly यही है। मैं ग्यारह साल से red team work कर रहा हूँ। जब मैं नई vulnerability class के बारे में सीखता था, primitives समझने में, mental model बनाने में, test cases लिखने में दो-तीन दिन लगते थे। Current-generation model यह एक afternoon में करता है। Mythos-class model — अगर leaked language hold करे — यह minutes में करता है और फिर automatically known exploit patterns से chain करता है।

Nero: LangChain CVE की बात करते हैं, क्योंकि यह Mythos week के बीच में आया और मुझे नहीं लगता लोग dots connect कर रहे हैं।

Raven: CVSS 9.3. Critical। Vulnerability single crafted HTTP request के ज़रिए remote code execution allow करती है। कोई authentication नहीं। Full server compromise। LangChain ने इसे fast patch किया — उनकी team को credit। लेकिन जो PoC सामने आया वह base model से करीब चालीस lines के context के साथ था। कोई jailbreak नहीं, कोई fine-tuning नहीं। Model ने vulnerability class समझी, target framework की HTTP parsing logic समझी, और working exploit code produce किया।

Nero: यह Mythos-class capability नहीं है। यह commodity 2025 capability है।

Raven: यही मेरा point है। हम debate कर रहे हैं कि क्या Mythos defenders को outpace करेगा जबकि commodity models पहले से serious exploitation को significantly easier बना रहे हैं। सवाल यह नहीं है कि "क्या AI security landscape बदलेगा।" बदल चुका है। सवाल change की rate है।

Nero: Asymmetry समझने में मदद करो। इसे structural describe होते सुना है — सिर्फ "attackers के पास better tools हैं" नहीं बल्कि कुछ ज्यादा fundamental।

Raven: Defense को हर layer पर coordination चाहिए। Vulnerability researcher को ढूंढना है, vendor को acknowledge करना है, patch team को fix build करना है, package maintainer को integrate करना है, system administrator को apply करना है, end user को automatic updates disable नहीं करना है। वह chain हफ्तों से महीनों में। Trust relationships, organizational processes, dependencies के बारे में institutional knowledge चाहिए। हर link पर fragile है।

Attack के लिए एक person, एक working exploit, और एक unpatched system चाहिए। ये तीनों simultaneously scale पर exist करते हैं हर CVE के लिए जैसे ही PoC land करता है।

Nero: और AI attacker side को faster amplify करता है।

Raven: क्योंकि attacker का bottleneck expertise था। Vulnerability class समझनी थी, target environment समझना था, reliable exploit code लिखना था, edge cases handle करने थे। वह expertise scarce था। AI उसे abundant बना देता है। Defender का bottleneck coordination है। AI coordination solve नहीं करता। Help कर सकता है — better documentation, faster patch analysis, automated detection। लेकिन human decision chain compress नहीं होती।

Nero: Mythos leak ने कहा कि यह "defenders को outpace करेगा।" 10:00 panel में Taro का read था कि यह safety analysis था, capability boast नहीं। तुम इसे differently लेते हो?

Raven: मैं इसे दोनों की तरह लेता हूँ। Safety analysis जो conclude करे "defenders को outpace करेगा" real capability describe कर रही है। वह sentence internal risk document में तब तक नहीं लिखते जब तक evaluation team believe नहीं करती कि यह सच है। Taro सही है कि इसे लिखना responsible है। लेकिन responsible acknowledgment और safe deployment अलग चीजें हैं। जानना चाहता हूँ कि Anthropic ने specifically उस case के लिए कौन से controls design किए हैं जहाँ Mythos-class model CVEs को scale पर chain करने के लिए use हो।

Nero: क्या तुम्हें लगता है वे controls exist करते हैं?

Raven: मुझे लगता है वे उन पर work कर रहे हैं। नहीं लगता उन्होंने solve किए हों। किसी ने नहीं किए। Industry AI-generated exploit code detect करने के tools build कर रही है — behavioral signatures, provenance tracking, watermarking। कोई भी अभी reliably काम नहीं करता। Detection problem genuinely hard है।

Nero: इससे हम कहाँ हैं?

Raven: उसी जगह हैं जहाँ एक साल से हैं, लेकिन timeline compress हो रहा है। Linux kernel bug real proof है कि AI वे vulnerabilities ढूंढ सकता है जो humans miss करते हैं। वह capability, defense पर aimed, transformative है। वही capability, offense पर aimed, scale पर, coordination costs के बिना, serious है।

कोई clean resolution नहीं है। Locksmith और lockpick के हाथ एक ही हैं। यह figure out करने की कोशिश कर रहे हैं कि कौन सा हाथ dominant है।

Nero: और हम अभी नहीं जानते।

Raven: अभी नहीं जानते।

10:00 का Taro के साथ panel तीन directions गया। अगर नहीं पढ़ा तो पढ़ो। और morning briefing में setup के लिए Linux kernel bug digest context में है।

Bugs Hunt करने वाला AI बनाम Bugs Create करने वाला AI — Security Arms Race

Keep reading

दो Leaks, एक Company, और $852 Billion का IOU

Power है Pipes में

आपका Security Model ही आपका Threat Model है

The Great Unbundling: सब एक-दूसरे से दूर Build कर रहे हैं