+−生成侧沦陷，验证侧堰塞湖 · AI 与软件开发产业 · 2026-07Generation collapses, verification dams up · AI & software · Jul 2026

AI 把「打字」的成本压到趋近零——但软件的瓶颈从来不是打字。生成侧全面沦陷，验证侧筑起新堰塞湖 AI drove the cost of 'typing' toward zero — but software's bottleneck was never typing. The generation side collapses; the verification side dams up

最小单元＝「一次变更」（a diff）：一个代码差异从意图出发，经评审测试、进入生产、被监控、可回滚，最终沉淀为组织能力。软件业的本质是「围绕变更的风险管理」，不只是代码生产——AI 只压缩了「敲键盘」这一环。The atomic unit is 'a change' (a diff): a code difference that goes from intent through review and tests, into production, monitored, revertible, and finally settling as organizational capability. Software is 'risk management around change,' not just code production — AI only compressed the 'keystroke' step.

「写代码」与「交付软件」正在被重新定价：需求澄清、架构权衡、验证信任、事故责任——这四件原本就昂贵的事反而更值钱。DORA 把 AI 定义为「放大器」：它加速健康的系统，也放大既有的失调。'Writing code' and 'delivering software' are being re-priced: requirement clarification, architectural trade-offs, verification-trust, and incident accountability — the four already-expensive things get dearer. DORA calls AI an 'amplifier': it speeds up healthy systems and amplifies existing dysfunction.

主脊是 SDLC「一次变更」八节点（需求→架构→编码→测试→评审→部署→监控→事故责任），每节点标注传统 vs AI + 吞噬度。另含加速鞭打诚实层、coding agent 四代跃迁、经验断层双轨、五块硬骨头。这是一张批判性行业解剖，不是 AI 颂歌。相邻议题见姊妹图：一人公司 / 独立开发者→startup、AI 生成代码的安全→security、初级岗位断层与培训→edu。 The spine is the eight-node 'one change' SDLC (requirements → architecture → coding → testing → review → deploy → monitoring → incident accountability), each tagged traditional vs AI + how far AI has eaten it. Plus the Acceleration Whiplash honesty layer, the coding-agent four-generation leap, the experience-gap dual track, and five hard bones. A critical dissection, not an AI hymn. Adjacent topics on siblings: the one-person company / indie dev → startup, the security of AI-written code → security, the junior-role gap & training → edu.

传统节点Traditional

生成侧 · AI 吞噬Generation · eaten

验证侧 · 堰塞湖Verification · the dam

agent · 资本重构Agent · capital

加速鞭打 · 经验断层Whiplash · the gap

41–46%

生产环境新代码由 AI 生成（GitHub Copilot 平均 46%、Java 高达 61%，较 2022 的 27% 大涨；预计短期破 60%）——打字成本趋零Of new production code, 41–46% is AI-generated (Copilot avg 46%, Java up to 61%, from 27% in 2022; expected to top 60% soon) — typing cost near zero

+441%

PR 评审中位时间上升（Faros「加速鞭打」，2.2 万开发者）；同时生产事故 +242.7%、churn +861%、31.3% 的 PR 无人评审就合并Median PR-review time rose (Faros 'Acceleration Whiplash,' 22k devs); alongside production incidents +242.7%, churn +861%, and 31.3% of PRs merged with no review

$2B ARR

Cursor（Anysphere）ARR，2025-01 的 1 亿 → 2026-02 的 20 亿美元——零到 20 亿约三年，有记录以来最快的 B2B 软件公司（D 轮估值 293 亿）Cursor's ARR, from $100M (Jan 2025) to $2B (Feb 2026) — zero to $2B in ~3 years, the fastest B2B software company on record (Series D at $29.3B)

−20%

美国 22–25 岁软件开发者就业，2025-09 较 2022 峰值下降近 20%（Stanford，论文口径）——阶梯的第一级正被抽掉。⚠️AI 归因仍有争议US software-developer employment for ages 22–25 fell ~20% by Sep 2025 vs the 2022 peak (Stanford, paper) — the first rung of the ladder is being pulled. ⚠️AI attribution is still contested

口径警告：本页是批判性行业分析，非工具选型 / 投资建议，刻意保留反方证据。AI 生成代码占比分三层读：公司 / 整体级 41–46%（A/D）、预测 60%（预测）、YC 极端团队 95%（非普遍，B/C）。采用率：DORA 90% 与 Stack Overflow 84% 双锚（均 A，样本 / 问法不同）。初级岗位降幅以 Stanford「22–25 岁 −20%」为主锚（论文），其余（−67% 招聘量、SignalFire −65% / −76% vs 2019、Indeed −49%）按基期 / 口径并列。AI 就业归因有争议：NBER（丹麦数据）发现 ChatGPT 采用两年后「精确的零效应」，中国官方多归因经济下行——避免「AI 单因论」。CodeRabbit「1.7× 重大问题 / 2.74× 漏洞率」是两个不同指标；Cursor「$60B 被 SpaceX/xAI 收购」为未证实传闻（D）。快速变动的营收 / 估值均带日期戳；厂商自述 / 预测打 D 级。每张卡片右上角 A/B/C/D＝证据强度。 Basis warning: a critical industry analysis, not tool-selection / investment advice, deliberately keeping counter-evidence. Read AI-code share in three tiers: company/overall 41–46% (A/D), forecast 60% (forecast), YC extreme teams 95% (not typical, B/C). Adoption: DORA 90% and Stack Overflow 84% as dual anchors (both A, different samples/phrasing). Junior-role decline anchors on Stanford's '22–25, −20%' (paper); others (−67% postings, SignalFire −65%/−76% vs 2019, Indeed −49%) shown by base-year/basis. AI job-attribution is contested: NBER (Danish data) finds a 'precise zero' two years after ChatGPT adoption; Chinese officials mostly cite the downturn — avoid an 'AI-only' story. CodeRabbit's '1.7× major issues / 2.74× vuln rate' are two different metrics; Cursor's '$60B SpaceX/xAI acquisition' is an unverified rumor (D). Fast-moving revenue/valuation carry date stamps; vendor claims/forecasts are grade D. Each card's top-right A/B/C/D = evidence strength.

◆ 诚实层 · 加速鞭打（最重）The honesty layer · the Acceleration Whiplash

一个 diff，两侧命运：生成变便宜，验证变贵One diff, two fates: generation gets cheap, verification gets dear

AI「吞噬」SDLC 不是一条单调线性推进，而是「生成侧全面沦陷、验证侧筑起新堰塞湖」的双侧分化。把一次变更画成一个 diff——绿色的「+」全被 AI 接管，红色的「−」反而成了新的价值中心、新的瓶颈、新的预算。AI 'eating' the SDLC isn't a monotonic march; it's a two-sided divergence: the generation side collapses, the verification side dams up. Draw one change as a diff — the green '+' is all taken over by AI, while the red '−' becomes the new value center, new bottleneck, new budget.

a/software-economics.diff一次变更 · 重新定价one change · re-priced

+生成侧（成本趋零）generation (cost → 0)：打字 · 样板 Boilerplate · CRUD · 脚手架 · 胶水代码 · 回归 / 单元测试 · 简单修复 · 文档生成typing · boilerplate · CRUD · scaffolding · glue code · regression/unit tests · simple fixes · docs

−验证侧（反而更贵）verification (gets dearer)：需求正确 · 架构权衡 · 系统稳定 · 责任可追 · 成本可控 —— 新瓶颈 = 新价值 = 新岗位 = 新预算correct requirements · architectural trade-offs · system stability · traceable accountability · cost control — new bottleneck = new value = new budget

两点关键修正，避免把「吞噬」看成单调线性：① 需求定义不是被吞噬，而是被抬升为最贵、最高杠杆的环节；② 评审是矛盾节点——AI 能评审，但它同时把待评审代码量放大十倍，评审反而成为吞吐瓶颈。Two corrections against a monotonic view: ① requirement definition isn't eaten but elevated to the dearest, highest-leverage step; ② review is the paradox node — AI can review, but it also multiplies the code awaiting review tenfold, so review becomes the throughput bottleneck.

▚ 加速鞭打 · Faros AI《2026 工程报告》（2.2 万开发者 / 4000+ 团队）The whiplash · Faros AI 2026 (22k devs / 4000+ teams)

生成提速的代价，全都记在验证侧The cost of faster generation lands entirely on verification

+441%

PR 评审中位时间median PR-review time

+242%

每个 PR 引发的生产事故production incidents per PR

+861%

代码 churn（提交后即被删 / 覆盖）code churn (soon deleted/overwritten)

+54%

人均 bugbugs per developer

31.3%

PR 完全无人评审就合并of PRs merged with no review

交叉印证：DORA 2025（约 5000 人）AI 采用率 90%（+14pct）、65% 高度依赖，但 30% 几乎不信任——AI 改善几乎每个维度，唯独软件交付稳定性下降。Stack Overflow 2025：66% 把「几乎正确但不完全对」列为主要挫败，45% 认为调试 AI 代码更花时间。GitClear（2.11 亿行）：重构比例 25%→不足 10%、重复率飙约四倍。METR：资深开源开发者用 AI 实际慢 19%，却自认快 24%（43pct 感知偏差）。Cross-checks: DORA 2025 (~5000): 90% adoption (+14pt), 65% heavily rely, yet 30% barely trust it — AI improves nearly every dimension except delivery stability. Stack Overflow 2025: 66% cite 'almost right but not quite' as a top frustration, 45% say debugging AI code takes longer. GitClear (211M lines): refactoring 25%→under 10%, duplication up ~4×. METR: experienced OSS devs were actually 19% slower with AI while feeling 24% faster (a 43-point perception gap).

≋ 奈奎斯特稳定判据 · AI 是放大器，不是解药The Nyquist criterion · AI is an amplifier, not a cure

DORA 2025 借控制论的奈奎斯特稳定判据作比：任何控制系统，都必须以至少两倍于被控系统的速度运行。当「生成」提速十倍，「验证 / 控制」环节若不同步提速，系统必然失控——这正是加速鞭打的物理学。而 Anthropic 约 40 万个 Claude Code 会话分析显示：人仍主导「做什么（what）」的规划，AI 主导「怎么做（how）」的执行——AI 还没吃掉「问题定义权」。DORA 2025 borrows the control-theory Nyquist stability criterion: any control system must run at at least twice the speed of the system it controls. When 'generation' speeds up tenfold and 'verification/control' doesn't keep pace, the system goes unstable — this is the physics of the whiplash. And Anthropic's analysis of ~400,000 Claude Code sessions shows humans still drive the 'what' (planning) while AI drives the 'how' (execution) — AI hasn't taken the problem-definition rights.

AI 把「打字」的成本压到趋近零——但软件的瓶颈从来不是打字。生成侧全面沦陷，验证侧筑起新堰塞湖 AI drove the cost of 'typing' toward zero — but software's bottleneck was never typing. The generation side collapses; the verification side dams up

从这张图看到的五条规律Five patterns this map makes visible