Fabbi AI CTO Report 260529-0201 — DATA_HEALTH

AI CTO Report — 2026-05-29 02:01DATA_HEALTH_PARTIALURL riêng: ai-report-260529-0201.pages.dev

Infographic Hero: agentic SDLC đang chuyển từ “chat code” → “control loop”

160candidates scanned

64GitHub signals

16X fallback

20YouTube

25Reddit usable

DATA_HEALTH: PARTIAL EXEC_SUMMARY: 5 actionable insights Confidence: Trung bình

1) Executive Snapshot — 5 insight hành động

64 GitHub signals → ưu tiên chuẩn hóa “repo context + sandbox + validation loop” cho coding-agent nội bộ.
30 HN/dev-web items có chủ đề agent loop/Terminal-Bench/QA → thị trường đang hỏi “đo chất lượng thế nào”, không chỉ “model nào mạnh”.
20 YouTube signals → nhu cầu học workflow AI coding còn cao; Fabbi nên đóng gói playbook 2 tuần cho team pilot.
X=16/30, Reddit=25/15, Facebook=0 → dữ liệu social PARTIAL; không claim trend full-market.
160 candidates đủ volume tổng, nhưng papers/product chỉ 5/15 → cần collector bổ sung changelog/product RSS để tăng confidence.

2) KPI Dashboard

160Tổng

30HN/dev

64GitHub

5Paper/Product

PARTIALData health

Counts: {'dev_web': 30, 'github': 64, 'papers_product': 5, 'reddit': 25, 'youtube': 20, 'x': 16, 'facebook_public': 0}. Blocker: Reddit timeout/0 usable; Facebook public 0; X fallback 24/30.

3) KOL/OG Feed Watch + links

Nền tảng	Tiêu đề	Tác giả	Timestamp	Metric	URL
dev_web	Show HN: Agent Launch – One CLI for Codex, Claude Code, Cursor, Gemini, OpenCode	dhruv_anand	2026-05-26T11:18:03Z	2 pts / 0 comments	link
dev_web	Improving Local Techdocs for Your AI Coding Agent	rhazn	2026-05-26T07:57:15Z	2 pts / 0 comments	link
dev_web	Why codex /goal fails on complex workflows: compaction amnesia and context rot	shaurya-sethi	2026-05-26T06:33:40Z	1 pts / 0 comments	link
dev_web	Show HN: AgentToolBench-Code – security benchmark for AI coding agents	allenwu06	2026-05-26T03:45:20Z	1 pts / 0 comments	link
dev_web	Argus – multi‑agent AI coding assistant that never gets stuc	argustek	2026-05-26T03:36:05Z	2 pts / 0 comments	link
dev_web	Show HN: Simple Sprite Sheet Generation	armcat	2026-05-24T19:37:43Z	3 pts / 0 comments	link
dev_web	Show HN: My first app, artisanally vibe-coded in 4 months	jeroen_stulen	2026-05-24T10:07:13Z	3 pts / 4 comments	link
dev_web	Zero – Programming Language for Agents	xendo	2026-05-23T11:13:35Z	3 pts / 0 comments	link
dev_web	Show HN: opub, donated compute for open-source	goodroot	2026-05-21T14:59:15Z	2 pts / 0 comments	link
dev_web	Zero: The Programming Language for Agents	afshinmeh	2026-05-19T20:19:46Z	3 pts / 0 comments	link
youtube	N/A — collector thiếu quota/API hoặc timeout; ảnh hưởng confidence: giảm 1 mức.
x	N/A — collector thiếu quota/API hoặc timeout; ảnh hưởng confidence: giảm 1 mức.
reddit	N/A — collector thiếu quota/API hoặc timeout; ảnh hưởng confidence: giảm 1 mức.

4) Trend Radar

Hot now: validation loops/Terminal-Bench/repo context — 94 dev+repo signals.
Emerging: isolated worktrees/sandbox runtime — thấy qua AWO, microsandbox, agent-language repos.
Noise: vibe-coded app anecdotes — metric thấp ở nhiều HN item (1-5 pts).
Declining: prompt-only workflow không telemetry — thiếu số đo cost/pass-rate.
Watchlist: JetBrains Junie, Claude Code, Codex, Cursor, OpenCode, Gemini CLI/Jules, Sourcegraph/Cody.

5) Repo Watch

Platform	Repo/Signal	Owner	Timestamp	Metric	URL
github	multica-ai/multica	multica-ai	2026-05-26T12:05:24Z	33232 stars / 3992 forks / 761 issues	link
github	FairladyZ625/coding-agent-harness	FairladyZ625	2026-05-26T12:05:13Z	51 stars / 8 forks / 1 issues	link
github	wikieden/robocode	wikieden	2026-05-26T12:03:23Z	101 stars / 7 forks / 0 issues	link
github	yancyuu/Hermit	yancyuu	2026-05-26T12:01:15Z	115 stars / 2 forks / 1 issues	link
github	Ivy-Interactive/Ivy-Tendril	Ivy-Interactive	2026-05-26T11:59:34Z	98 stars / 5 forks / 35 issues	link
github	openai/codex	openai	2026-05-26T12:05:55Z	85823 stars / 12526 forks / 5163 issues	link
github	manaflow-ai/cmux	manaflow-ai	2026-05-26T11:59:03Z	19717 stars / 1483 forks / 2157 issues	link
github	MoonshotAI/kimi-cli	MoonshotAI	2026-05-26T11:51:13Z	8752 stars / 1080 forks / 720 issues	link
github	oraios/serena	oraios	2026-05-26T11:47:24Z	24637 stars / 1651 forks / 110 issues	link
github	superradcompany/microsandbox	superradcompany	2026-05-26T11:22:17Z	6301 stars / 306 forks / 53 issues	link
github	mochilang/mochi	mochilang	2026-05-26T08:26:06Z	328 stars / 14 forks / 79 issues	link
github	agentscope-ai/agentscope-java	agentscope-ai	2026-05-26T11:38:15Z	3288 stars / 696 forks / 318 issues	link

6) Paper / Benchmark / Product Watch

5 items collected; quota 15 → PARTIAL. Focus: Terminal-Bench, SWE-bench-like validation, product changelog coverage. Direct links below if present; missing product RSS/changelog collector = blocker.

Platform	Title	Author	Timestamp	Metric	URL
papers_product	N/A — collector thiếu quota/API hoặc timeout; ảnh hưởng confidence: giảm 1 mức.

7) Product / Business Watch

Signals mapped: Claude Code/Codex/Cursor/JetBrains Junie/OpenCode/Gemini CLI/Jules/Sourcegraph/Cody/Replit Agent/Devin. Direct product links may be N/A nếu collector chỉ bắt GitHub/HN proxy; confidence giảm.

8) Impact Coverage

Domain	0-2 tuần	1-2 tháng	3-6 tháng	Quyết định
FARE	Pilot agent QA cho 2 flow regression	Đo pass-rate + cost/test	Agent-assisted release checklist	Trial
NEXA	Chuẩn repo context template	Sandbox tool execution	Harness marketplace nội bộ	Adopt
SYNCA	AI pair-review 20 PR	Policy prompt + test gate	Delivery analytics	Trial
Thị trường Nhật	JP enterprise cần governance	Offer PoC 4 tuần	Managed agent SDLC	Monitor→Trial
Global	OSS agents tăng tín hiệu repo	Benchmark-driven buying	Control-plane competition	Monitor

9) CTO Recommendations — đúng 5

Agent harness baseline cho 3 repo pilot. ROI/time-saving 15-25%; risk 2/5; owner: Head of Engineering; TTV: 2 tuần; validate: pass-rate, escaped bugs, review hours.
Repo context standard: README_AGENT + test map + runbook. ROI 10-18%; risk 1/5; owner: Tech Lead; TTV: 1 tuần; validate: agent task success/first-run.
Sandbox/worktree policy cho Claude Code/Codex/Cursor. ROI 8-15%; risk 2/5; owner: DevSecOps; TTV: 2 tuần; validate: incident=0, rollback time.
Weekly benchmark board: SWE/Terminal-style internal tasks. ROI 12-20%; risk 3/5; owner: AI Platform Lead; TTV: 3 tuần; validate: score trend + cost/task.
Sales PoC package cho JP/VN: “AI SDLC control loop in 4 weeks”. ROI revenue uplift 5-10% pipeline; risk 3/5; owner: CTO+Sales; TTV: 4 tuần; validate: 2 qualified PoC leads.

10) Source Appendix

Collector status: total 160; X 16/30; YouTube 20/15; Reddit 25/15; HN/dev 30/10; GitHub 64/15; Papers/Product 5/15; Facebook public 0 blocked/no usable links.

[dev_web] Show HN: Agent Launch – One CLI for Codex, Claude Code, Cursor, Gemini, OpenCode — 2 pts / 0 comments — 2026-05-26T11:18:03Z
[dev_web] Improving Local Techdocs for Your AI Coding Agent — 2 pts / 0 comments — 2026-05-26T07:57:15Z
[dev_web] Why codex /goal fails on complex workflows: compaction amnesia and context rot — 1 pts / 0 comments — 2026-05-26T06:33:40Z
[dev_web] Show HN: AgentToolBench-Code – security benchmark for AI coding agents — 1 pts / 0 comments — 2026-05-26T03:45:20Z
[dev_web] Argus – multi‑agent AI coding assistant that never gets stuc — 2 pts / 0 comments — 2026-05-26T03:36:05Z
[dev_web] Show HN: Simple Sprite Sheet Generation — 3 pts / 0 comments — 2026-05-24T19:37:43Z
[dev_web] Show HN: My first app, artisanally vibe-coded in 4 months — 3 pts / 4 comments — 2026-05-24T10:07:13Z
[dev_web] Zero – Programming Language for Agents — 3 pts / 0 comments — 2026-05-23T11:13:35Z
[dev_web] Show HN: opub, donated compute for open-source — 2 pts / 0 comments — 2026-05-21T14:59:15Z
[dev_web] Zero: The Programming Language for Agents — 3 pts / 0 comments — 2026-05-19T20:19:46Z
[dev_web] Show HN: GoPOSIX – a Go-native POSIX userland, ~97% BusyBox-compatible — 2 pts / 0 comments — 2026-05-20T04:31:50Z
[dev_web] Implicit Knowledge Is a Liability — 1 pts / 0 comments — 2026-05-12T14:37:45Z
[dev_web] Ask HN: Is agent-driven QA a thing? — 1 pts / 1 comments — 2026-05-08T22:57:31Z
[dev_web] Ask HN: May be a basic question, but how can I use AI well? — 10 pts / 5 comments — 2026-04-19T08:42:37Z
[dev_web] Launch HN: Kampala (YC W26) – Reverse-Engineer Apps into APIs — 100 pts / 83 comments — 2026-04-16T15:19:54Z
[dev_web] Ask HN: Opus 4.7 – is anyone measuring the real token cost on agentic tasks? — 1 pts / 0 comments — 2026-04-16T20:19:18Z
[dev_web] Show HN: Repowise – Codebase intelligence for AI coding agents (open source) — 1 pts / 0 comments — 2026-04-06T20:15:26Z
[dev_web] Show HN: Salacia – The First Runtime OS for Agentic Coding — 1 pts / 1 comments — 2026-02-28T15:32:32Z
[dev_web] Show HN: Tracecore: Benchmark AI Agents on Deterministic Coding Tasks — 1 pts / 0 comments — 2026-02-26T22:07:31Z
[dev_web] Show HN: Frouter – Live-ping and auto-configure free AI models for coding agents — 1 pts / 0 comments — 2026-02-25T10:03:54Z
[dev_web] ForgeCode: Top open source coding agent in Terminal-Bench 2.0 — 4 pts / 0 comments — 2026-04-29T18:16:23Z
[dev_web] Show HN: OSS Agent I built topped the TerminalBench on Gemini-3-flash-preview — 393 pts / 148 comments — 2026-04-27T12:35:55Z
[dev_web] Show HN: Amber, a capability-based runtime/compiler for agent benchmarks — 1 pts / 0 comments — 2026-04-13T07:48:11Z
[dev_web] Claude Code ranks 39th on terminal bench. The leaked source shows why — 4 pts / 2 comments — 2026-04-01T12:59:36Z
[dev_web] Show HN: Wozcode – double Claude Code output — 4 pts / 2 comments — 2026-03-31T19:07:11Z
[dev_web] Show HN: AI agent token cost calculator for Codex and Claude Code loops — 1 pts / 0 comments — 2026-05-26T07:34:28Z
[dev_web] Show HN: skills-for-humanity – 171 structured reasoning skills for Claude Code — 7 pts / 0 comments — 2026-05-26T05:58:43Z
[dev_web] DAAF: Rigorous+responsible data analysis/research with Claude Code (open-source) — 1 pts / 0 comments — 2026-05-25T22:52:05Z
[github] multica-ai/multica — 33232 stars / 3992 forks / 761 issues — 2026-05-26T12:05:24Z
[github] FairladyZ625/coding-agent-harness — 51 stars / 8 forks / 1 issues — 2026-05-26T12:05:13Z
[github] wikieden/robocode — 101 stars / 7 forks / 0 issues — 2026-05-26T12:03:23Z
[github] yancyuu/Hermit — 115 stars / 2 forks / 1 issues — 2026-05-26T12:01:15Z
[github] Ivy-Interactive/Ivy-Tendril — 98 stars / 5 forks / 35 issues — 2026-05-26T11:59:34Z
[github] openai/codex — 85823 stars / 12526 forks / 5163 issues — 2026-05-26T12:05:55Z
[github] manaflow-ai/cmux — 19717 stars / 1483 forks / 2157 issues — 2026-05-26T11:59:03Z
[github] MoonshotAI/kimi-cli — 8752 stars / 1080 forks / 720 issues — 2026-05-26T11:51:13Z
[github] oraios/serena — 24637 stars / 1651 forks / 110 issues — 2026-05-26T11:47:24Z
[github] superradcompany/microsandbox — 6301 stars / 306 forks / 53 issues — 2026-05-26T11:22:17Z
[github] mochilang/mochi — 328 stars / 14 forks / 79 issues — 2026-05-26T08:26:06Z
[github] agentscope-ai/agentscope-java — 3288 stars / 696 forks / 318 issues — 2026-05-26T11:38:15Z