AI-Driven E2E Testing · Live Demo of SkyTest Agent
Feels magical: uses computer use to operate a browser the way a human would
Burns through context tokens just as magically. Unreliable Pass/Fail signal. Slow.
Raw Claude/GPT with browser access — completes flows end-to-end
Hard to control. No reliable Pass/Fail. Can't assert confidently.
AI generates Playwright scripts fast — looks great on paper
You still have code to maintain. The dream is a fault-tolerant tester that adapts like a human, not more code.
→ SkyTest sits in the gap: plain-English test cases + structured Pass/Fail results
Key insight: cheap vision + reasoning models are now good enough for this job. The economics finally work.
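Sketch: what "plain-English test cases + structured Pass/Fail results" could look like as data. This is a minimal illustration, not SkyTest's actual schema. The names Verdict, StepResult, and CaseResult, the sample steps, and the rule that a case passes only when every step passes are all assumptions made for this example.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List, Optional


class Verdict(str, Enum):
    PASS = "pass"
    FAIL = "fail"


@dataclass
class StepResult:
    # Hypothetical shape: one plain-English step plus the verdict an agent reported.
    instruction: str
    verdict: Verdict
    evidence: str = ""  # free-text note on what was observed, if any


@dataclass
class CaseResult:
    # Hypothetical shape: a named test case and its per-step results.
    name: str
    steps: List[StepResult] = field(default_factory=list)

    @property
    def verdict(self) -> Verdict:
        # Assumed rule: pass only if there is at least one step and all steps passed.
        if self.steps and all(s.verdict is Verdict.PASS for s in self.steps):
            return Verdict.PASS
        return Verdict.FAIL

    def first_failure(self) -> Optional[StepResult]:
        # Return the first failing step, or None when nothing failed.
        return next((s for s in self.steps if s.verdict is Verdict.FAIL), None)


# Illustrative data only; these steps are made up for the example.
result = CaseResult(
    name="Login with valid credentials",
    steps=[
        StepResult("Open the login page", Verdict.PASS),
        StepResult("Sign in as a registered user", Verdict.PASS),
        StepResult(
            "Verify the dashboard greets the user by name",
            Verdict.FAIL,
            evidence="Greeting not visible",
        ),
    ],
)

failed = result.first_failure()
print(result.name, "->", result.verdict.value)
if failed is not None:
    print("Failed step:", failed.instruction, "|", failed.evidence)
```

The point of the structure: the test stays readable as plain English, while the overall verdict is computed from explicit per-step verdicts instead of being inferred from a free-form model transcript.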
Thank you · Questions?