For every step, the Test Report shows three tabs: AI Request, AI Response, Screenshot.
The AI Request tab includes URL, step UID, action, direction, type, argument, element, location, context, intent, and justification.
This is the layer most testing tools hide. We show it because debugging a step is impossible without it.
Filter and sort by browser to see how your suite behaves across the matrix.
A failed test is not always a real failure. Sometimes the test runs the same flow twice, fails the first time, passes the second. Most teams either ignore the flake or rerun the whole suite. Both waste time.
From any test row, mark it as flaky. The system tags the test, surfaces it in flakiness analytics, and keeps a count over time. You see which tests need fixing versus which tests are catching real bugs.
From the test row menu, click Mark as flaky. No separate workflow.
Flaky tests appear in your flakiness view alongside their flake count and last failure.
Once you ship a fix, verify the test passes consistently across new runs.
Filter by:
See per-step replay with AI requests inline. Live demo, your test runs.
Request a Demo