Open-source benchmark for browser AI agents on 153 everyday online tasks across 144 live websites. 5-layer recording + DOM-match + LLM judge. Top score 33.3%.
This repository is tracked by Trending Repos. The badge upgrades automatically if it ever cracks the top 100.
<img src="https://trending-repos.com/badge/TIGER-AI-Lab/ClawBench.svg" alt="Trending Repos" />https://trending-repos.com/badge/TIGER-AI-Lab/ClawBench.svg