Tap: The Interface Protocol for AI Agents — Forge Once, Run Forever
Tap is a universal protocol that separates the 'figuring out' from the 'doing' for AI agents interacting with web interfaces. It allows AI to analyze and forge deterministic scripts that can be executed without further AI involvement.
Why it matters
Tap provides a scalable and efficient way for AI agents to interact with web interfaces, reducing the need for expensive and unreliable step-by-step automation.
Key Points
- The Tap protocol separates the analysis and verification of web interfaces from the actual execution
- Tap provides 8 kernel primitives and 16 standard library operations for building AI-powered automation
- Tap supports two runtimes, a Chrome Extension and Playwright, with 81 ready-to-use skills for popular websites
- Tap is MCP-native, allowing integration with AI agents like Claude Code
Details
The article introduces Tap, a new interface protocol for AI agents that aims to make web automation more efficient and reliable. The key insight is that operating a web interface is a solved problem once the necessary steps have been figured out, and figuring out those steps is the part where AI excels. Tap separates this 'figuring out' phase from the actual execution: the AI analyzes the interface once and forges a deterministic script, which can then be run repeatedly without further AI involvement. The protocol consists of 8 kernel primitives and 16 standard library operations; a new runtime that implements them gains the protocol's full functionality. Tap currently supports two runtimes: a Chrome Extension and Playwright. It also comes with 81 ready-to-use skills for popular websites like Twitter, Reddit, and GitHub. Tap is designed to be MCP-native, so it can be integrated easily with AI agents like Claude Code.
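The separation described above can be illustrated with a minimal Python sketch. It is not Tap's actual API: the primitive names (`navigate`, `click`, `type_text`, `read_text`), the `fill_and_submit` helper, the script format, and the example URL and selectors are all hypothetical, chosen only to show the idea. A "forged" script is plain data, a small runtime implements a handful of kernel primitives, a library operation is composed from those primitives, and a deterministic executor replays the script with no AI in the loop.

```python
from dataclasses import dataclass, field
from typing import Any, Protocol


class Runtime(Protocol):
    """Hypothetical kernel: the primitives a runtime must implement."""

    def navigate(self, url: str) -> None: ...
    def click(self, selector: str) -> None: ...
    def type_text(self, selector: str, text: str) -> None: ...
    def read_text(self, selector: str) -> str: ...


@dataclass
class FakeRuntime:
    """In-memory stand-in for a browser so the sketch runs without one."""

    page_text: dict[str, str] = field(default_factory=dict)
    log: list[str] = field(default_factory=list)

    def navigate(self, url: str) -> None:
        self.log.append(f"navigate {url}")

    def click(self, selector: str) -> None:
        self.log.append(f"click {selector}")

    def type_text(self, selector: str, text: str) -> None:
        self.log.append(f"type {selector} {text}")

    def read_text(self, selector: str) -> str:
        self.log.append(f"read {selector}")
        return self.page_text.get(selector, "")


def fill_and_submit(rt: Runtime, field_sel: str, text: str, button_sel: str) -> None:
    """Hypothetical library operation built only from kernel primitives."""
    rt.type_text(field_sel, text)
    rt.click(button_sel)


# Library operations work on any runtime that provides the kernel.
STDLIB = {"fill_and_submit": fill_and_submit}

# A "forged" script: produced once (by an AI in the article's framing),
# then stored as plain data. The format here is an assumption.
FORGED_SCRIPT = [
    ("navigate", ["https://example.com/search"]),
    ("fill_and_submit", ["#q", "tap protocol", "#go"]),
    ("read_text", ["#first-result"]),
]


def run(script: list[tuple[str, list[str]]], rt: Runtime) -> Any:
    """Replay a forged script step by step; returns the last step's result."""
    result = None
    for op, args in script:
        if op in STDLIB:
            result = STDLIB[op](rt, *args)
        else:
            result = getattr(rt, op)(*args)
    return result


if __name__ == "__main__":
    runtime = FakeRuntime(page_text={"#first-result": "Tap docs"})
    print(run(FORGED_SCRIPT, runtime))
    print(runtime.log)
```

In this sketch, swapping `FakeRuntime` for a class backed by a real browser would leave `FORGED_SCRIPT` and `fill_and_submit` unchanged, which is the kind of runtime independence the summary attributes to the protocol.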