Browser Use

Web ブラウザの自動操作を通じて、AI エージェントにオンラインタスク実行能力を与える Python フレームワーク。Playwright ベースで、LLM と組み合わせた複雑な Web インタラクション自動化を実現します。

原文: 🌐 Make websites accessible for AI agents. Automate tasks online with ease.

#ブラウザ自動化#AIエージェント#Python#ai-agents#ai-tools#browser-automation#browser-use#llm#playwright#python

リポジトリ統計

⭐ Stars

93k

🍴 Forks

10.5k

⚠️ Open Issues

232

🌿 Language

Python

📄 License

MIT

🕒 最終更新

2026.05.09 (今日)

📅 公開日

2024.11.01

🌿 Branch

main

from browser_use import Agent, Browser, ChatBrowserUse # from browser_use import ChatGoogle # ChatGoogle(model='gemini-3-flash-preview') # from browser_use import ChatAnthropic # ChatAnthropic(model='claude-sonnet-4-6') import asyncio async def main(): browser = Browser( # use_cloud=True, # Use a stealth browser on Browser Use Cloud ) agent = Agent( task="Find the number of stars of the browser-use repo", llm=ChatBrowserUse(), # llm=ChatGoogle(model='gemini-3-flash-preview'), # llm=ChatAnthropic(model='claude-sonnet-4-6'), browser=browser, ) await agent.run() if __name__ == "__main__": asyncio.run(main())

browser-use open https://example.com # URLに移動 browser-use state # クリック可能な要素を表示 browser-use click 5 # インデックスで要素をクリック browser-use type "Hello" # テキストを入力 browser-use screenshot page.png # スクリーンショットを取得 browser-use close # ブラウザを閉じる

🌤️ Want to skip the setup? Use our cloud for faster, scalable, stealth-enabled browser automation!

🤖 LLM Quickstart

Direct your favorite coding agent (Cursor, Claude Code, etc) to Agents.md
Prompt away!

👋 Human Quickstart

1. Create environment and install Browser-Use with uv (Python>=3.11):

uv init && uv add browser-use && uv sync
# uvx browser-use install  # Run if you don't have Chromium installed

2. [Optional] Get your API key from Browser Use Cloud:

# .env
BROWSER_USE_API_KEY=your-key
# GOOGLE_API_KEY=your-key
# ANTHROPIC_API_KEY=your-key

3. Run your first agent:

from browser_use import Agent, Browser, ChatBrowserUse
# from browser_use import ChatGoogle  # ChatGoogle(model='gemini-3-flash-preview')
# from browser_use import ChatAnthropic  # ChatAnthropic(model='claude-sonnet-4-6')
import asyncio

async def main():
    browser = Browser(
        # use_cloud=True,  # Use a stealth browser on Browser Use Cloud
    )

    agent = Agent(
        task="Find the number of stars of the browser-use repo",
        llm=ChatBrowserUse(),
        # llm=ChatGoogle(model='gemini-3-flash-preview'),
        # llm=ChatAnthropic(model='claude-sonnet-4-6'),
        browser=browser,
    )
    await agent.run()

if __name__ == "__main__":
    asyncio.run(main())

Check out the library docs and the cloud docs for more!

Open Source vs Cloud

We benchmark Browser Use across 100 real-world browser tasks. Full benchmark is open source: browser-use/benchmark.

Use the Open-Source Agent

You need custom tools or deep code-level integration
We recommend pairing with our cloud browsers for leading stealth, proxy rotation, and scaling
Or self-host the open-source agent fully on your own machines

Use the Fully-Hosted Cloud Agent (recommended)

Much more powerful agent for complex tasks (see plot above)
Easiest way to start and scale
Best stealth with proxy rotation and captcha solving
1000+ integrations (Gmail, Slack, Notion, and more)
Persistent filesystem and memory

Demos

📋 Form-Filling

Task = "Fill in this job application with my resume and information."

Example code ↗

🍎 Grocery-Shopping

Task = "Put this list of items into my instacart."

grocery-use-large.mp4

Example code ↗

💻 Personal-Assistant.

Task = "Help me find parts for a custom PC."

pc-use-large.mp4

Example code ↗

💡See more examples here ↗ and give us a star!

🚀 Template Quickstart

Want to get started even faster? Generate a ready-to-run template:

uvx browser-use init --template default

This creates a browser_use_default.py file with a working example. Available templates:

default - Minimal setup to get started quickly
advanced - All configuration options with detailed comments
tools - Examples of custom tools and extending the agent

You can also specify a custom output path:

uvx browser-use init --template default --output my_agent.py

💻 CLI

Fast, persistent browser automation from the command line:

browser-use open https://example.com    # Navigate to URL
browser-use state                       # See clickable elements
browser-use click 5                     # Click element by index
browser-use type "Hello"                # Type text
browser-use screenshot page.png         # Take screenshot
browser-use close                       # Close browser

The CLI keeps the browser running between commands for fast iteration. See CLI docs for all commands.

Claude Code Skill

For Claude Code, install the skill to enable AI-assisted browser automation:

mkdir -p ~/.claude/skills/browser-use
curl -o ~/.claude/skills/browser-use/SKILL.md \
  https://raw.githubusercontent.com/browser-use/browser-use/main/skills/browser-use/SKILL.md

Integrations, hosting, custom tools, MCP, and more on our Docs ↗

FAQ

What's the best model to use?

We optimized ChatBrowserUse() specifically for browser automation tasks. On avg it completes tasks 3-5x faster than other models with SOTA accuracy.

Pricing (per 1M tokens):

Input tokens: $0.20
Cached input tokens: $0.02
Output tokens: $2.00

For other LLM providers, see our supported models documentation.

Should I use the Browser Use system prompt with the open-source preview model?

Yes. If you use ChatBrowserUse(model='browser-use/bu-30b-a3b-preview') with a normal Agent(...), Browser Use still sends its default agent system prompt for you.

You do not need to add a separate custom "Browser Use system message" just because you switched to the open-source preview model. Only use extend_system_message or override_system_message when you intentionally want to customize the default behavior for your task.

If you want the best default speed/accuracy, we still recommend the newer hosted bu-* models. If you want the open-source preview model, the setup stays the same apart from the model= value.

Can I use custom tools with the agent?

Yes! You can add custom tools to extend the agent's capabilities:

Browser Use

Browser Use

リポジトリ統計

ドキュメント

🤖 LLM クイックスタート

👋 ヒューマンクイックスタート

オープンソース版 vs クラウド版

デモ

📋 フォーム入力

タスク = 「職務経歴書と個人情報を使ってこの求人応募書を記入してください」

🍎 食料品買い物

タスク = 「この買い物リストをインスタカートに追加してください」

💻 パーソナルアシスタント

タスク = 「カスタムPCのパーツを見つけるのを手伝ってください」

💡詳しい例をここで確認 ↗し、スターをください！

🚀 テンプレートクイックスタート

💻 CLI

Claude Code Skill

統合、ホスティング、カスタムツール、MCP、およびその他について、ドキュメント ↗

よくある質問

🤖 LLM Quickstart

👋 Human Quickstart

Open Source vs Cloud

Demos

📋 Form-Filling

Task = "Fill in this job application with my resume and information."

🍎 Grocery-Shopping

Task = "Put this list of items into my instacart."

💻 Personal-Assistant.

Task = "Help me find parts for a custom PC."

💡See more examples here ↗ and give us a star!

🚀 Template Quickstart

💻 CLI

Claude Code Skill

Integrations, hosting, custom tools, MCP, and more on our Docs ↗

FAQ

同じカテゴリの他のツール

Browser Use

Browser Use

リポジトリ統計

ドキュメント

🤖 LLM クイックスタート

👋 ヒューマンクイックスタート

オープンソース版 vs クラウド版

デモ

📋 フォーム入力

タスク = 「職務経歴書と個人情報を使ってこの求人応募書を記入してください」

🍎 食料品買い物

タスク = 「この買い物リストをインスタカートに追加してください」

💻 パーソナルアシスタント

タスク = 「カスタムPCのパーツを見つけるのを手伝ってください」

💡詳しい例をここで確認 ↗し、スターをください！

🚀 テンプレートクイックスタート

💻 CLI

Claude Code Skill

統合、ホスティング、カスタムツール、MCP、およびその他について、ドキュメント ↗

よくある質問

🤖 LLM Quickstart

👋 Human Quickstart

Open Source vs Cloud

Demos

📋 Form-Filling

Task = "Fill in this job application with my resume and information."

🍎 Grocery-Shopping

Task = "Put this list of items into my instacart."

💻 Personal-Assistant.

Task = "Help me find parts for a custom PC."

💡See more examples here ↗ and give us a star!

🚀 Template Quickstart

💻 CLI

Claude Code Skill

Integrations, hosting, custom tools, MCP, and more on our Docs ↗

FAQ

同じカテゴリの他のツール

OpenClaw

AutoGPT

opencode

Langflow

Dify

LangChain