UI Automation Tool

Midscene.js

Interact, Query and Assert by Natural Language

Whether you're brainstorming, testing, or automating UI interactions, Midscene.js provides the perfect balance of simplicity and functionality through natural language commands.

Action

Execute a series of actions by describing the steps in natural language using .ai or .aiAction methods.

Query

Extract customized data from the UI using .aiQuery. Simply describe the JSON format you want, and AI will understand the page.

Assert

Perform assertions on the page using .aiAssert with natural language descriptions, making test maintenance easier.

Visual Reports

Get detailed visual reports after each run, with animated replays and step-by-step analysis for better debugging.

Visualized Reports for Better Debugging

Midscene provides comprehensive visual reports after each automation run, making debugging and maintenance more intuitive and efficient.

  • Animated replay of each automation step
  • Interactive playground for prompt adjustment
  • Detailed step-by-step execution analysis
Midscene Visual Report Demo

Simple Yet Powerful

// 👀 type keywords, perform a search
await ai('type "Headphones" in search box, hit Enter');

// 👀 find the items, return in JSON
const items = await aiQuery(
  "{itemTitle: string, price: Number}[], find item in list and corresponding price"
);

console.log("headphones in stock", items);

// 👀 assert by natural language
await aiAssert("There is a category filter on the left");

Multiple Ways to Integrate

YAML Scripts

Write automation scripts in YAML format for a more declarative approach to UI testing.

Chrome Extension

Control desktop Chrome directly through the Midscene extension for quick automation tasks.

Puppeteer Integration

Seamlessly integrate with Puppeteer for powerful browser automation capabilities.

Playwright Support

Use Midscene with Playwright for cross-browser testing and automation.

Community & Resources

About Midscene

Midscene is an open-source UI automation tool that makes testing and interaction with web applications intuitive through natural language. It supports both general-purpose LLMs and specialized open-source models for enhanced performance and privacy.

GitHub

Star us on GitHub and contribute to the project.

View on GitHub

Documentation

Learn more about features and implementation details.

Read the Docs

Twitter

Follow us for the latest updates and features.

@midscene_ai