Open-source, vision-first browser agent

<div align="center">
<img src="assets/banner.png" alt="Magnitude Text Logo"/>
</div>

<br/>

<div align="center">
<a href="https://docs.magnitude.run/getting-started/introduction" target="_blank"><img src="https://img.shields.io/badge/📕-Docs-0369a1?style=flat-square&labelColor=0369a1&color=gray" alt="Documentation" /></a>
<img src="https://img.shields.io/badge/License-Apache%202.0-0369a1?style=flat-square&labelColor=0369a1&color=gray" alt="License" />
<a href="https://discord.gg/VcdpMh9tTy" target="_blank"><img src="https://img.shields.io/discord/1305570963206836295?style=flat-square&logo=discord&logoColor=white&label=Discord&labelColor=5865F2&color=gray" alt="Discord" /></a>
<a href="https://x.com/tgrnwld" target="_blank"><img src="https://img.shields.io/badge/-Follow%20Tom!-000000?style=flat-square&labelColor=000000&color=gray&logo=x&logoColor=white" alt="Follow Tom" /></a>
<a href="https://x.com/ndrsrkl" target="_blank"><img src="https://img.shields.io/badge/-Follow%20Anders!-000000?style=flat-square&labelColor=000000&color=gray&logo=x&logoColor=white" alt="Follow Anders" /></a>
</div>

<hr/>

> 🚀 New: Magnitude is state-of-the-art, scoring [94% on WebVoyager](https://github.com/magnitudedev/webvoyager)!

Magnitude uses vision AI to enable you to control your browser with natural language.

- 🧭 **Navigate** - Sees and understands any interface to plan out actions
- 🖱️ **Interact** - Executes precise actions using mouse and keyboard
- 🔍 **Extract** - Intelligently extracts useful structured data
- ✅ **Verify** - Built-in test runner with powerful visual assertions

You can use it to automate tasks on the web, integrate between apps without APIs, extract data, test your web apps, or as a building block for your own browser agents.

![Video showing Magnitude tests running in a terminal and agent taking actions in the browser](assets/readme.gif)

↕️ Magnitude in action!
↕️

```ts
import { z } from 'zod';

// `agent` is assumed to be an already-started Magnitude browser agent

// Magnitude can handle high-level tasks
await agent.act('Create a task', {
    // Optionally pass data that the agent will use where appropriate
    data: {
        title: 'Use Magnitude',
        description: 'Run "npx create-magnitude-app" and follow the instructions',
    },
});

// It can also handle low-level actions
await agent.act('Drag "Use Magnitude" to the top of the in progress column');

// Intelligently extract data based on the DOM content matching a provided zod schema
const tasks = await agent.extract(
    'List in progress tasks',
    z.array(z.object({
        title: z.string(),
        description: z.string(),
        // Agent can extract existing data or new insights
        difficulty: z.number().describe('Rate the difficulty between 1-5')
    })),
);
```

## Get started

### Running your first browser automation

```bash
npx create-magnitude-app
```

This will create a new project and walk you through the steps for setting up Magnitude. It will also create an example script that you can run right away!

### Using the test runner

To install the test runner for use in an **existing** web app, run:

```bash
npm i --save-dev magnitude-test && npx magnitude init
```

This will create a basic tests directory `tests/magnitude` with:
- `magnitude.config.ts`: Magnitude test configuration file
- `example.mag.ts`: An example test file

For information on how to run tests and integrate them into CI/CD, see [here](https://docs.magnitude.run/core-concepts/running-tests).

> [!NOTE]
> Magnitude requires a large **visually grounded** model. We recommend Claude Sonnet 4 for the best performance, but Magnitude is also compatible with Qwen-2.5VL 72B. See [docs](https://docs.magnitude.run/customizing/llm-configuration) for more information.

## Why Magnitude?
❌ **Problem #1:** Most browser agents draw numbered boxes around page elements, which doesn't generalize well to complex modern sites

✅ **Solution: Vision-first architecture**
* Visually grounded LLM specifies pixel coordinates
* True generalization independent of DOM structure
* Future-proof architecture for desktop apps, VMs, etc.

❌ **Problem #2:** Most browser agents follow "high-level prompt + tools = work until done", which works for demos but not for production

✅ **Solution: Controllable & repeatable automation**
* Flexible abstraction levels (granular actions vs. flows)
* Custom actions + prompts at agent and action level
* Deterministic runs via native caching system *(in progress)*

## Additional info

Please see [our docs](https://docs.magnitude.run) for more information on how to best build Magnitude automations and test cases.

## Contact

If you are an enterprise and want more features or support, feel free to reach out to us at [email protected] or schedule a call [here](https://cal.com/tom-greenwald/30min) to discuss your needs.

You can also join our <a href="https://discord.gg/VcdpMh9tTy" target="_blank">Discord community</a> for help or any suggestions!

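The "Why Magnitude?" section lists pixel-coordinate actions and deterministic runs via caching (marked in progress), but does not describe how the caching works. The following is only a rough, hypothetical sketch of the general idea, not Magnitude's implementation: actions planned for a task are stored and replayed on later runs, so the model is only consulted on a cache miss. Every name here (`PixelAction`, `Planner`, `ActionCache`, `stubPlanner`) is an assumption made for illustration, and the planner is a synchronous stub where a real one would call a vision model asynchronously.

```ts
// Hypothetical sketch: these types and names are illustrative, not Magnitude's API.

// A low-level action expressed in pixel coordinates rather than DOM selectors.
type PixelAction =
    | { kind: 'click'; x: number; y: number }
    | { kind: 'type'; text: string };

// Stand-in for the visually grounded LLM: maps a task description to actions.
type Planner = (task: string) => PixelAction[];

class ActionCache {
    private readonly store = new Map<string, PixelAction[]>();
    private readonly plan: Planner;

    constructor(plan: Planner) {
        this.plan = plan;
    }

    // Return cached actions for a task, calling the planner only on a miss.
    resolve(task: string): { actions: PixelAction[]; cached: boolean } {
        const hit = this.store.get(task);
        if (hit !== undefined) {
            return { actions: hit, cached: true };
        }
        const actions = this.plan(task);
        this.store.set(task, actions);
        return { actions, cached: false };
    }
}

// Stub planner that counts how often the (expensive) model would be called.
let plannerCalls = 0;
const stubPlanner: Planner = (task) => {
    plannerCalls += 1;
    return [
        { kind: 'click', x: 120, y: 48 },
        { kind: 'type', text: task },
    ];
};

const cache = new ActionCache(stubPlanner);
const first = cache.resolve('Create a task');
const second = cache.resolve('Create a task');
console.log(first.cached, second.cached, plannerCalls); // prints: false true 1
```

Keying the cache on the task text alone is the simplest possible choice; a practical version would presumably also need to account for the page state, since the same instruction can require different coordinates on a different screen.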