Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
-
Updated
Dec 12, 2025 - Python
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
Open Source Generative Process Automation (i.e. Generative RPA). AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models
OmniMCP uses Microsoft OmniParser and Model Context Protocol (MCP) to provide AI models with rich UI context and powerful interaction capabilities.
AI-powered computer control for automated testing. Factifai uses vision models (Claude, GPT-4o, Gemini) to interact with applications naturally - clicking, typing, and verifying results just like a human would.
AI agent that controls a computer
Cappuccino is an GUI Agent based on desktop screen. It is a Manus-like AI Agent that can be deployed locally.
Docker implementation of the OmniParser screen parsing tool
Effortless Deployment and Integration for SOTA Screenshot Parsing and Action Models
🤖 Control your Gemini computer effortlessly with a unified console for browser automation, desktop management, and intelligent task execution.
Working self-hosted version of Microsoft's OmniParser Image-to-text model, up-to-date with fixed dependency/compatibility issues and Fly GPU deployment.
Placeholder for Omniparser Schemas used by universal-etl-parser
Multi-format data transformation library for Go. Transform between JSON, CSV, EDI, XML, and text using schema-driven configurations.
Computer Use Agent Using ADK
macOS computer control CLI for AI agents & OpenClaw — control any app on your Mac using natural language labels. Powered by OmniParser v2 (YOLOv8 + Florence-2). Click, type, scroll, screenshot & automate anything.
Add a description, image, and links to the omniparser topic page so that developers can more easily learn about it.
To associate your repository with the omniparser topic, visit your repo's landing page and select "manage topics."