OpenAdapt.AI

AI-First Process Automation for the Desktop Era

OpenAdapt.AI is an open-source Generative Process Automation platform that transforms desktop automation through machine learning. Record human demonstrations, train vision-language models, and deploy agents that adapt to any software environment.

🎯 Core Capabilities

  • Learn by Demonstration: Automatically learns from user interactions to generate automation scripts
  • Adaptive Intelligence: Adapts to software changes and complex environments using multimodal models
  • Privacy-First: Built-in data protection with PII/PHI detection and redaction
  • Cross-Platform: Works with browsers and desktop applications across macOS and Windows
  • Multi-Model Support: Integrates with OpenAI, Anthropic, Google, Ollama, and vLLM
  • Open Source: Community-driven development with MIT licensing

🏗️ Ecosystem

Core Platform

Repository | Description | Status
OpenAdapt ⭐ | Main platform orchestrating GUI automation with LMMs. Meta-package integrating all ecosystem components | 🟢 Active

Modular Components

Data Capture & Processing

Repository | Description | Status
openadapt-capture | Production-ready event streams with time-aligned media for GUI interaction recording | 🟢 Active
openadapt-viewer | HTML viewer components for visualization dashboards and benchmark results | 🟢 Active
openadapt-web | Web interface components for browser-based interaction | 🟢 Active

Machine Learning & Intelligence

Repository | Description | Status
openadapt-ml | ML training toolkit for multimodal GUI-action models | 🟢 Active
openadapt-grounding | UI element detection and localization with OmniParser integration | 🟢 Active
openadapt-retrieval | Multimodal demo retrieval system for similarity search in GUI automation | 🟢 Active
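
To illustrate the idea behind similarity search over recorded demonstrations, here is a minimal sketch in plain Python. It is not the openadapt-retrieval API: the `Demo` class, the `retrieve` function, and the hand-written embedding vectors are all hypothetical, and a real system would presumably obtain its embeddings from a multimodal model rather than hard-coding them.

```python
import math
from dataclasses import dataclass


@dataclass
class Demo:
    """Hypothetical record: a demo name plus a precomputed embedding."""
    name: str
    embedding: list[float]


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as similar to nothing
    return dot / (norm_a * norm_b)


def retrieve(query: list[float], demos: list[Demo], k: int = 1) -> list[Demo]:
    """Return the k demos whose embeddings are closest to the query."""
    ranked = sorted(
        demos,
        key=lambda d: cosine_similarity(query, d.embedding),
        reverse=True,
    )
    return ranked[:k]


# Made-up three-dimensional embeddings, for illustration only.
demos = [
    Demo("fill_invoice_form", [0.9, 0.1, 0.0]),
    Demo("export_report_pdf", [0.1, 0.8, 0.3]),
    Demo("reset_password", [0.0, 0.2, 0.9]),
]
best = retrieve([0.8, 0.2, 0.1], demos, k=1)
print(best[0].name)  # prints "fill_invoice_form"
```

A linear scan like this is adequate for a handful of demos; larger collections would typically use a vector index instead.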

Evaluation & Quality

Repository | Description | Status
openadapt-evals | Evaluation infrastructure and benchmarks for GUI agent performance testing | 🟢 Active

Privacy & Security

Repository | Description | Status
openadapt-privacy | PII/PHI detection and redaction for GUI automation data with Presidio integration | 🟢 Active
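
As a rough picture of what detection and redaction means in practice, the sketch below replaces two kinds of identifiers with typed placeholders using only the standard library. It deliberately does not use Presidio and does not reflect the openadapt-privacy API. The `PATTERNS` table and the `redact` function are hypothetical, and the two regular expressions are simplistic stand-ins for a real detector.

```python
import re

# Illustrative patterns only; a real detector covers far more cases.
PATTERNS = {
    "EMAIL": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "US_SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}


def redact(text: str) -> str:
    """Replace each match with a placeholder naming the entity type."""
    for label, pattern in PATTERNS.items():
        text = pattern.sub(f"<{label}>", text)
    return text


print(redact("Contact jane.doe@example.com, SSN 123-45-6789."))
# prints "Contact <EMAIL>, SSN <US_SSN>."
```

Typed placeholders keep the redacted text readable while removing the sensitive values themselves.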

Execution & Runtime

Repository | Description | Status
openadapt-agent | Production execution engine with safety gates, audit logging, and human-in-the-loop confirmation | 🟢 Active
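
The combination of safety gates, audit logging, and human-in-the-loop confirmation can be pictured with a small sketch. Everything here is an assumption made for illustration, not the openadapt-agent API: the `RISKY_ACTIONS` set, the `run_with_gate` function, and the dictionary shape of an action are all hypothetical.

```python
import json
import time
from typing import Callable

# Hypothetical action types that need explicit human approval.
RISKY_ACTIONS = {"delete_file", "send_email", "submit_payment"}


def run_with_gate(
    action: dict,
    confirm: Callable[[dict], bool],
    execute: Callable[[dict], object],
    audit_log: list,
) -> bool:
    """Run `action` if it is low-risk or approved; always record the outcome."""
    needs_approval = action["type"] in RISKY_ACTIONS
    approved = confirm(action) if needs_approval else True
    if approved:
        execute(action)
    # One JSON line per decision, whether or not the action ran.
    audit_log.append(json.dumps({
        "time": time.time(),
        "action": action,
        "needs_approval": needs_approval,
        "executed": approved,
    }))
    return approved


audit_log: list = []
executed: list = []
deny = lambda action: False  # stand-in for a human who rejects every prompt

run_with_gate({"type": "click", "target": "OK"}, deny, executed.append, audit_log)
run_with_gate({"type": "delete_file", "target": "report.txt"}, deny, executed.append, audit_log)
print(len(executed), len(audit_log))  # prints "1 2"
```

In an interactive setting, `confirm` could wrap a prompt such as `input()`; passing it in as a callable keeps the gate testable without a human present.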

Observability

Repository | Description | Status
openadapt-telemetry | Error tracking and usage analytics with privacy filtering for platform monitoring | 🟢 Active

Integration & Deployment Tools

Repository | Description | Status
OpenAdapter | Effortless deployment platform for screenshot parsing and action models on AWS EC2 | 🟢 Active
OmniMCP | Model Context Protocol integration with OmniParser for AI UI interaction (68 stars) | 🟢 Active
OmniMCP.web | Web interface for OmniMCP server | 🟢 Active

🤝 Community Contributions

OpenAdapt has contributed deployment infrastructure to major open-source projects:

Microsoft OmniParser

  • PR #52: Add Dockerfile and client.py; deploy to EC2 on AWS via Github Actions
  • Status: Open (most commented PR in the repository)
  • Contribution: Production deployment infrastructure with Docker, a client library, and automated AWS deployment
  • Impact: Enables production deployment of OmniParser for screen parsing tasks

Microsoft Set-of-Mark (SoM)

  • PR #19: Add Dockerfile and client.py; deploy to EC2 on AWS via Github Actions
  • Status: Merged (first PR to the repository)
  • Contribution: deploy.py, Gradio interface, client library, and AWS deployment automation
  • Impact: Provided the foundation for SoM's production deployment capabilities

Research Forks & Foundations

Repository | Description | Status
OpenCUA | Fork of Computer-Use Agents framework that cites OpenAdapt as foundational | 🟡 Fork
OmniParser | Fork of Microsoft's OmniParser with deployment enhancements | 🟡 Fork
SoM | Fork of Set-of-Mark visual prompting with deployment infrastructure | 🟡 Fork

Development Tools & Libraries

Repository | Description
PydanticPrompt | Library for documenting Pydantic models to generate structured LLM outputs

Critical Dependencies

Repository | Description
atomacos | macOS automation library (fork); critical dependency for macOS support
pynput | Input control library (fork); critical dependency for cross-platform input handling

Research & Experimentation

Repository | Description
OmniParser | Fork of Microsoft's OmniParser for screen parsing with OpenAdapt-specific modifications
SoM | Set-of-Mark visual prompting technique for precise UI element targeting

Configuration & Infrastructure

Repository | Description
.github | Organization-wide GitHub configuration, templates, and this profile README

🚀 Quick Start

# Install core platform
pip install openadapt

# Or use uv for faster installation
uv pip install openadapt

# Record a demonstration
openadapt record

# Train a model
openadapt train

# Evaluate performance
openadapt eval

📚 Documentation

🤝 Contributing

We welcome contributions across all repositories:

  • Frontend Development: openadapt-web, openadapt-viewer
  • Machine Learning: openadapt-ml, openadapt-grounding, openadapt-retrieval
  • Data Capture: openadapt-capture
  • Privacy & Security: openadapt-privacy
  • Agent Runtime: openadapt-agent
  • Evaluation: openadapt-evals
  • Documentation: All repositories

See individual repository CONTRIBUTING.md files for specific guidelines.

📊 Project Status

OpenAdapt.AI is currently in alpha. The platform is functional, but some features are still under development. We are actively working towards production-ready releases across the ecosystem.

💼 Enterprise & Consulting

Professional implementation services and organizational consulting are available. Contact info@openadapt.ai for details.

💖 Sponsorship

Support OpenAdapt.AI development through GitHub Sponsors. Your contributions help maintain and expand this open-source ecosystem.

📄 License

All OpenAdapt.AI repositories are licensed under the MIT License unless otherwise specified. See individual repository LICENSE files for details.