Harbor
-
harbor Public
Harbor is a framework for running agent evaluations and creating and using RL environments.
-
terminal-bench-science Public
Terminal-Bench-Science: Evaluating AI Agents on Complex Real-World Scientific Workflows in the Terminal
harbor-framework/terminal-bench-science’s past year of commit activity -
terminal-bench-3 Public
Measuring agents' ability to get work done on a computer
harbor-framework/terminal-bench-3’s past year of commit activity -
awesome-harbor Public
A curated list of awesome Harbor ecosystem projects
harbor-framework/awesome-harbor’s past year of commit activity