Official GitHub repository for SafetyBench, a benchmark to evaluate the safety of large language models (LLMs).
Self-evaluating interview for AI coders.
A customizable framework for efficient large model evaluation and performance benchmarking.
A collection of benchmarks and datasets for evaluating large language models (LLMs).
VideoMind is a Chain-of-LoRA Agent designed for long video reasoning using human-like processes.
A third-party music player providing local services, desktop lyrics, music downloads, and high sound quality.
AI模型接口管理与分发系统,支持多种大模型统一调用,并提供企业和个人使用的分发管理服务。
An AI-powered custom node for ComfyUI designed to enhance workflow automation and provide intelligent assistance.
Code for Segment Any Motion in Videos, enabling motion segmentation in video sequences.
c/ua is a Docker Container for Computer-Use AI Agents, enabling AI agents to control operating systems in virtual containers.
Self-hosted collection of powerful web-based tools for everyday tasks without ads or tracking.
SuperCoder is a coding agent that simplifies development workflows by interpreting natural language commands in the terminal.