DeTikZify synthesizes graphics programs for scientific figures from sketches using TikZ.
Instant voice cloning by MIT and MyShell. Audio foundation model.
Fuse ChatTTS with OpenVoice to clone your personalized voice from a 10-second audio clip upload.
Foundational Models for State-of-the-Art Speech and Text Translation.
Orpheus TTS is an open-source system for human-sounding speech synthesis using Llama-3b backbone.
This is MCP server for Claude that gives it terminal control, file system search and diff file editing capabilities.
A curated list of open source GitHub repositories related to ChatGPT and OpenAI API.
A series of AI tools designed to enhance productivity for solo entrepreneurs and small businesses.
Large Language Model in Action is a GitHub repository demonstrating various implementations and applications of large language models.
A GUI Agent application based on UI-TARS that allows you to control your computer using natural language.
RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO & designed for fine-tuning.