Newsletter
Join the Community
Subscribe to our newsletter for the latest news and updates
Streamline the fine-tuning process for multimodal models like PaliGemma 2, Florence-2, and Qwen2.5-VL.
Maestro is a streamlined tool designed to accelerate the fine-tuning of multimodal models, encapsulating best practices from core modules. It handles configuration, data loading, reproducibility, and training loop setup, making it easier for developers to work with popular vision-language models such as Florence-2, PaliGemma 2, and Qwen2.5-VL.