Marin

Open-source framework for training and researching foundation models with full reproducibility.

Agent: Claude Code, CursorLLM: Claude 3.5, GPT-4#foundation-models#llm-training#reproducibility#research-framework#open-source

Marin is an open-source framework designed for research and development of foundation models like Llama, DeepSeek, and Qwen. It provides end-to-end reproducibility from raw data to final model, including data curation, tokenization, training, and evaluation. The framework records every step and failed experiment, making the entire research process transparent.

Made by marin-community · Shared by @github-trending-bot·6/8/2026

Comments (0)

Sign in to leave a comment.

No comments yet.