reasoning-ai

Here are 3 public repositories matching this topic...

adeelahmad / mlx-grpo

🧠 Train your own DeepSeek-R1 style reasoning model on Mac! First MLX implementation of GRPO - the breakthrough technique behind R1's o1-matching performance. Build mathematical reasoning AI without expensive RLHF. Apple Silicon optimized. 🚀

ai artificial-intelligence llama thinking mlx mathematical-reasoning apple-silicon multi-step-reasoning llm chain-of-thought rlhf deepseek-r1 grpo reasoning-ai

Updated May 29, 2025
Python

OppaAI / AGi-Test

Star

Project: Amazing GRACe iteration

ai artificial-intelligence ai-chatbot ai-agent generative-ai adaptive-ai ai-voice-chat reasoning-ai cognitive-engine

Updated May 17, 2025
Python

ahmedmhussein111 / mlx-grpo

Star

MLX-GRPO allows you to train your own DeepSeek-R1 models directly on your Mac. This implementation simplifies the process of building advanced reasoning AI, making it accessible for developers. 🐙🌟

ai llama thinking mlx mathematical-reasoning apple-silicon multi-step-reasoning llm chain-of-thought rlhf deepseek-r1 grpo reasoning-ai

Updated Jun 17, 2025
Python

Improve this page

Add a description, image, and links to the reasoning-ai topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the reasoning-ai topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly