Skip to content
#

vision-models

Here are 26 public repositories matching this topic...

A curated collection of resources focused on the Mechanistic Interpretability (MI) of Large Multimodal Models (LMMs). This repository aggregates surveys, blog posts, and research papers that explore how LMMs represent, transform, and align multimodal information internally.

  • Updated Jun 18, 2025
computer-vision-challenge

Enhance your skills in prompt engineering for vision models. Learn to effectively prompt, fine-tune, and track experiments for models like SAM, OWL-ViT, and Stable Diffusion 2.0 to achieve precise image generation, segmentation, and object detection.

  • Updated May 13, 2024
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the vision-models topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the vision-models topic, visit your repo's landing page and select "manage topics."

Learn more