Luis Quintanilla

multimodal

A list of content tagged multimodal

Responses

  • Kimi K2.5: Visual Agentic Intelligence
  • MMCTAgent: Enabling multimodal reasoning over large video and image collections
  • Qwen3-Omni
  • Omnineural 4B
  • Ultravox - An open, fast, and extensible multimodal LLM
  • V-JEPA: The next step toward Yann LeCun’s vision of advanced machine intelligence (AMI)
  • OpenAI Sora - Creating video from text

Bookmarks

  • Video models are zero-shot learners and reasoners
  • MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
  • Ferret: Refer and Ground Anything Anywhere at Any Granularity
  • PointLLM: Empowering Large Language Models to Understand Point Clouds