Skill issue: stop deploying vision language models, use them with Skills to build e2e vision apps on edge

Sponsor · Engineering track · Confirmed

Day: Day 2 — Session Day 1
Time: 11:40am-12:00pm
Room: Track 2
Track: Vision & OCR

Accessible with the Engineering pass and above.

About this session

With the boom in vision language models, the barrier to entry for building vision apps is much lower, so developers tend to reach for these models right away. However, they are very large and inefficient in production. In this talk, I will go through combining vision language models with Skills to build end-to-end vision apps, from training to deployment, using HF Skills. I will also show the state of the art in small computer vision and multimodal models.

Topics

Coding Agents · Vision (OCR, Screen, Video, Embodied)

Speaker

Merve Noyan

Developer Advocate · Hugging Face