Name: Browser LLMs
Uploaded: 2025-01-03 00:29:48 UTC
Duration: 20 min 25 s
Description: In this episode, we look at leveraging the end user's hardware and WebGPU to run large language models directly in the user's browser.

Browser LLMs

Episode #486 by David Kimura, Dec 8, 2024

Summary

In this episode, we look at leveraging the end user's hardware and WebGPU to run large language models directly in the user's browser.
Tags: rails, artificial intelligence, stimulusjs (20:25)

Chapters

Introduction (0:00)
Picking a model (3:43)
Adding Web LLM (4:55)
Setting up the chat Stimulus controller (5:12)
Setting up the view (8:11)
Clear the local storage (9:53)
Creating the submit functionality (10:39)
Demo (14:18)
Streaming the response (14:46)
Stream Demo (17:36)
Final thoughts (18:19)
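
The "Streaming the response" chapter is about showing the model's reply as it is generated rather than after it completes. As a rough illustration only, and not the code from the episode, the sketch below accumulates streamed pieces of text and reports the running total to a callback, which a Stimulus controller could use to update a target element. The chunk shape (choices[0].delta.content) is an assumption modeled on OpenAI-style streaming responses, and collectStream and fakeStream are hypothetical names; fakeStream stands in for a real model so the example runs without WebGPU.

```typescript
// Assumed chunk shape, modeled on OpenAI-style streaming chat completions.
// The library used in the episode may differ; treat this as illustrative.
interface ChatChunk {
  choices: { delta: { content?: string | null } }[];
}

// Hypothetical helper: append each streamed piece to the running text and
// hand the text so far to onUpdate (for example, to set an element's
// textContent from a Stimulus controller).
async function collectStream(
  chunks: AsyncIterable<ChatChunk>,
  onUpdate: (textSoFar: string) => void,
): Promise<string> {
  let text = "";
  for await (const chunk of chunks) {
    const piece = chunk.choices[0]?.delta?.content ?? "";
    if (piece === "") continue; // skip empty deltas so the UI is not redrawn needlessly
    text += piece;
    onUpdate(text);
  }
  return text;
}

// Hypothetical stand-in for a model: yields fixed pieces as streamed chunks.
async function* fakeStream(pieces: string[]): AsyncGenerator<ChatChunk> {
  for (const piece of pieces) {
    yield { choices: [{ delta: { content: piece } }] };
  }
}
```

In a browser, onUpdate would typically write to the chat output element, and the fake stream would be replaced by whatever async iterable the chosen library returns for a streaming request.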
